By continuing to browse this site you agree to us using cookies as described in About Cookies
Notice: Wiley Online Library will be unavailable on Saturday 7th Oct from 03.00 EDT / 08:00 BST / 12:30 IST / 15.00 SGT to 08.00 EDT / 13.00 BST / 17:30 IST / 20.00 SGT and Sunday 8th Oct from 03.00 EDT / 08:00 BST / 12:30 IST / 15.00 SGT to 06.00 EDT / 11.00 BST / 15:30 IST / 18.00 SGT for essential maintenance. Apologies for the inconvenience.
 The Denali and Kokoxili strike-slip earthquakes are two of the longest recent intracontinental ruptures. Previous studies report a range of rupture velocities. Here we image these earthquakes by reverse time migration of the intermediate-frequency P wave train recorded by global broadband seismometers. This technique permits a relatively direct measure of rupture velocity (speed and direction) as constrained by the radiated seismic energy, free from restrictive assumptions or rupture speed bounds placed on the solution. We compare our results with published seismic, GPS displacement, and surface slip inversion results. Both ruptures were initially subshear and transitioned over a distance no longer than 40 km to supershear speeds close to the P wave speed of ∼5.6 km/s. We investigate the accuracy of our results with synthetic data and experiment with using different imaging parameters and seismic subnetworks. These tests allow us to rule out the possibility of subshear speeds along the supershear segments. Although we cannot exclude supershear speeds of 4.5–6.5 km/s, our most reliable rupture velocities of ∼5.6 km/s are close to the local P wave speeds. We hypothesize that these intracontinental faults have weak shear strengths or high breakdown slips or crustal rigidities and experience at least moderate slip or slip rate weakening. Our observations and previous published results lead us to speculate that very long, surface-extending faults with general homogeneity in prestress and fault strength, together with smaller adjacent fault segments to provide triggering, may be necessary ingredients for the sub-Rayleigh to supershear rupture speed transition in strike-slip earthquakes.
 On 14 November 2001, the Mw 7.8 Kokoxili (Kunlunshan) earthquake ruptured the Kunlun strike-slip fault in northern Tibet. Twelve months later, on 3 November 2002, the Mw 7.9 Denali fault earthquake ruptured the Denali and Totschunda strike-slip faults in Alaska. Both ruptures initiated on a secondary fault, ruptured toward the east for at least 300 km, and were well recorded at teleseismic distances, providing two similar events with which to investigate the kinematics and dynamics of the rupture process of very long strike-slip earthquakes.
 One of the important quantities in understanding the physics of these long ruptures and the hazards they present to society is the speed with which the rupture front propagates [e.g., Das, 2007]. Strike-slip ruptures are thought to be akin to movement of a mode II crack, in which the rupture direction coincides with the slip direction on one side of the fault. Theoretical results predict that the rupture is allowed to propagate within two steady state speed regimes: below Rayleigh wave speed (subshear) or between the S and P wave speeds (supershear). Burridge  and Andrews  proposed the “BAM model,” which explains how a subshear rupture can transition to supershear. As a subshear crack propagates, a peak in shear stress traveling at the S wave speed can develop ahead of the crack tip such that if the peak in stress is above a critical threshold and the fault is longer than a critical length, supershear can initiate. Otherwise, simple rupture at sub-Rayleigh wave speeds will occur. This model has been supported and expanded upon [e.g., Das and Aki, 1977]. Technological advancements in computers and fracture mechanics experimentation techniques have permitted further investigation of the transition phenomenon [Day, 1982; Rosakis et al., 1999; Madariaga and Olsen, 2000; Rosakis, 2002; Dunham et al., 2003; Xia et al., 2004; Dunham, 2007].
 In order to understand the physical processes involved in these very long ruptures and to relate them to recent modeling studies, the rupture speeds must be constrained better in space and time. Here, we show that the teleseismic P wave train of these earthquakes can be reverse time migrated to unambiguously illuminate the rupture speed and direction along most of the rupture. We find that both earthquakes began as subshear ruptures but at about 100 km into the rupture clearly transitioned to supershear speeds near the local P wave speed, over a transition distance no longer than 40 km, and continued for at least another 200 km. To first order, these results are consistent with those of Robinson et al.  and Vallée et al.  for the Kokoxili rupture. For the Denali rupture, our results extend the localized constraints on supershear speeds of Dunham and Archuleta  and Frankel  to the end of the rupture. Both ruptures are remarkably similar despite the ruptured fault segments' different geologic ages and geometries. We compare our results to those obtained with other techniques, and make inferences about the dynamics of the rupture process for the Denali and Kokoxili earthquakes.
2. Tectonic Setting
2.1. Kunlun Fault, Northern Tibetan Plateau
 The left-lateral Kunlun fault separates the high Tibetan Plateau to the south from the sedimentary basins to the north. Just south of the Kunlun Mountains, this Miocene fault extends east/west for about 1600 km, accommodating about 10–12 mm/a of eastward motion of the Tibetan Plateau (with respect to the basins in the north) for a total horizontal offset of less than 100 km (about a third to a half of the total motion [Peltzer and Saucier, 1996; Van der Woerd et al., 1998, 2000, 2002]). The Qaidam Basin underthrusts the Kunlun Mountains, accommodating about half of the north/south convergence between the Indian subcontinent and Eurasia [Meyer et al., 1998; Tapponnier et al., 2001].
 The 2000-km-long Denali fault is part of a series of parallel faults that accommodate right-lateral to oblique convergence between the North American and Pacific plates. The Denali fault separates the Wrangellia, southern margin, and Yakutat terranes to the south from the Alaska margin to the north [Plafker and Berg, 1994]. The Wrangellia terrane was accreted to the Alaska margin during the middle Cretaceous. The subsequent counterclockwise rotation of Alaska due to the opening of the Canada basin and a more northerly Pacific plate motion led to right-lateral slip along the Denali fault. This slip rate intensified during the early Tertiary when compression between the Eurasian and North American plates led to further counterclockwise rotation and a space problem for northward moving terranes in southeast Alaska [Plafker and Berg, 1994]. Since the Oligocene, right-lateral strike slip continued along the Denali fault with a Holocene slip rate of ∼15 mm/a and a total horizontal offset of ∼400 km [Lanphere, 1978; Nokleberg et al., 1994; Plafker et al., 1994].
 The top of the subducted Pacific plate is located between 80 and 100 km depth beneath the western Denali fault system [Ferris et al., 2003]. The slab does not exist beneath the eastern end of the fault. The Denali fault also demarks a 10 to 15 km step in crustal thickness, with a thickness of 40–45 km to the south, and 30 km to the north [e.g., Brocher et al., 2004; Rossi et al., 2006]. Because of the extensive history of slip and deformation, the Denali fault juxtaposes different rock units along the strike of the fault, but most units are either metasedimentary or metaigneous rocks. There are also a number of glaciers located along the fault to the east of the Totschunda fault, and hydrothermal activity in the region is associated with fractured, granitic plutons [Miller, 1994].
2.3. Seismic Modeling and Imaging
 Two seismic approaches to investigating the rupture dynamics of earthquakes are inverse modeling of waveforms and imaging. A variety of waveform modeling techniques can provide an estimate of the spatiotemporal slip distribution. Most of these techniques require a priori knowledge of at least some fault parameters and the calculation of Green's functions between points along the assumed fault plane and the stations. An inversion is performed with stability constraints to minimize the difference between predicted and observed seismograms. Some issues with these techniques are discussed by Beresnev . For large earthquakes, arguably the biggest issue is that there are many model solutions that fit the data well and inversion results among different groups exhibit only limited agreement.
 There are a number of seismic reflection imaging techniques that have been advanced by the seismic exploration community over the last 20 years. These techniques rely on phase coherence in order to reverse time migrate (or back project) the energy recorded by geophones to impedance contrasts (geologic interfaces) at depth, which is especially important for eliminating diffractions that make distinguishing the edges of impedance contrasts difficult. Reverse time migration, hereinafter referred to as “back projection,” is one of these techniques and does not require many assumptions. Although there are many back projection implementations [e.g., Claerbout, 1971; Stolt, 1978; McMechan, 1983; Baysal et al., 1983; Bleistein, 1987; Sun et al., 2000], the physical concept is the same and can be applied to imaging the seismic energy that is radiated during an earthquake rupture.
Ishii et al. [2005, 2007] used the Japanese High-Sensitivity Seismograph Network (Hi-net) array to image the rupture extent, direction, and velocity of the Sumatra-Andaman Mw 9.2 2004 earthquake. They used short-period records from the bottom of 100 m boreholes in the Japanese Hi-net array, and back projected high-frequency, phase-coherent energy to the source location ∼50° away. The success of the technique was due to the phase coherence across the array, which was a result of the high quality of the recorded direct P waves, the favorable source-to-station geometry, and the relatively small along-rupture changes in the velocity structure to the stations. Walker et al.  used a similar approach with 0.03–0.5 Hz waveforms recorded by global seismic stations to image the rupture details of the Sumatra 2005 Mw 8.6 earthquake. They used a station-weighting scheme that effectively regularized their uneven station distribution. Comparisons with the results from the Hi-net array suggested that, for this earthquake, the global network produced less smearing artifacts and a sharper image. In contrast to the resolved unilateral rupture of the 2004 Mw 9.2 earthquake, the 2005 Mw 8.6 earthquake was bilateral, rupturing to the north and south. Larmat et al.  used a reverse-time wave propagation method on the complete wavefield recorded by global seismic stations to image the 2004 Sumatra earthquake. Their synthetic seismogram approach naturally took into account many of the complexities of the different phases and propagation paths, although the long periods of the Rayleigh waves that dominated their solution limited the resolution they obtained at the source.
 A different approach was taken by Krüger and Ohrnberger . They used the European subnetwork as an array, a curved wavefront assumption, and beam formed a low-frequency band (0.03–0.05 Hz) of the P wave train of the 2004 Sumatra rupture to obtain back azimuth and slowness as a function of time using a European reference point. They then shot P rays in the directions determined by the beam forming, and tracked the intersection of those rays with the hypocentral depth to recover rupture details that were similar to those of Ishii et al. . A similar approach was taken by Spudich and Cranswick  to image the rupture propagation of the 1979 Ms 6.9 Imperial Valley earthquake. They used a short baseline accelerometer array of five stations to image a supershear rupture in the 1.5+ Hz frequency band along a 7-km-long segment of the ∼35-km-long rupture. However, they could only reliably measure vertical slowness, which limited their ability to accurately track the rupture due to the unfavorable source array geometry. Kao and Shan [2004, 2007] applied a related technique that they refer to as “source scanning” to resolve the location of tremor in the Cascadia subduction zone and the ruptures of the San Simeon and Parkfield earthquakes. Their technique is different in that they back project envelope functions and absolute amplitudes using local seismic data. They back projected to a three-dimensional spatial grid that included depth as well as horizontal position. Kao et al.  also used back projection with three-dimensional ray tracing to identify the rupture plane of the 2001 Nisqually, Washington, earthquake. Finally, Allmann and Shearer  implemented the back projection approach to image a secondary high-frequency event during the 2004 Parkfield earthquake. They back projected S waves recorded in local strong motion data to a spatial three-dimensional source grid. Although appropriate for local imaging, resolution with depth is relatively poor at teleseismic distances.
3. Methodology, Data Selection, and Processing
 The back projection methodology that we employ below is a modified version of that used by Walker et al. . We assume that the energy in the P wave train in the ∼0.1–1.0 Hz band originates directly from the rupture, and we use a simple ray theoretical approach that only considers travel times (because we apply automatic gain control “AGC” to the data, we ignore geometrical spreading, directivity and other amplitude effects). A 3-D grid (x, y, t) of possible source points is defined at the hypocentral depth encompassing the source region (Figure 1). For each possible source point and time sample, a travel time curve is defined using the IASP91 velocity model [Kennett and Engdahl, 1991], along which the amplitudes are summed to form the stack S(x, y, t), which is an estimate of the source time function for a source at the associated grid point. Space-time peaks in the S(x, y, t) function represent likely sources of seismic radiation, which, given sufficient resolution, can be used to map the rupture history of the event.
 Velocity heterogeneity along the raypaths between the source region and the stations accounts for time shift deviations of up to ∼5 s from the IASP91 predictions (Figure 1a). Therefore, a modified iterative cross-correlation analysis [Houser et al., 2008] is used on the first several seconds of the P wave train to define a time shift correction (Figure 1b) and polarity correction (Figure 1c), which maximizes S(0,0,t) (at the hypocenter). Subevents with similar focal mechanisms to the first-motion mechanism also coherently stack at later times (Figure 1d). However, if the subevents have mechanisms that are significantly different than that the first-motion mechanism, S(x, y, t) is attenuated unless specific subnetworks of the available stations carefully selected for use.
 We selected all available vertical component P wave trains that were archived by the IRIS Data Management Center. Velocity waveforms were selected for stations within 30–90° from the epicenter, filtered, and resampled to 50 Hz. To improve phase coherence within the P wave train, a five-pole, zero-phase Butterworth band-pass filter was applied in the 0.2–0.9 Hz band for the Kokoxili rupture and 0.1–0.5 Hz band for the Denali rupture. These frequency bands were chosen on the basis of many resolution tests to find the optimum frequency band that maximized the imaging coherence. These filters also attenuated the lower-frequency surface reflections, such as PP, which does not arrive until after 70 s for the closest stations. The PP phase, as well as other phases like PcP, have different move out curves, and will not align along that for the direct P wave (Figure 1). Therefore, this misalignment along with the applied AGC makes their contribution to S(x, y, t) extremely small.
 The cross-correlation analysis used a subset of the stations as defined by a signal-to-noise ratio (SNR) threshold of 12 dB. The resolution of the back projection approach depends upon many factors including the frequency content of the data and the network geometry. We have found that better results are generally obtained for more uniform station distributions and thus we apply the station-weighting scheme used by Walker et al.  to even out the station distribution and prevent a large density of stations in a single region from dominating the solution. For the cross correlation, the weight of each station is 1/n, where n is the number of stations within 300 km. We apply an STA/LTA autopicker [Earle and Shearer, 1994] to pick the first arrival at each station and use the median of the residual between the picks and the IASP91 predicted times as the initial reference time. The signal window was between −3 and 6 s and was allowed to shift by up to ±6 s. The algorithm loops over the starting station, performing five iterations for each trial alignment and calculating the average correlation coefficient ravg. The optimum alignment is that associated with the starting station that yields the best ravg.
 The final master stack, a proxy for the source time function at the hypocenter, has the best SNR of all the traces. To correct for the radially symmetric velocity model (or any potential inaccuracy in the origin time of the earthquake), we apply the autopicker to the master stack to pick the onset of the rupture and provide an additional timing correction tc. We also discard any traces for which the final shifts are greater than 5 s (ttmax) of the predicted arrival time (minus tc). The cross-correlation analysis also provides a polarity correction for each trace. These polarity corrections should be consistent with that predicted by first-motion focal mechanisms. We discuss later for each earthquake how we select a subnetwork of stations on the basis of the first-motion and CMT focal mechanisms so we can better image the rupture as it undergoes along-fault changes in the focal mechanism.
 Prior to back projection, we apply a time-variable gain (automatic gain control) to normalize the amplitudes using a 5-s time window, maximizing our ability to track the rupture velocity along the entire rupture. We only use waveforms for which the cross-correlation coefficient is greater than some threshold rmin. We also used the same station regularization scheme, but with different parameters (wb). These data are then back projected and the resulting stacks S(x, y, t) are decimated in time to a 3 Hz sampling rate. At its full time-space resolution, the resulting S(x, y, t) image contains numerous high-frequency and short-wavelength features that are not reliable measures of source properties. The back projection method involves the constructive and destructive interference of wavefronts from the different stations. The destructive interference of incoherent energy is never complete and obvious artifacts can sometimes be seen that sweep through the image at the apparent speeds of the direct teleseismic P phase of 14 to 22 km/s (i.e., the horizontal speed of the P arrivals at each station). To suppress these artifacts and focus on the highest amplitude and most reliable parts of the image, we apply a space-time smoothing operator to S(x, y, t). We use a scaling speed of 3 km/s to convert time to pseudo-space, and integrate S(x, y, t)2 with a running cube operator of width dc, resulting in Si(x, y, t)2. This operator preserves energy that travels through S(x, y, t) at slow speeds (such as the constructively interfering energy that delineates the rupture), but helps attenuate artifacts that sweep through at the faster apparent speeds of the direct P phase. We then search Si(x, y, t)2 for energetic points that are defined as local maxima (in x, y, t) that are greater than emin of the global maximum. We then sort these local maxima by their amplitude. Starting from the maximum energy points and working down, we discard any points that are within a threshold distance de (using the 3 km/s scaling speed) of all previously accepted points. The final subset of points represents our most reliable estimates of the true locations of sources of radiated seismic energy. Table 1 indicates the back projection parameters used for each earthquake.
Table 1. Back Projection Imaging Parameters
3.1. Kokoxili Mw 7.8 Earthquake
 The 2001 rupture occurred along five fault segments. On the basis of surface mapping [Xu et al., 2002; Lin et al., 2002] and modeling of teleseismic body waveforms [Antolik et al., 2004; Ozacar and Beck, 2004], the earthquake initiated at 35.95°N, 90.54°E on a ∼30-km-long, east striking fault about 40 km south of the Kunlun fault and propagated northeast across an extensional graben to the western Kusai Hu segment of the Kunlun fault. From there it propagated eastward about 85 km to the Hong Shui River gorge and eastern Kusai Hu segment. Propagation continued for another 150 km and branched onto the Kunlun Pass fault, persisting for another 100 km and a total length of ∼400 km. We use the term “main branch” to refer to the rupture along the segments of the Kunlun fault and the Kunlun Pass fault. The average and maximum slip reported by Xu et al.  along the main branch is 2 and 7.6 m, respectively. The maximum slip reported by Lin et al.  was 16 m. The total duration and moment of the rupture was about 120 s and between 4.6 × 1020 and 5.3 × 1020 N m [Antolik et al., 2004; Ozacar and Beck, 2004].
 The first-motion pattern of this rupture can be explained by a pure strike-slip mechanism. However, the first 30 s of this rupture is best modeled as a combination of left-lateral slip propagating to the west, and normal slip propagating to the east-northeast, with an optimum single-mechanism of left-lateral, oblique slip [e.g., Ozacar and Beck, 2004]. The mechanisms found for the +30 s duration of this rupture were mostly left-lateral strike slip, consistent with the Harvard CMT [Antolik et al., 2004; Ozacar and Beck, 2004], which is located close to Kusai Hu Lake.
 The Kokoxili earthquake was well recorded at teleseismic distances, mostly by the European and west Asian seismic stations but also by an IRIS PASSCAL 40-station array in Ethiopia. On the basis of the available data that passed the quality control criteria, we selected a subnetwork of stations to use in the back projection based on the polarities predicted by the first-motion and CMT focal mechanisms (Figures 2a and 2b). The cross-correlation analysis did not identify any polarity corrections that were required to align the data in the −3 to +6 s time window (Figure 2c). We attribute this to the length of the time window that we use in the cross-correlation analysis, which is long enough such that the waveforms are aligning on energy from the bilateral, oblique slip moment release, the mechanism of which does not predict polarity flips (Figure 2b). There were only a handful of other stations to the east of the hypocenter, but many of them did not pass the quality control criteria. Of those that did pass, the polarities were not consistent for nearby stations, and were discarded.
 The 70 stations remaining for analysis are contained entirely within the same quadrant for the Ozacar and Beck  0–30 s moment mechanism, and the Harvard centroid moment mechanism. Consequently, any subevents in the rupture that are consistent with the Harvard CMT should not be attenuated in the back projection due to polarity changes. The seismic data show two distinct pulses of energy in the 0.2–0.9 Hz band that do not correlate with the timing of any standard seismic phases: between 10 to 20 s and 80 to 100 s (Figure 2d). The PP phase is predicted to come in at least 100 s following the direct P phase for almost all of the stations used in the analysis. The pulses of major moment release occur between 0 and 15 s, 65–75 s, and 85–100 s [Antolik et al., 2004; Ozacar and Beck, 2004].
 Back projecting the teleseismic P waves illuminates the first 300–350 km of the rupture well. Figure 3 shows the RMS average of Si(x, y, t)2 over 0–120 s. Reds indicate portions of the rupture that radiated energy that was phase coherent at the seismic stations. Tracking of the energetic points and visualization of time slices (Figure 4 and Movies S1–S4 in auxiliary material) show an initial propagation from the hypocenter to the west and northeast, followed by a better constrained unilateral eastward propagation along the main branch.
 The easternmost part of the rupture is not well imaged. The rupture reaches the Harvard CMT location at about 70 s but then weakens in amplitude and bifurcates into two branches around 85 s: one propagates north and another continues propagating east until 115 s, the point at which coherence is completely lost, presumably due to rupture termination. The northward branch is likely an artifact due to a significant change in the 3-D velocity heterogeneity between one group of stations and the segment of the Kunlun fault east of the Harvard CMT. Evidence for this comes by analyzing a Mw 5.6 aftershock located near the junction between the Kunlun fault and Kunlun Pass fault (Figures 3 and 4). We performed the cross correlation on the aftershock waveforms in the same frequency band as the main shock. The resulting time shifts were visually validated for all stations. Only four of the stations used in the main shock did not record the aftershock, and those stations were not used in the analysis. We back projected the aftershock waveforms using both the aftershock cross-correlation traveltime corrections and the main shock traveltime corrections (Figures S1–S4). Specifically, we obtained a 56% power reduction (34% amplitude reduction) when we use the main shock-determined corrections, suggesting an along-rupture change in the 3-D velocity heterogeneity between the rupture and the stations. The aftershock is still imaged in the correct location, but at a later time the energy diverges along two paths (Figure S2, 9–12 s). This is similar in amplitude and geometry to the bifurcation observed in Figure 4, 90–100 s (Movie S1 provides a better amplitude comparison) and suggests that the energy to the NW of the fault seen during 90–100 s maps back to a location on the fault during 80–90 s. Almost every other aftershock was not strong enough to be detected at most of the stations used in the imaging of the main shock, inhibiting efforts to use several aftershocks to carefully fine tune the teleseismic imaging. However, most of the rupture was imaged well enough to permit tracking of the rupture. Such aftershock corrections have a greater influence in higher frequency imaging. For example, Ishii et al.  performed such corrections to improve imaging the 2004 and 2005 Sumatra earthquakes with 1+ Hz waveforms recorded by the Japanese Hi-net array.
 The rupture speed can be measured by projecting Si(x, y, t)2 onto a line parallel to the rupture. We use the single projection line shown in Figure 3, beginning 40 km west of the hypocenter and extending to the east. The lack of direct overlap with the fault segments is acceptable given our resolution of ∼40 km, which is based on our space-time smoothing operator. Figure 5 shows the RMS average within 50 km of the projection line. The slope in this plot is a direct measure of the rupture propagation speed. Energy traveling at the apparent teleseismic P wave speed of 14–22 km/s is indicative of arrivals that are not interfering constructively from one or more seismic stations located east of the rupture, close to the projection plane. Alternatively, energy with the same slope but mirrored about the y axis is from similar in-plane stations located west of the rupture. Although such energy would have also been partially attenuated by the 3-D integration filter, a minor amount remains in the hypocentral region and near 80–150 km.
 The back projection images and the positions of the energetic points have adequate resolution to unambiguously resolve two rupture speeds: 2.6 km/s between ∼20 and ∼120 km and 5.7 km/s between ∼120 and ∼290 km. The former is not as well constrained between 20 and 100 km because of the bilateral nature of the initial rupture, and the deviation of the strike of the graben fault from the projection line. The transition between the two speeds appears abrupt, but given our 40 km resolution, the transition seems to have occurred over a length no greater than 40 km.
 As discussed above, the trace amplitudes were normalized (AGC) to assist in the illumination of the entire rupture to obtain a more accurate rupture speed estimate. Therefore, the reds in Figure 5 indicate the relative distribution of energy that was phase coherent at the seismic stations. The time slices and animations show that the last 100 km of the rupture was not well imaged, which also explains the lack of energy in Figure 5. However, there is a remarkable lack of energy from 25 to 40 s in Figure 5 that is not associated with mislocated energy in the time slices. This likely reflects a much smaller intensity or duration of energy radiated at the source during this time since the normal faulting mechanism that is mostly associated with this energy [Ozacar and Beck, 2004] would have yielded consistent polarities with those of the first 6 s (Figure 2b).
 Beginning at ∼50 s just west of the Hong Shui River gorge, the rupture coherence increases greatly and appears to have a consistent speed of ∼5.7 km/s. This 5.7 km/s segment correlates in time with the dominant components of the moment rate functions [Antolik et al., 2004; Ozacar and Beck, 2004; Robinson et al., 2006]. A local peak in two of the moment rate functions after 85 s correlates with the bifurcation discussed above.
3.2. Resolution and Uncertainty
 One of the advantages of using earthquake imaging to track the rupture in space and time, without regard for accurate amplitudes, is that it is relatively simple to determine when there is an imaging problem that affects resolution and uncertainty. Specifically, the manifestation of an imaging problem is energy that is mislocated or smeared over some distance, or energy that has separated into high-amplitude lobes that are not located along the rupture (i.e., the rupture is out of focus). The existence of such a problem is perhaps the easiest to determine for strike-slip faults because the fault plane is nearly vertical, and energy mislocation is therefore easy to spot. It is more challenging to evaluate the possibility of energy misplaced to another location along the rupture. In all these cases, visualizing the data in different ways provides the most useful diagnostic. Figure 4 shows a probable resolution issue at about 90–100 s. Figure 5 shows a minor smearing toward the west with an apparent P wave speed. Fortunately these effects are minor and we are still able to accurately track the rupture front to 90 s.
 To further evaluate the theoretical resolution of our images and the stability of our results, we performed a number of synthetic tests using different parameters applied to the same station geometry that recorded the actual earthquakes. Our first test was to determine if we can resolve the difference between a subshear and supershear rupture. We created synthetic point source events (impulses) at the spatial locations of the energy peaks (the peaks in the smoothed S(x, y, t) function) obtained from our imaging analysis. For the synthetic supershear case, we also used the observed times of the energy peaks. For the subshear case, we assumed a rupture speed of 2.6 km/s for the entire rupture and adjusted the times of the input points accordingly. The source amplitudes were assigned to the observed peak amplitudes. The point sources were then filtered with the same 0.2–0.9 Hz filter used for the real data. In other words, we created synthetic ruptures of two different rupture speeds assuming that each rupture gives rise to a series of 0.2–0.9 Hz filtered impulses that propagate outward in all directions to the same stations used in the imaging.
 Applying identical imaging parameters to those used previously, we find that the pulses of energy are correctly resolved, and slightly smeared along a negative rupture speed slope, reflecting the influence of the European stations to the northwest of the Kunlun fault (Figure 6). Note that the shape of the individual red clusters in Figure 6 is an estimate of the theoretical resolution kernel for a single point source. These energy peaks are not circular because of the nonuniform station coverage; they tend to be elongated in the direction of the bulk of the stations. Note that in this case the station geometry does not promote the smearing of energy in the direction of propagation, which could result in an upward biased rupture speed estimate. The energetic points that are recovered are midway between the input energetic points. Additional tests showed that these energetic points are slightly mislocated due to smearing associated with the 3-D integration operator. Because this mislocation does not affect the rupture speed, we can clearly distinguish between the two cases, and recover the same input rupture speeds.
 We performed a number of other tests using different frequency bands, imaging parameters, and station networks. For example, we removed the restriction to include stations to the east of the hypocenter that were also within the consistent polarities region of the two focal spheres in Figure 2b, but that degraded the results. We also flipped the polarities of stations in a controlled fashion at specific times in the rupture, and found that we lose coherence at those points in the rupture.
 We also studied the effect of the back projection station weighting radius wb. Values less than and greater than 500 km led to more smearing in the northeast direction (toward Europe). This is as expected, as too large or too small a radius will tend to weigh all stations equally. We determined by trial and error the optimum value of 500 km, which produces the least smearing of the resolution kernel, and which is a function of the source-station geometry, frequency content, and the degree of heterogeneity in station density.
 We also experimented with changing the scaling speed of 3 km/s used to smooth S(x, y, t)2 in space-time and obtained slightly different results for different speeds. Removal of the filter yielded images that were higher in frequency, but did not greatly affect our overall results. This test along with other tests involving the station weighting radius, dc, and de indicate that we cannot rule out supershear speeds anywhere between 4.7 and 6.5 km/s, although 5.7 km/s is the most reliable estimate.
 Last, we also tried to back project traces from the IRIS PASSCAL Ethiopian array, which had an aperture of about 500 km. We used higher frequency ranges of 0.5–4.0 Hz and 0.8–4.0 Hz. The trace alignment obtained was excellent, but the source-station geometry was too poor or changes in the along-rupture 3-D velocity heterogeneity between the rupture and the array were too great to unambiguously resolve any features of the rupture.
3.3. Denali Mw 7.9 Earthquake
 The 2002 Denali rupture occurred along three fault segments [Eberhart-Phillips et al., 2003]. From surface offset mapping, the Denali rupture initiated at 63.52°N, 147.44°W on the previously unrecognized Susitna Glacier reverse fault and propagated west for about 48 km [Haeussler et al., 2004]. At some point during this time, rupture initiated on the McKinley strand of the Denali fault system and proceeded eastward 70 km to the main segment of the Denali fault. This is also the location of the Trans-Alaska Pipeline, where a strong motion station, PS10, recorded the ground motion [Ellsworth et al., 2004]. From this point, the rupture continued to the east another 156 km, stepped across a ∼20-km-long transfer zone, and propagated southeast along the Totschunda fault for another 66 km, yielding a total surface rupture length of 341 km [Eberhart-Phillips et al., 2003; Haeussler et al., 2004].
 In contrast to the Kokoxili earthquake, the Denali fault earthquake was well recorded at local, regional, and teleseismic distances. There are a total of 242 station traces that pass the quality control measures previously discussed. However, the final seismic recordings that are back projected are sorted into two subnetworks. The rupture initiation is well represented by the first-motion mechanism [Ozacar and Beck, 2004], and we find that our cross-correlation analysis on the 0.1–0.5 Hz data only flags a few of these stations for polarity flips (Figures 7a and 7b). However, the rest of the rupture is not similar to the first 35 s. We therefore adopt the moment tensor solution of Ozacar and Beck  for the 35+ s of the rupture, which is derived by “pulse stripping” the first 35 s from the data, then modeling the rest for a single-event model. We then sort stations into two subnetworks depending on their location with respect to the two mechanisms. In each subnetwork, there are no changes in predicted polarities between stations for each focal mechanism (Figure 2b). However, if we simply back projected the entire network of stations, there would be significant attenuation along most of the rupture due to destructive interference. Subnet 1 and 2 contain 102 and 140 stations, respectively.
 As has been documented already, there is remarkable directivity in the 0.1–0.5 Hz band [Eberhart-Phillips et al., 2003; Frankel, 2004] (Figure 7d). The first 10 s of the rupture contains the greatest amount of P wave energy. This may in part be due to the mechanisms; the reverse-faulting mechanism predicts high amplitudes in the initial part of the rupture, but the stations are closer to the P nodal planes for the Ozacar and Beck 35+ s subevent mechanism. The only visibly coherent energy in the time series after 35 s is for stations to the southeast of the rupture around 50 s.
 The optimum back projection of these data comes from processing subnet 1 and 2 separately and averaging the two Si(x, y, t)2. The map view RMS averages for 0–100 s are shown in Figure 8. Both subnetworks illuminate the entire fault rupture, and the energetic points line up approximately along the ruptured fault as expected. However, subnet 1 images the hypocenter and the first 200 km of the Denali rupture better than subnet 2, whereas subnet 2 images the far end of the rupture better. The average (also shown in Figure 9) yields a more continuous distribution of energy along the rupture is our preferred result with which to measure rupture velocity. However, both subnetworks resolve more coherent P wave energy at the hypocenter than anywhere else along the rupture.
 The time slices and animations show good coherence out to 90 s, indicating that the rupture is fairly well imaged out to its termination on the Totschunda fault (Figure 9). The rupture starts on the Sustina Glacier fault and continues toward the west for the first 20 s. It then propagates east along the McKinley strand of the Denali fault system to the junction with the Denali fault at PS10 at about 40 s, which is consistent with the first arrival and fault-normal arrival of the rupture front at PS10 (37 and 40 s [Ellsworth et al., 2004]). An increase in coherence begins at about 50 s as the rupture propagates along the Denali fault to the Totschunda fault junction. From the junction until the end of the rupture at 90 s, the resolution is not adequate to differentiate between rupture on the Denali or Totschunda faults. As was done by Ishii et al. , one could use aftershocks to correct for the apparently minor effect of along-rupture changes in 3-D source-station velocity heterogeneity, but almost all aftershocks were too small to be well recorded at the stations used in the teleseismic imaging of the main shock.
 The images of the rupture across the fault as a function of time also show the resolution capabilities of each subnetwork (Figure 10). Subnet 1 resolves the initial rupture to the west along the Sustina Glacier fault at a horizontal speed of ∼2.5 km/s. About 20 s later, rupture begins to propagate to the east along the Denali fault at an estimated rate of 2.9 km/s to the Harvard CMT location (km 130), where the rupture accelerates to 4.5 km/s. Subnet 2 (mostly North American stations) is associated with more smearing than subnet 1 because of the more limited distribution of stations. The smearing is in the direction of propagation at the apparent P wave speeds. Regardless of this smearing, energetic points and centers of significant energy suggest 2.7 km/s for the first 60 km of eastward propagation, and 4.9 km/s for propagation along the Totschunda fault. However, we consider the images using the average Si(x, y, t)2 to be the most reliable. These results suggest 3.3 km/s out to km 130 and 5.5 km/s out to a distance of 330 km.
3.4. Resolution and Uncertainty
 In a similar fashion to the Kokoxili rupture, we performed synthetic and other tests to determine the robustness of our rupture speed estimates (Figure 11). For the subshear case, we chose a rupture speed of 3.3 km/s for the entire rupture, consistent with the first part of the rupture in our preferred image (Figure 10). For the supershear case, we again used the energetic points from Figure 11 as the input subevents. The synthetic tests show that we are able to clearly discriminate between the subshear and supershear cases, and that we recover both the amplitudes and positions of the input energetic points better than for the Kokoxili rupture. This improved resolution is due mostly to the improved station coverage, which also allowed us to use a smaller 3-D integration operator (20 km versus 40 km for the Kokoxili rupture).
 Our imaging suggests that the strike-slip motion on the Denali fault system began about 20 s following rupture initiation. Ozacar and Beck  inverted for moment distribution on both the reverse-faulting subevent and the strike-slip component of the main rupture. Instead of using two subnetworks independently and adding their respective Si(x, y, t)2, we tried to use the entire network by flipping the polarity of the traces from subnet 2 after 30 s such that they should be in phase along the main part of the rupture during the back projection. The images were significantly degraded. Specifically, the rupture was discontinuous and lower in power than that in Figure 10. We also back projected all the waveforms without flipping the polarity of subnet 2 and energy was imaged better in some locations along the rupture, but the overall rupture image was worse. The result of our tests indicated that for the supershear segment that we image, 5.5 km/s is our most reliable estimate, although as with the Kokoxili earthquake, we cannot exclude speeds between roughly 4.5–6.5 km/s.
4.1. Poorly Constrained Parts of the Ruptures
 The first three quarters of the Kokoxili earthquake are fairly well imaged with the network of stations to the west of the rupture. However, Si(x, y, t)2 loses some coherence east of Kusai Lake where the energy splits into two branches, one that moves ∼100 km northward, and the other continuing east along the Kunlun Pass fault (Figures 3 and 4 and the auxiliary material). The probable explanation for this is a significant change in the 3-D velocity heterogeneity along source-receiver raypaths for the Eurasian or African stations. For this model, simple geometrical relationships indicate that a 5 s time shift for either group of stations is needed to create this bifurcation. This is consistent with a comparison of the main shock traveltime corrections with those obtained by cross-correlation analysis of waveforms from a Mw 5.6 aftershock near the Kunlun Pass junction. The analysis shows that about a quarter of the stations (all from the Ethiopia-Kenya region) have a difference of 3 to 7 s in their respective optimum travel time correction to account for 3-D velocity heterogeneity.
 Another possibility is that the intensity of radiated energy may have quickly decayed along the rupture at 90–100 s such that waveform coherence at the stations is generally lost and energy that was correctly located earlier in time still dominates. If that is the case, the energy in the 100–110 s time frame is likely correctly located, which would indicate that we image almost the entire ∼400-km-long rupture (Figures 4 and 5). Dunham et al.  performed a series of numerical experiments and found that stronger along-fault barriers can focus the rupture front, leading to an order of magnitude increase in slip speed and radiated seismic energy. This may be a physical explanation for the high burst of energy observed just before the dramatic loss of coherence. Another possible physical mechanism for the reduction in seismic radiation intensity may be the transition onto the Kunlun Pass fault, which has a different geometry than the long and straight Kunlun fault [Robinson et al., 2006; Das, 2007] and may have had a different strength or preslip stress.
 The Denali rupture was fairly well imaged with two subnetworks instead of using the entire network (with or without polarity flips), with or without polarity flips to adjust for the two different primary source mechanisms. Although there is a marked decrease Si(x, y, t)2 along the length of the rupture, the power levels remain significantly above the noise level. These two observations mean that either the waveform coherence of the rupture was better in some directions than in others, or the 3-D velocity heterogeneity time shift corrections obtained by the cross-correlation analysis for the hypocenter-station raypaths do not adequately correct for 3-D velocity heterogeneity along the entire rupture. We suggest that the primary mechanism is a lack of global waveform coherence. First, there is already ample evidence for strong directivity associated with this rupture [e.g., Eberhart-Phillips et al., 2003] (Figure 7d). Second, the strike-slip part of the Denali rupture was not along a straight fault, but along a series of faults that gave rise to a 60 degree rotation in strike, which is likely to produce changing directivity and focal mechanism effects along the rupture. For example, a coherent waveform packet is observed for many of the southeast stations at about 50 s that is not coherent elsewhere (Figure 7d). Third, time animations do not show evidence of defocusing, as there is generally a single center of high power along the rupture until about 90 s on the Totschunda fault. Finally, when subnets 1 and 2 are back projected independently, which generally reduces the possibility and impact of 3-D velocity heterogeneity changes along the entire rupture, only distinctly different parts of the rupture are illuminated, suggesting that only these rupture segments radiated coherent energy to their respective subnetworks.
4.2. Comparison to Previous Work
 The resolution tests and other experiments that we performed suggest that we can measure rupture speeds on the supershear segments of the Kokoxili and Denali faults accurately enough to exclude the possibility of subshear speeds. The range of speeds for both cases is roughly 4.5 to 6.5 km/s, with the most reliable values being 5.7 and 5.5 km/s for the Kokoxili and Denali ruptures, respectively.
 There have been a number of rupture speeds reported for the Kokoxili rupture (Table 2). Most of these speeds range from 2.0 to 4.0 km/s and were derived by inverse modeling the seismic moment distribution using long- to intermediate-period seismic data. Reports of supershear speeds with uncertainties that probably exclude subshear speeds are from Bouchon and Vallée , Robinson et al. , and Vallée et al. . Bouchon and Vallée analyzed surface waves recorded by regional broadband stations. For four 100-km segments of the fault, they found rupture speeds of 2.4, 5.0, 4.8, and 5.0 km/s. These values are not precisely constrained, especially the last segment of the rupture. Robinson et al. inverted teleseismic SH waveforms to track the rupture front. They found a rupture speed of 3.3 km/s during the first 120 km, and a higher speed of 6.7 km/s between 120 and 270 km where we find a speed of 5.7 km/s supershear speed (Figure 5). A rupture speed closer to 5.7 km/s is also consistent with the results of Robinson et al. if the interpreted segment is extended to 350 km. After 350 km, the rupture front they find was poorly resolved and not reliably trackable, similar to what we observe at about 300 km (Figure 4). Vallée et al.  also find evidence for supershear rupture speeds close to the P wave speed by analyzing seismic data from the regional Nepal broadband array.
Table 2. Review of Rupture Speeds Estimated for the Kokoxili and Denali Rupturesa
Rupture Speed (km/s)
Distance and time are with respect to the hypocentral parameters. The multiple time window inversions were some variation of the Hartzell and Heaton  technique. The pulse-stripping inversion was based on the technique by Kikuchi and Kanamori . Unambiguous supershear speeds are indicated in bold and are discussed in the text.
Multiple window inversion of regional seismic data
 There have also been a number of rupture speeds in the 3.0 to 4.0 km/s range reported for the Denali rupture (Table 2). A supershear speed far from the subshear speed regime is reported by Frankel . He analyzed local and regional 0.02–0.5 Hz broadband waveforms, and high-frequency envelopes recorded by strong motion stations. He inverted the broadband waveforms for the spatiotemporal moment distribution. Although he constrained his moment solution to occur along the fault within 14 s of two 3.5 km/s rupture speed lines, some key features in his solution match our imaging results. Leading up to PS10, the centroids of 3 s back projection time slices show that the energy in the first part of the rupture tracks Frankel's results. Starting at km 130, Frankel fits a line with a 5 km/s slope along the leading edge of his moment distribution and extending to km 250 where it intersects the model solution lower boundary. Our images agree with this, and provide a means of extending Frankel's interpretation farther east. Asano et al.  inverted local 0.1–0.5 Hz seismic and GPS-measured static displacements using a multiple time window, linear, kinematic waveform inversion. The positions of their moment centers do not agree with our imaging results, or those of Frankel  (Figure 10).
 Our results suggest supershear speeds along at least 200 km of the Kokoxili and Denali ruptures. There have been other reports of supershear or near-supershear rupture speeds, but they are for smaller earthquakes. The 1979 Imperial Valley earthquake in California was reported to have an average rupture speed of 2.5–3 km/s [Hartzell and Heaton, 1983], with a short patch of supershear rupture (4–5 km/s [Olsen and Apsel, 1982; Archuleta, 1984; Spudich and Cranswick, 1984]). The 1992 Landers earthquake appeared to have an average rupture speed of 2.7–3.0 km/s on the basis of strong motion seismic data and geodetic data from GPS and radar interferometry [Hernandez et al., 1999] and 5 km/s on the basis of a finite difference simulation [Olsen et al., 1997]. For the 1999 Izmit earthquake, Sekiguchi and Iwata  modeled strong motion data recorded by ten stations and inferred a rupture speed of 5.8 km/s along the middle third of the rupture. S-P times from two strong motion stations were interpreted by Bouchon et al. [2000, 2001] in terms of an average eastward supershear rupture of 4.7–4.9 km/s and 4.3 km/s for the subsequent Duzce earthquake.
 If seismic moment and seismic energy release are proportional during an earthquake, one should expect agreement between rupture speeds estimated by back projection imaging and seismic moment inversions. We generally do not find this agreement (Table 2) because the majority of the finite slip models do not exhibit supershear rupture speeds. On the other hand, the high rupture speeds obtained by the moment inversions of Frankel  (5.0 km/s) for the Denali rupture and Sekiguchi and Iwata  (5.8 km/s) for the Izmit rupture suggest that relaxing certain solution constraints, or using constraints directly from back projection imaging, may be necessary to yield consistent results between the two approaches for very long strike-slip earthquakes.
 If seismic moment and seismic energy release are correlated, seismic energy and global phase coherence are correlated, and no imaging problems exist, one would also expect the moment distributions in time and space from other techniques to correlate with the distribution of Si(x, y, t)2 along the fault. The spatial moment density estimated by Hreinsdóttir et al.  (Figure 10) is similar to that by Frankel  and Asano et al.  for the Denali rupture. Although the spatial moment distribution does not correlate with most features in S(x, y, t), the temporal moment rate function [Ozacar and Beck, 2004] does correlate to some degree, especially with the results using subnet 1. The same is true for the Kokoxili rupture (Figure 5). This may also suggest that the rupture speeds in the moment inversions were controlled too tightly.
 Inverting 1–10 Hz waveform envelopes for the source factor relative to the M 6.7 Nenana Mountain earthquake (Denali preshock) assuming a 3.0 km/s rupture speed yields a curve that matches the 0.2–0.9 Hz back projection power distribution for subnet 1 (Figure 10). The distribution for subnet 2 may not match because the stations used by Frankel span most of the azimuthal range, except to the east and southeast where subnet 2 is located. This also supports the hypothesis that the two subnetworks image different parts of the rupture because of variations in regional phase coherence. However, these comparisons are also limited in general by the fact that the frequency distributions are different and the rupture speed that we image appears to change along the fault. Frankel also tried a single 3.5 km/s rupture speed, but that produced an envelope at PS10 that did not fit the data as well.
4.3. Implications for Material Properties
 The imaged transitions from subshear to supershear rupture speeds that approach the local P wave speed are consistent with that observed in laboratory experiments on the dynamics of mode II shear crack growth. These transitions relate to the mechanical strength of the fault. Rosakis et al.  made laboratory measurements of crack growth along a preexisting plane separating two bonded plates of Homalite, a linearly elastic polyester resin that is representative of a class of materials that experience brittle fracture and stress-induced birefringence. Their primary observations confirmed certain predictions of the Burridge-Andrews propagation model (BAM), including the stability range of supershear rupture propagation between Vs and Vp, where Vs and Vp are the S and P wave speeds [Andrews, 1976; Burridge et al., 1979]. Rupture propagation in the Homalite experiments began close to the Rayleigh wave speed (VR), accelerated to a supershear speed close to Vp, and then decelerated to Vs for the remainder of the rupture.
 There were a few observed inconsistencies with the BAM model, including the finite structure of the Mach waves, the formation of secondary tensile microfractures trailing the shear crack tip, and issues concerning the stability and speed of the crack tip. Rosakis  and Samudrala et al.  explained these observations with a semi-infinite crack model where rupture propagation depends on (1) a slip rate weakening cohesive zone of a characteristic distance ahead of the rupture tip and (2) a constant crack tip sliding displacement criterion. The slip rate weakening parameter, β, is defined as a linear rate of change of the steady state coefficient of sliding friction with slip rate [Ruina, 1983; Rice and Ruina, 1983; Kilgore et al., 1993; Kato and Hirasawa, 1996]. The dependence of friction on the local “state” was ignored. The model states that a point along the locked fault undergoes an instantaneous jump in slip rate (from zero) as the rupture front passes. The dynamic strength of the fault reduces by an amount proportional to the slip rate. The static fault strength returns as the slip rate then decays behind the rupture front. These stability results have also been found by Burridge et al.  and Freund  using a slip-weakening law.
 On the basis of past theoretical and numerical work and their recent lab results, Rosakis  and Samudrala et al.  suggest relationships between the critical far-field shear stress required to sustain dynamic crack growth, the velocity weakening parameter, and the shear strength of the interface. These relationships can be interpreted as predictors of regions of stable rupture speeds. We can therefore relate our rupture speed observations to these stability regions and make inferences as to the mechanical properties of the ruptured faults. Zhao and Zeng  used P and S wave arrivals from local Tibetan Plateau earthquakes to find an average P and S wave upper crustal speed of 5.6 and 3.3 km/s. Jin and Herrin  inverted surface waves along the western Denali fault from North Pacific earthquakes to derive an average P and S wave upper crustal speed of about 5.8 and 3.4 km/s. We have plotted some of the Rosakis results in Figure 12, using crustal seismic speed values of Vp = 5.8 km/s and Vs = 3.4 km/s. The y axis is the critical far-field shear stress (required to satisfy the critical crack tip sliding displacement criterion) acting on the interface some distance D ahead of the cohesive zone (where the solution with no cohesive zone applies) normalized by the critical far-field shear stress required for quasi-static crack propagation (σD)c dyn/(σD)c0. The variation of the shear strength parameter of the preslip interface
where τ0 is the bond strength (shear strength of the crack plane), μ is the shear modulus, and δtc is the critical crack tip sliding displacement (breakdown slip), is graphed in Figure 12a. Curves with negative slopes represent unstable conditions; a rupture is likely to accelerate to higher speeds because propagation at those speeds requires less available energy to sustain the rupture (sometimes called “fracture energy,” “surface energy,” or “breakdown work”). Specifically, for a critical stress ratio less than 1 (y axis), the initial rupture speed is subshear and unstable. If there is an instability along the fault plane (e.g., variation in preslip stress or strength), the rupture can accelerate to stability at VR. If the critical stress ratio intersects one of the curves in the supershear domain, the acceleration can jump past the forbidden region (between VR and Vs, where propagation is physically unrealistic based on energy considerations [Broberg, 1964]) to a supershear speed, and accelerate under additional instabilities to the speed corresponding to the minimum between Vs and Vp. If the critical stress ratio is greater than 1, the rupture will initiate “spontaneously” at an unstable supershear speed between Vs and Vs, and if perturbed, can also accelerate to the stable minimum.
 This cohesive zone theory predicts that stable supershear rupture speed will vary as a function of both Z, the strength parameter, and β, the slip rate weakening parameter. This is shown by the shifts in the minima locations for the curves plotted in Figures 12a and 12b. Regardless of how the supershear rupture begins, faults with relatively large Z (strong faults, faults with low breakdown slips, or low rigidity) promote rupture speeds closer to Vs, whereas faults with smaller Z may yield stable rupture speeds closer to Vp. The strength parameter Z does not influence the rupture speed for subshear speeds. The slip rate weakening parameter β influences the stable rupture speed regimes for both the subshear and supershear cases (Figure 12b). Greater velocity weakening (smaller β) is associated with greater instability for both domains. For low-Z faults in the supershear domain, β ≤ −0.4 promotes rupture speeds near Vp. For high-Z faults in the supershear domain, the value of β only marginally influences the stability regime centered on Vs. As mentioned earlier, the above stability results have also been found by Burridge et al.  and Freund  using a slip-weakening law.
 In the bonded Homalite experiments, Rosakis et al.  obtained a stable rupture just above Vs and estimated β = −0.4. They used a ratio of shear modulus to bond strength of 136, which corresponds to an exceptionally strong fault strength of 200 MPa in an equivalent crustal fault system [Rosakis, 2002]. Consequently, for crustal earthquakes, stable rupture speeds close to the crustal P wave speed suggest both a weaker fault and a slip rate weakening parameter β ≤ −0.4.
 Our estimated Kokoxili and Denali supershear rupture speeds (5.7 and 5.5 km/s) do not rule out Vs speeds (i.e., 4.7 to 4.8 km/s; see resolution discussions above), but they correlate well with Vp, suggesting that the ruptures occurred along fault segments with relatively small Z values (weak faults) and significant slip or slip rate weakening. This is consistent with the high ratio of radiated energy to seismic moment reported for both earthquakes (Kokoxili 6 × 10–5, Denali 4.5 × 10–5) as well as scaling relations applied to well-defined global strike-slip ruptures [Ozacar and Beck, 2004; Antolik et al., 2004]. The fracture energy per unit area (energy absorbed by the crack tip to allow the rupture to advance, sometimes referred to as “surface energy” or “breakdown work”) has been estimated for the Kokoxili and Denali ruptures as 0.2–9.4 and 10–20 MJ/m2 [Antolik et al., 2004]. The low fracture energy for the Kokoxili rupture may be consistent with high rupture speeds, but some recent estimates of fracture energy for several M 5.6–7.2 earthquakes with presumably subshear rupture speeds yielded a wide range of fracture energies between 0.4 and 20 MJ/m2 [Tinti et al., 2005].
 The discussion above is mostly in the context of theoretical, numerical, and laboratory results using an approach by Samudrala et al.  and Rosakis . A more common approach involves the S parameter, introduced by Hamano , defined as
where τs is the critical static frictional shear stress, τd is the dynamic frictional shear stress (due to weakening), and τ1 is the preslip stress on the fault. The numerator is the stress increase required to initiate slip and the denominator is the stress drop. Burridge  analytically determined for 2-D unbounded cracks that S = 1.63 is a critical value beneath which a slip-weakening rupture will accelerate from sub-Rayleigh to supershear speeds as the crack length grows. This value was later corrected to 1.77 [Andrews, 1985]. Dunham  performed the same analysis for a 3-D unbounded elliptical crack and found a critical value of S = 1.19. Higher values of S will always lead to an initial sub-Rayleigh rupture speed that accelerates to VR as the crack grows. Higher values of the critical stress ratio (y axis in Figure 12a) lead to smaller S values (Figure 12a); however, the exact relationship between the two is undefined because the semi-infinite crack in the Samudrala et al.  model undergoes no stress drop as the stress across the crack is equal to that infinitely far ahead of the rupture (E. Dunham, personal communication, 2008). However, both ruptures extended to the surface. Recent work has shown that a strike-slip rupture bounded by a free surface can initiate, at the surface, a secondary rupture front that propagates at a near-sonic rupture speed [Chen and Zhang, 2006; Zhang and Chen, 2006]. Chen and Zhang  and Zhang and Chen  performed a number of 3-D numerical rupture experiments by extending the boundary integral equation method to a half-space model and found that the critical value of S can be as small as 1.0 for such strike-slip ruptures. Therefore, our imaging results suggest that S ≤ 1.0 for the main parts of both ruptures.
where for our purposes, W is the half depth of the fault, Dc is the critical slip-weakening distance, and the other parameters are defined above. Dunham  has expressed κ in terms of S as
where l(S) is the ratio of the friction length (a length scale that depends on the friction law parameters governing the nucleation process) to the transition length (the length the subshear rupture must propagate before transitioning to a supershear rupture speed). Madariaga and Olsen  report that depending on the fault model type, the critical value of κ is 0.7 to 0.8. For smaller values, the rupture promptly arrests. Slightly larger values lead to rupture speed acceleration to VR. For even larger values, the rupture starts at sub-Rayleigh speeds and accelerates to supershear speeds. The imaging results suggest that κ ≥ 0.9 for the main parts of the ruptures.
4.4. Transition Zones
 The Kokoxili rupture transitioned from 2.6 to 5.7 km/s over a distance no greater than ∼40 km just east of the Hong Shui River gorge (Figures 3, 5, and 12), which also marks a change in the surface slip distribution from simple in the west to complex in the east for 70 km, past Kusai Hu Lake, to the junction point with the Kunlun Pass fault [Klinger et al., 2005; Li et al., 2005]. This complexity may be associated with the supershear rupture, since supershear propagation is associated with localized, amplified shear stresses just ahead of the rupture tip and possibly the opening of mode I cracks subparallel to the fault [e.g., Rosakis, 2002; Bhat et al., 2007].
 From our images, we also estimate that the Denali rupture transitioned from 3.3 km/s to 5.5 km/s over a distance of no greater than ∼40 km. This transition begins at the intersection of the McKinley and main strand of the Denali fault system. This is also the location of strong motion station PS10, the station for which Ellsworth et al.  and Dunham and Archuleta  analyzed and modeled anomalously high-amplitude fault-perpendicular and fault-normal motions, suggesting that an asperity of higher initial shear stress, with a minimum width of 35 km, propelled the rupture from ∼2.2 to ∼5.4 km/s beginning about 30 km before PS10.
 It has recently been proposed on the basis of laboratory experiments of rupture dynamics that the length of transition from subshear to supershear speeds is on the order of tens of kilometers for crustal earthquakes [Xia et al., 2004]. Our results suggest that the transition for both ruptures occurred over a length no longer than ∼40 km. This is also generally consistent with the results of Bouchon and Vallée  and Robinson et al. .
 Numerical modeling results of Andrews  show that there is a gradual reduction in the width of the rupture front as the rupture prepares to transition from subshear to supershear speeds. This may explain why for both the Kokoxili and Denali ruptures, we do not image well the rupture just before the transition zone. Andrews' results also show that the rupture front has two branches during the transition, creating an effective rupture front that is much wider, and that following the transition, the supershear rupture front that remains also becomes gradually narrower with time. These qualitative predictions are also consistent with our imaging results.
4.5. Comparisons and Speculations on the Rupture Process
 The kinematics of the Kokoxili and Denali fault earthquake ruptures are similar. Both ruptures were triggered by smaller ruptures on adjacent faults, initial rupture speeds were stable and close to the Rayleigh wave speed, and a transition occurred to supershear speeds near the local P wave speeds.
Robinson et al.  and Das  hypothesized that supershear rupture speeds may require relatively straight faults. The supershear segment that we image along the Kokoxili rupture occurred along a straight fault (similar to the results of Robinson et al.). The supershear segments of the Denali rupture also occurred along relatively straight faults, assuming the Denali and Totschunda faults are considered separately.
 It is interesting to speculate why these ruptures occurred. Rather than nucleate somewhere on the very long fault planes that ruptured, the events were triggered by failure on a neighboring fault. One can speculate that if there was great heterogeneity in prestress or strength along the ruptures, different patches of the faults would have ruptured at different times, prohibiting the buildup of stress that enabled the long ruptures to propagate at supershear speeds. As discussed above, our images also show rupture characteristics that appear similar to those obtained by numerical models that generally assume homogeneous stress and strength along generic faults [e.g., Andrews, 1976]. The shear stress on the Kokoxili and Denali faults was not so high that the faults ruptured spontaneously at supershear speeds, but rather at a critical speed that supported dynamic growth and transition to supershear speeds (Figure 12). Therefore, a very long fault that extends to the surface and has general homogeneity in prestress and fault strength, together with smaller adjacent fault segments to provide triggering, may be necessary ingredients to observe a sub-Rayleigh to supershear rupture transition in strike-slip earthquakes.
 Global back projection imaging of the teleseismic P wave train illuminates a transition in rupture speed from subshear to supershear speeds (∼5.6 km/s) that approach the local P wave speed for the Kokoxili and Denali fault earthquakes. The transition occurs over a distance no greater than ∼40 km, which is consistent with an order-of-magnitude estimate based on recent lab experiments. The transitions begin after ∼100 km, and supershear persists for at least 200 km thereafter. Synthetic tests and other experiments with different frequency bands and imaging parameters suggest that our results are robust in that we can rule out subshear speeds along the imaged supershear segments. However, we cannot exclude supershear speeds within the range 4.5 to 6.5 km/s.
 The Kokoxili and Denali fault earthquakes produced perhaps the longest intracontinental strike-slip ruptures in modern seismic recording history, reaching rupture lengths of 340 to 400 km and maximum slips of at least 8 to 9 m. The rupture speed is a critical parameter for understanding the physics and evolution of the rupture process. Theoretical, numerical, and laboratory experiments show that rupture speeds approaching the P wave speed and the transition from subshear to supershear rupture speeds are physically possible. The most reliable, stable supershear rupture speeds that we image are remarkably similar, and close to the local P wave speed. This may suggest that these intracontinental faults are generally weak, or have a high breakdown slip or crustal rigidity. Regardless, it appears that the faults exhibited at least moderate slip or slip rate weakening, and that relatively little work was required to sustain the supershear ruptures for long durations. That both faults are similar in these regards is interesting given that they have remarkably different geologic ages and slip histories. Previous published results and our observations lead us to speculate that very long strike-slip faults that extend to the surface and have general homogeneity in prestress and fault strength, together with smaller adjacent fault segments to provide triggering, may have been the necessary ingredients that permitted the sub-Rayleigh to supershear rupture speed transition to occur for both the Kokoxili and Denali ruptures.
 We thank Mike Taylor and Jeff Freymueller for fault location data for the Tibet and Alaska areas, respectively. We also thank Sigrun Hreinsdóttir for her Denali slip model and Michael Antolik for his preferred Kokoxili slip model. We thank Paul Earle for helpful feedback and organizing Denali waveform data during the initial part of this study and Christine Houser for her database of teleseismic P wave time shift corrections. We also thank Eric Dunham and Shamita Das for discussions that helped improve the manuscript and Alex Hutko, Nelson Pulido, and Eric Dunham for constructive reviews. Waveforms used in this study were downloaded from the IRIS Data Management Center. We thank the National Science Foundation and the USGS NEHRP program for funding this research.