I’m catching up on this thread, another large honey-do project has me distracted.
I would agree with these comments that the sound of the first shot might sound somewhat different based on the gun position. As I recall, and maybe it was in Max Hollands National Geo video, that an early shot would require a steeper downward aim, and that would require putting the end of the barrel further back, perhaps in or near the plane of the window rather than sticking out much. I also seem to recall that Elsie Dorman mentioned the first sound being different and like some of it was actually coming from inside the building.
I’m not an expert in ballistics, but did look at shock waves a little bit during an earlier discussion with someone on another forum, so I’ll try to share what I know (or at least think I know) related to your questions. Others with expertise can probably correct or clarify any issues.
From what I understand, I think you basically have it right in what you have described, and that the shock was propagates out as sort of a straight line wave, but in three dimensions, along the sides of an expanding cone as it radiates out.
So the differences as I understand it is that in 3 dimensions the (point source) muzzle blast expands as a sphere, were as the components of the shock wave are constantly created along the path of the bullet, and the sound at the point of creation also radiates as a sphere, but due to the bullet movement the sounds created superimpose creating the surface of a cone which builds up the strong shock wave that radiates out at an angle from the bullet path determined by bullet speed (Mach value).
Comparatively, the way I understand it, is that the muzzle blast is from the hot gasses exiting the barrel which disrupts the air at a concentrated point area at the muzzle exit, the sound source. As you said, the sound propagates omnidirectional from there, basically as the surface of an expanding sphere. The sound Intensity (as power/unit area) decreases as 1/r^2 from the source since the surface area of the expanding sphere expands with r^2. Where r is the distance from source i.e. the sphere center.
The shock wave is created as the bullet constantly sheds off sound (a pressure wave) as the bullet travels and is seen as the angle of a cone relative to the bullet path, the angle depends on how fast beyond the speed of sound the bullet is going. The direction of shock wave propagation is perpendicular to the cone surface as it grows. The base of the expanding cone starts at the muzzle with the bullet moving forward and the wave goes mostly forward and out to the side from there, so behind the shooter there may be no shock wave encountered. I think the sound heard should be slightly different than a muzzle blast because of how the sound is created, and at its creation point along the path of the bullet the shock wave's net initial intensity may be lower than the initial intensity of the muzzle blast when one is closer to the gun (but intensity differences change with distance) and is probably at a higher frequency, hence the crack. The perceived direction of source for a muzzle blast would be directed towards the sounds source, the muzzle, and the shock wave would seem to radiate from a direction that is perpendicular to its source, the cone formed along the bullet path, which in not necessarily towards the gun. Being able to discern two separate sounds would depend on their arrival time to one's ear, which would probably need to be greater than 50-80 ms, so being able to hear two distinct sounds vs one compounded sound would also depend on direction and distance that one was positioned from the gun.
The boat wave is a good example of the shock type wave in two dimensions. Rotate that V-shape wake shape in 360 degrees around the boat path and you would have a “cone” shape. Maybe that’s what a fast submarine leaves behind?
I struggle with explaining stuff, so let me know if something seems unclear or not in agreement with the material you have seen.