share/optics/text/5_wave_optics.tex

   1 Electron microscopes can make images of individual atoms,
   2 but why will a visible-light microscope never be able to?
   3 Stereo speakers create the illusion of music that comes from
   4 a band arranged in your living room, but why doesn't the
   5 stereo illusion work with bass notes? Why are computer chip
   6 manufacturers investing billions of dollars in equipment to
   7 etch chips with x-rays instead of visible light?
   8
   9 The answers to all of these questions have to do with the
  10 subject of wave optics. So far this book has discussed the
  11 interaction of light waves with matter, and its practical
  12 applications to optical devices like mirrors, but we have
  13 used the ray model of light almost exclusively. Hardly ever
  14 have we explicitly made use of the fact that light is an
  15 electromagnetic wave. We were able to get away with the
  16 simple ray model because the chunks of matter we were
  17 discussing, such as lenses and mirrors, were thousands of
  18 times larger than a wavelength of light. We now turn to
  19 phenomena and devices that can only be understood using the
  20 wave model of light.
  21
  22 <% begin_sec("Diffraction",0) %>
  23
  24 <% marg(m4_ifelse(__sn,1,[:40:],[:0:])) %>
  25 <%
  26   fig(
  27     'double-slit-water-waves',
  28     %q{%
  29       In this view from overhead, a straight, sinusoidal
  30       water wave encounters a barrier with two gaps in it. Strong wave vibration occurs at
  31       angles X and Z, but there is none at all at angle Y. (The figure has been retouched
  32       from a real photo of water waves. In reality, the waves beyond the barrier would be
  33       much weaker than the ones before it, and they would therefore be difficult to see.)
  34     }
  35   )
  36 %>
  37 \spacebetweenfigs
  38 <%
  39   fig(
  40     'double-slit-no-diffraction',
  41     %q{This doesn't happen.}
  42   )
  43 %>
  44
  45 <% end_marg %>
  46 Figure \figref{double-slit-water-waves} shows a typical problem in wave optics, enacted
  47 with water waves. It may seem surprising that we don't get a
  48 simple pattern like figure \figref{double-slit-no-diffraction}, but the pattern would only
  49 be that simple if the wavelength was hundreds of times
  50 shorter than the distance between the gaps in the barrier
  51 and the widths of the gaps.
  52
  53 Wave optics is a broad subject, but this example will help
  54 us to pick out a reasonable set of restrictions to make
  55 things more manageable:
  56
  57 (1) We restrict ourselves to cases in which a wave travels
  58 through a uniform medium, encounters a certain area in which
  59 the medium has different properties, and then emerges on the
  60 other side into a second uniform region.
  61
  62 (2) We assume that the incoming wave is a nice tidy
  63 sine-wave pattern with wavefronts that are lines (or, in
  64 three dimensions, planes).
  65
  66 (3) In figure \figref{double-slit-water-waves} we can see that the wave pattern
  67 immediately beyond the barrier is rather complex, but
  68 farther on it sorts itself out into a set of wedges
  69 separated by gaps in which the water is still. We will
  70 restrict ourselves to studying the simpler wave patterns
  71 that occur farther away, so that the main question of
  72 interest is how intense the outgoing wave is at a given angle.
  73
  74 The kind of phenomenon described by restriction (1) is
  75 called \index{diffraction!defined}\emph{diffraction}. Diffraction
  76 can be defined as the behavior of a wave when it encounters
  77 an obstacle or a nonuniformity in its medium. In general,
  78 diffraction causes a wave to bend around obstacles and make
  79 patterns of strong and weak waves radiating out beyond the
  80 obstacle. Understanding diffraction is the central problem
  81 of wave optics. If you understand diffraction, even the
  82 subset of diffraction problems that fall within restrictions
  83 (2) and (3), the rest of wave optics is icing on the cake.
  84
  85 Diffraction can be used to find the structure of an unknown
  86 diffracting object: even if the object is too small to study
  87 with ordinary imaging, it may be possible to work backward
  88 from the diffraction pattern to learn about the object. The
  89 structure of a crystal, for example, can be determined from
  90 its x-ray diffraction pattern.
  91
  92 Diffraction can also be a bad thing. In a telescope, for
  93 example, light waves are diffracted by all the parts of the
  94 instrument. This will cause the image of a star to appear
  95 fuzzy even when the focus has been adjusted correctly. By
  96 understanding diffraction, one can learn how a telescope
  97 must be designed in order to reduce this problem ---
  98 essentially, it should have the biggest possible diameter.
  99
 100 There are two ways in which restriction (2) might commonly
 101 be violated. First, the light might be a mixture of
 102 wavelengths. If we simply want to observe a diffraction
 103 pattern or to use diffraction as a technique for studying
 104 the object doing the diffracting (e.g., if the object is too
 105 small to see with a microscope), then we can pass the light
 106 through a colored filter before diffracting it.
 107 <% marg(m4_ifelse(__sn,1,[:50:],[:25:])) %>
 108 <%
 109   fig(
 110     'double-slit-setup',
 111     %q{%
 112       A practical, low-tech setup for observing
 113       diffraction of light.
 114     }
 115   )
 116 %>
 117 \vspace{15mm}
 118 <%
 119   fig(
 120     'scaling',
 121     %q{%
 122       The bottom figure is simply a copy of the
 123       middle portion of the top one, scaled up by a factor of two. All the angles
 124       are the same. Physically, the angular pattern of the diffraction fringes
 125       can't be any different if we scale both $\lambda$ and $d$ by the same
 126       factor, leaving $\lambda/d$ unchanged.
 127     }
 128   )
 129 %>
 130 <% end_marg %>
 131
 132 A second issue is that light from sources such as the sun or
 133 a lightbulb does not consist of a nice neat plane wave,
 134 except over very small regions of space. Different parts of
 135 the wave are out of step with each other, and the wave is
 136 referred to as \index{incoherent light}\emph{incoherent}. One way
 137 of dealing with this is shown in figure \figref{double-slit-setup}. After filtering
 138 to select a certain wavelength of red light, we pass the
 139 light through a small pinhole. The region of the light that
 140 is intercepted by the pinhole is so small that one part of
 141 it is not out of step with another. Beyond the pinhole,
 142 light spreads out in a spherical wave; this is analogous to
 143 what happens when you speak into one end of a paper towel
 144 roll and the sound waves spread out in all directions from
 145 the other end. By the time the spherical wave gets to the
 146 double slit it has spread out and reduced its curvature, so
 147 that we can now think of it as a simple plane wave.
 148
 149 If this seems laborious, you may be relieved to know that
 150 modern technology gives us an easier way to produce a
 151 single-wavelength, coherent beam of light: the laser.
 152
 153 The parts of the final image on the screen in \figref{double-slit-setup} are called
 154 diffraction \index{diffraction!fringe}\index{fringe!diffraction}fringes.
 155 The center of each fringe is a point of maximum brightness,
 156 and halfway between two fringes is a minimum.
 157
 158 \startdq
 159
 160 \begin{dq}
 161 Why would x-rays rather than visible light be used to find
 162 the structure of a crystal? Sound waves are used to make
 163 images of fetuses in the womb. What would influence the
 164 choice of wavelength?
 165 \end{dq}
 166
 167 <% end_sec() %>
 168 <% begin_sec("Scaling of Diffraction",0) %>\index{diffraction!scaling of}
 169
 170 This chapter has ``optics'' in its title, so it is nominally
 171 about light, but we started out with an example involving
 172 water waves. Water waves are certainly easier to visualize,
 173 but is this a legitimate comparison? In fact the analogy
 174 works quite well, despite the fact that a light wave has a
 175 wavelength about a million times shorter. This is because
 176 diffraction effects scale uniformly. That is, if we enlarge
 177 or reduce the whole diffraction situation by the same
 178 factor, including both the wavelengths and the sizes of the
 179 obstacles the wave encounters, the result is still a valid solution.
 180
 181 This is unusually simple behavior! In __subsection_or_section(scaling)
 182 we saw many examples of more complex scaling, such as
 183 the impossibility of bacteria the size of dogs, or the need
 184 for an elephant to eliminate heat through its ears because
 185 of its small surface-to-volume ratio, whereas a tiny shrew's
 186 life-style centers around conserving its body heat.
 187
 188 Of course water waves and light waves differ in many ways,
 189 not just in scale, but the general facts you will learn
 190 about diffraction are applicable to all waves. In some ways
 191 it might have been more appropriate to insert this chapter
 192 after __section_or_chapter(bounded-waves) on bounded waves, but many of the
 193 important applications are to light waves, and you would
 194 probably have found these much more difficult without any
 195 background in optics.
 196
 197 Another way of stating the simple scaling behavior of
 198 diffraction is that the diffraction angles we get depend
 199 only on the unitless ratio $\lambda $/d, where $\lambda$ is
 200 the wavelength of the wave and $d$ is some dimension of the
 201 diffracting objects, e.g., the center-to-center spacing
 202 between the slits in figure \figref{double-slit-water-waves}. If, for instance, we scale
 203 up both $\lambda $ and $d$ by a factor of 37, the ratio
 204 $\lambda /d$ will be unchanged.
 205 m4_ifelse(__sn,1,[:
 206 %%%%%%%%%%%% figure needs to be on a different page for LM -- see below
 207 <% marg() %>
 208 <%
 209   fig(
 210     'huygens',
 211     %q{Christiaan Huygens (1629-1695).}
 212   )
 213 %>
 214 <% end_marg %>
 215 :],[::])
 216
 217 <% end_sec() %>
 218 <% begin_sec("The Correspondence Principle",0) %>
 219
 220 The only reason we don't usually notice diffraction of light
 221 in everyday life is that we don't normally deal with objects
 222 that are comparable in size to a wavelength of visible
 223 light, which is about a millionth of a meter. Does this mean
 224 that wave optics contradicts ray optics, or that wave optics
 225 sometimes gives wrong results? No. If you hold three fingers
 226 out in the sunlight and cast a shadow with them, \emph{either}
 227 wave optics or ray optics can be used to predict the
 228 straightforward result: a shadow pattern with two bright
 229 lines where the light has gone through the gaps between your
 230 fingers. Wave optics is a more general theory than ray
 231 optics, so in any case where ray optics is valid, the two
 232 theories will agree. This is an example of a general idea
 233 enunciated by the physicist \index{Bohr!Niels}Niels Bohr,
 234 called the \index{correspondence principle}\emph{correspondence
 235 principle:\/} when flaws in a physical theory lead to the
 236 creation of a new and more general theory, the new theory
 237 must still agree with the old theory within its more
 238 restricted area of applicability. After all, a theory is
 239 only created as a way of describing experimental observations.
 240 If the original theory had not worked in any cases at all,
 241 it would never have become accepted.
 242
 243 m4_ifelse(__sn,1,[::],[:
 244 %%%%%%%%%%%% figure needs to be on a different page for SN -- see above
 245 <% marg(70) %>
 246 <%
 247   fig(
 248     'huygens',
 249     %q{Christiaan Huygens (1629-1695).}
 250   )
 251 %>
 252 <% end_marg %>
 253 :])
 254
 255 In the case of optics, the correspondence principle tells us
 256 that when $\lambda /d$ is small, both the ray and the wave
 257 model of light must give approximately the same result.
 258 Suppose you spread your fingers and cast a shadow with them
 259 using a coherent light source. The quantity $\lambda /d$ is
 260 about $10^{-4}$, so the two models will agree very closely. (To
 261 be specific, the shadows of your fingers will be outlined by
 262 a series of light and dark fringes, but the angle subtended
 263 by a fringe will be on the order of $10^{-4}$ radians, so
 264 they will be too tiny to be visible.
 265
 266 \pagebreak
 267
 268 <% self_check('diffract-around-body',<<-'SELF_CHECK'
 269 What kind of wavelength would an electromagnetic wave have
 270 to have in order to diffract dramatically around your body?
 271 Does this contradict the correspondence principle?
 272   SELF_CHECK
 273   ) %>
 274
 275 <% end_sec() %>
 276 <% begin_sec("Huygens' Principle",0) %>
 277
 278 <% marg(m4_ifelse(__sn,1,[:70:],[:10:])) %>
 279 <%
 280   fig(
 281     'double-slit-water-waves-photo',
 282     %q{Double-slit diffraction.}
 283   )
 284 %>
 285 \spacebetweenfigs
 286 <%
 287   fig(
 288     'huygens-1',
 289     %q{%
 290       A wavefront can be analyzed by the principle of superposition,
 291       breaking it down into many small parts.
 292     }
 293   )
 294 %>
 295 \spacebetweenfigs
 296 <%
 297   fig(
 298     'huygens-2',
 299     %q{%
 300       If it was by itself, each of the parts would spread out as a
 301       circular ripple.
 302     }
 303   )
 304 %>
 305 \spacebetweenfigs
 306 <%
 307   fig(
 308     'huygens-3',
 309     %q{Adding up the ripples produces a new wavefront.}
 310   )
 311 %>
 312
 313 <% end_marg %>
 314 Returning to the example of double-slit diffraction, \figref{double-slit-water-waves-photo},
 315 note the strong visual impression of two overlapping sets of
 316 concentric semicircles. This is an example of \index{Huygens'
 317 principle}\emph{Huygens' principle}, named after a Dutch physicist
 318 and astronomer. (The first syllable rhymes with ``boy.'')
 319 Huygens' principle states that any wavefront can be broken
 320 down into many small side-by-side wave peaks, \figref{huygens-1}, which
 321 then spread out as circular ripples, \figref{huygens-2}, and by the
 322 principle of superposition, the result of adding up these
 323 sets of ripples must give the same result as allowing the
 324 wave to propagate forward, \figref{huygens-3}. In the case of sound or
 325 light waves, which propagate in three dimensions, the
 326 ``ripples'' are actually spherical rather than circular, but
 327 we can often imagine things in two dimensions for simplicity.
 328
 329 In double-slit diffraction the application of Huygens'
 330 principle is visually convincing: it is as though all the
 331 sets of ripples have been blocked except for two. It is a
 332 rather surprising mathematical fact, however, that Huygens'
 333 principle gives the right result in the case of an
 334 unobstructed linear wave, \figref{huygens-2} and \figref{huygens-3}. A theoretically
 335 infinite number of circular wave patterns somehow conspire
 336 to add together and produce the simple linear wave motion
 337 with which we are familiar.
 338
 339 Since Huygens' principle is equivalent to the principle of
 340 superposition, and superposition is a property of waves,
 341 what Huygens had created was essentially the first wave
 342 theory of light. However, he imagined light as a series of
 343 pulses, like hand claps, rather than as a sinusoidal wave.
 344
 345 The history is interesting. Isaac \index{Newton, Isaac!particle theory of light}Newton
 346 loved the atomic theory of matter so much that he searched
 347 enthusiastically for evidence that light was also made of
 348 tiny particles. The paths of his light particles would
 349 correspond to rays in our description; the only significant
 350 difference between a ray model and a particle model of light
 351 would occur if one could isolate individual particles and
 352 show that light had a ``graininess'' to it. Newton never did
 353 this, so although he thought of his model as a particle
 354 model, it is more accurate to say he was one of the builders
 355 of the ray \index{ray model of light}model.
 356
 357 Almost all that was known about reflection and refraction of
 358 light could be interpreted equally well in terms of a
 359 \index{particle model of light}particle model or a wave
 360 model, but Newton had one reason for strongly opposing
 361 Huygens' \index{wave model of light}wave theory. Newton knew
 362 that waves exhibited diffraction, but diffraction of light
 363 is difficult to observe, so Newton believed that light did
 364 not exhibit diffraction, and therefore must not be a wave.
 365 Although Newton's criticisms were fair enough, the debate
 366 also took on the overtones of a nationalistic dispute
 367 between England and continental Europe, fueled by English
 368 resentment over Leibniz's supposed plagiarism of Newton's
 369 calculus. Newton wrote a book on optics, and his prestige
 370 and political prominence tended to discourage questioning of his model.
 371
 372 <% marg(m4_ifelse(__sn,1,[:70:],[:53.5:])) %>
 373 <%
 374   fig(
 375     'young',
 376     %q{Thomas Young}
 377   )
 378 %>
 379 \spacebetweenfigs
 380 <%
 381   fig(
 382     'double-slit-water-waves',
 383     %q{Double-slit diffraction.},
 384     {'suffix'=>'2'}
 385   )
 386 %>
 387 \spacebetweenfigs
 388 <%
 389   fig(
 390     'double-slit-overlapping',
 391     %q{Use of Huygens' principle.}
 392   )
 393 %>
 394 \spacebetweenfigs
 395 <%
 396   fig(
 397     'double-slit-path-length',
 398     %q{Constructive interference along the center-line.}
 399   )
 400 %>
 401
 402 <% end_marg %>
 403 \index{Young, Thomas}Thomas Young (1773-1829) was the person
 404 who finally, a hundred years later, did a careful search for
 405 wave interference effects with light and analyzed the
 406 results correctly. He observed double-slit diffraction of
 407 light as well as a variety of other diffraction effects, all
 408 of which showed that light exhibited wave interference
 409 effects, and that the wavelengths of visible light waves
 410 were extremely short. The crowning achievement was the
 411 demonstration by the experimentalist Heinrich \index{Hertz,
 412 Heinrich!Heinrich}Hertz and the theorist James Clerk
 413 \index{Maxwell, James Clerk}Maxwell that light was an
 414 \emph{electromagnetic} wave. Maxwell is said to have related
 415 his discovery to his wife one starry evening and told her
 416 that she was the only other person in the world who knew what starlight was.
 417
 418 <% end_sec() %>
 419 <% begin_sec("Double-Slit Diffraction",0) %>\index{double-slit diffraction}\index{diffraction!double-slit}
 420
 421 Let's now analyze double-slit diffraction, \figref{double-slit-water-waves2}, using
 422 Huygens' principle. The most interesting question is how to
 423 compute the angles such as X and Z where the wave
 424 intensity is at a maximum, and the in-between angles like
 425 Y where it is minimized. Let's measure all our angles
 426 with respect to the vertical center line of the figure,
 427 which was the original direction of propagation of the wave.
 428
 429 If we assume that the width of the slits is small (on the
 430 order of the wavelength of the wave or less), then we can
 431 imagine only a single set of Huygens ripples spreading out
 432 from each one, \figref{double-slit-overlapping}.  White lines represent peaks, black
 433 ones troughs. The only dimension of the diffracting
 434 slits that has any effect on the geometric pattern of the
 435 overlapping ripples is then the center-to-center distance,
 436 $d$, between the slits.
 437
 438 We know from our discussion of the scaling of diffraction
 439 that there must be some equation that relates an angle like
 440 $\theta_Z$ to the ratio $\lambda /d$,
 441 \begin{equation*}
 442         \frac{\lambda}{d} \leftrightarrow \theta_Z \qquad .
 443 \end{equation*}
 444 If the equation for $\theta_Z$ depended on some other
 445 expression such as $\lambda +d$ or $\lambda^2/d$, then it
 446 would change when we scaled $\lambda $ and $d$ by the same
 447 factor, which would violate what we know about the
 448 scaling of diffraction.
 449
 450 Along the central maximum line, X, we always have positive
 451 waves coinciding with positive ones and negative waves
 452 coinciding with negative ones. (I have arbitrarily chosen to
 453 take a snapshot of the pattern at a moment when the waves
 454 emerging from the slit are experiencing a positive peak.)
 455 The superposition of the two sets of ripples therefore
 456 results in a doubling of the wave amplitude along this line.
 457 There is constructive interference. This is easy to explain,
 458 because by symmetry, each wave has had to travel an equal
 459 number of wavelengths to get from its slit to the center line, \figref{double-slit-path-length}:
 460 Because both sets of ripples have ten wavelengths to cover in
 461 order to reach the point along direction X, they will be in step when they get there.
 462
 463 <% marg(55) %>
 464 <%
 465   fig(
 466     'double-slit-derivation-1',
 467     %q{%
 468       The waves travel distances $L$ and $L'$ from the two
 469       slits to get to the same point in space, at an angle $\theta$ from the center line.
 470     }
 471   )
 472 %>
 473 \spacebetweenfigs
 474 <%
 475   fig(
 476     'double-slit-derivation-2',
 477     %q{%
 478       A close-up view of figure \figref{double-slit-derivation-1}, showing
 479       how the path length difference $L-L'$ is related to $d$ and to the angle $\theta$.
 480     }
 481   )
 482 %>
 483
 484 <% end_marg %>
 485 At the point along direction Y shown in the same figure,
 486 one wave has traveled ten wavelengths, and is therefore at a
 487 positive extreme, but the other has traveled only nine and a
 488 half wavelengths, so it at a negative extreme. There is
 489 perfect cancellation, so points along this line experience no wave motion.
 490
 491 But the distance traveled does not have to be equal in order
 492 to get constructive interference. At the point along
 493 direction Z, one wave has gone nine wavelengths and the
 494 other ten. They are both at a positive extreme.
 495
 496 <% self_check('trough-trough',<<-'SELF_CHECK'
 497 At a point half a wavelength below the point marked along
 498 direction X, carry out a similar analysis.
 499   SELF_CHECK
 500   ) %>
 501
 502 To summarize, we will have perfect constructive interference
 503 at any point where the distance to one slit differs from the
 504 distance to the other slit by an integer number of
 505 wavelengths. Perfect destructive interference will occur
 506 when the number of wavelengths of path length difference
 507 equals an integer plus a half.
 508
 509 Now we are ready to find the equation that predicts the
 510 angles of the maxima and minima. The waves travel different
 511 distances to get to the same point in space, \figref{double-slit-derivation-1}. We need to
 512 find whether the waves are in phase (in step) or out of
 513 phase at this point in order to predict whether there will
 514 be constructive interference, destructive interference, or
 515 something in between.
 516
 517 One of our basic assumptions in this chapter is that we will
 518 only be dealing with the diffracted wave in regions very far
 519 away from the object that diffracts it, so the triangle is
 520 long and skinny. Most real-world examples with diffraction
 521 of light, in fact, would have triangles with even skinner
 522 proportions than this one. The two long sides are therefore
 523 very nearly parallel, and we are justified in drawing the
 524 right triangle shown in figure \figref{double-slit-derivation-2}, labeling one leg of the
 525 right triangle as the difference in path length , $L-L'$,
 526 and labeling the acute angle as $\theta $. (In reality this
 527 angle is a tiny bit greater than the one labeled $\theta $
 528 in figure \figref{double-slit-derivation-1}.)
 529
 530 The difference in path length is related to $d$ and
 531 $\theta $ by the equation
 532 \begin{equation*}
 533         \frac{L-L'}{d}          =  \sin  \theta    \qquad   .
 534 \end{equation*}
 535 Constructive interference will result in a maximum at angles
 536 for which $L-L'$ is an integer number of wavelengths,
 537 \begin{multline*}
 538         L-L'  =  m\lambda \qquad . \hfill
 539                   \shoveright{\text{[condition for a maximum;}}\\
 540                   \text{$m$ is an integer]}
 541 \end{multline*}
 542 Here $m$ equals 0 for the central maximum, $-1$ for the first
 543 maximum to its left, $+2$ for the second maximum on the right,
 544 etc. Putting all the ingredients together, we find
 545 $m\lambda/d=\sin \theta $, or
 546 \begin{multline*}
 547         \frac{\lambda}{d} = \frac{\sin\theta}{m} \qquad . \hfill
 548                   \shoveright{\text{[condition for a maximum;}}\\
 549                   \text{$m$ is an integer]}
 550 \end{multline*}
 551 Similarly, the condition for a minimum is
 552 \begin{multline*}
 553         \frac{\lambda}{d} = \frac{\sin\theta}{m} \qquad . \hfill
 554                   \shoveright{\text{[condition for a minimum;}}\\
 555                   \text{$m$ is an integer plus $1/2$]}
 556 \end{multline*}
 557 That is, the minima are about halfway between the maxima.
 558
 559 \vspace{2mm plus 3mm}
 560
 561 As expected based on scaling, this equation relates angles
 562 to the unitless ratio $\lambda /d$. Alternatively, we could
 563 say that we have proven the scaling property in the special
 564 case of double-slit diffraction. It was inevitable that the
 565 result would have these scaling properties, since the whole
 566 proof was geometric, and would have been equally valid when
 567 enlarged or reduced on a photocopying machine!
 568
 569 <% marg(m4_ifelse(__sn,1,[:120:],[:81:])) %>
 570 <%
 571   fig(
 572     'double-slit-d',
 573     %q{Cutting $d$ in half doubles the angles of the diffraction fringes.}
 574   )
 575 %>
 576 \spacebetweenfigs
 577 <%
 578   fig(
 579     'double-slit-wavelength',
 580     %q{%
 581       Double-slit diffraction patterns of long-wavelength red light (top)
 582       and short-wavelength blue light (bottom).
 583     }
 584   )
 585 %>
 586
 587 <% end_marg %>
 588
 589 \vspace{2mm plus 3mm}
 590
 591 Counterintuitively, this means that a diffracting object
 592 with smaller dimensions produces a bigger diffraction pattern, \figref{double-slit-d}.
 593
 594 \vspace{4mm plus 3mm}
 595
 596 \enlargethispage{-4\baselineskip}
 597
 598 \begin{eg}{Double-slit diffraction of blue and red light}
 599 Blue light has a shorter wavelength than red. For a given
 600 double-slit spacing $d$, the smaller value of $\lambda /d$
 601 for leads to smaller values of $\sin \theta $, and
 602 therefore to a more closely spaced set of diffraction fringes, (g)
 603 \end{eg}
 604
 605 \vspace{2mm plus 3mm}
 606
 607 \begin{eg}{The correspondence principle}
 608 Let's also consider how the equations for double-slit
 609 diffraction relate to the correspondence principle. When the
 610 ratio $\lambda /d$ is very small, we should recover the case
 611 of simple ray optics. Now if $\lambda /d$ is small,
 612 $\sin\theta $ must be small as well, and the spacing between the
 613 diffraction fringes will be small as well. Although we have
 614 not proven it, the central fringe is always the brightest,
 615 and the fringes get dimmer and dimmer as we go farther from
 616 it. For small values of $\lambda /d$, the part of the
 617 diffraction pattern that is bright enough to be detectable
 618 covers only a small range of angles. This is exactly what we
 619 would expect from ray optics: the rays passing through the
 620 two slits would remain parallel, and would continue moving
 621 in the $\theta =0$ direction. (In fact there would be images
 622 of the two separate slits on the screen, but our analysis
 623 was all in terms of angles, so we should not expect it to
 624 address the issue of whether there is structure within a set
 625 of rays that are all traveling in the $\theta =0$ direction.)
 626 \end{eg}
 627 <% marg(-20) %>
 628 <%
 629   fig(
 630     'diffraction-graph',
 631     %q{Interpretation of the angular spacing $\Delta\theta$ in example \ref{eg:diffraction-delta-theta}.
 632        It can be defined
 633        either from maximum to maximum or from
 634        minimum to minimum. Either way, the result is the same. It does not make sense to try to interpret
 635        $\Delta\theta$ as the width of a fringe; one can see from the graph and from the
 636        image below that it is not obvious either
 637        that such a thing is well defined or that it would be the same for all fringes.}
 638   )
 639 %>
 640 <% end_marg %>
 641
 642 \enlargethispage{\baselineskip}
 643
 644 \begin{eg}{Spacing of the fringes at small angles}\label{eg:diffraction-delta-theta}
 645 At small angles, we can use the approximation $\sin\theta\approx\theta$, which is
 646 valid if $\theta $ is measured in radians. The equation for
 647 double-slit diffraction becomes simply
 648 \begin{equation*}
 649  \frac{\lambda}{d} = \frac{\theta}{m} \qquad ,
 650 \end{equation*}
 651 which can be solved for $\theta $ to give
 652 \begin{equation*}
 653   \theta = \frac{m\lambda}{d} \qquad .
 654 \end{equation*}
 655 The difference in angle between successive fringes is the
 656 change in $\theta $ that results from changing $m$
 657 by plus or minus one,
 658 \begin{equation*}
 659                 \Delta\theta  = \frac{\lambda}{d}    \qquad   .
 660 \end{equation*}
 661 For example, if we write $\theta_7$ for the angle of the
 662 seventh bright fringe on one side of the central maximum and
 663 $\theta_8$ for the neighboring one, we have
 664 \begin{align*}
 665                 \theta_8-\theta_7 &= \frac{8\lambda}{d}-\frac{7\lambda}{d}\\
 666                          &= \frac{\lambda}{d} \qquad ,
 667 \end{align*}
 668 and similarly for any other neighboring pair of fringes.
 669 \end{eg}
 670
 671 Although the equation $\lambda /d=\sin \theta /m$ is only
 672 valid for a double slit, it is can still be a guide to our
 673 thinking even if we are observing diffraction of light by a
 674 virus or a flea's leg: it is always true that
 675
 676 (1) large values of $\lambda /d$ lead to a broad diffraction pattern, and
 677
 678 (2) diffraction patterns are repetitive.
 679
 680 In many cases the equation looks just like
 681 $\lambda /d =\sin \theta /m$ but with an extra numerical factor thrown
 682 in, and with $d$ interpreted as some other dimension of the
 683 object, e.g., the diameter of a piece of wire.
 684
 685 \enlargethispage{-4\baselineskip}
 686
 687 <% end_sec() %>
 688 <% begin_sec("Repetition",3) %>
 689
 690 Suppose we replace a double slit with a triple slit, \figref{triple-slit}. We
 691 can think of this as a third \index{repetition of
 692 diffracting objects}repetition of the structures that were
 693 present in the double slit. Will this device be an
 694 improvement over the double slit for any practical reasons?
 695 <% marg(0) %>
 696 <%
 697   fig(
 698     'triple-slit',
 699     %q{A triple slit.}
 700   )
 701 %>
 702 \spacebetweenfigs
 703 <%
 704   fig(
 705     'two-slits-and-five',
 706     %q{A double-slit diffraction pattern (top), and a pattern made by five slits (bottom).}
 707   )
 708 %>
 709 <% end_marg %>
 710
 711 The answer is yes, as can be shown using figure \figref{triple-slit-numerical}.
 712  For ease of visualization, I have violated our usual
 713 rule of only considering points very far from the diffracting
 714 object. The scale of the drawing is such that a wavelengths
 715 is one cm. In \subfigref{triple-slit-numerical}{1}, all three waves travel an integer number
 716 of wavelengths to reach the same point, so there is a bright
 717 central spot, as we would expect from our experience with
 718 the double slit. In figure \subfigref{triple-slit-numerical}{2}, we show the path lengths to
 719 a new point. This point is farther from slit A by a quarter
 720 of a wavelength, and correspondingly closer to slit C. The
 721 distance from slit B has hardly changed at all. Because
 722 the paths lengths traveled from slits A and C differ by
 723 half a wavelength, there will be perfect destructive
 724 interference between these two waves. There is still some
 725 uncanceled wave intensity because of slit B, but the
 726 amplitude will be three times less than in figure \subfigref{triple-slit-numerical}{1},
 727 resulting in a factor of 9 decrease in brightness. Thus, by
 728 moving off to the right a little, we have gone from the
 729 bright central maximum to a point that is quite dark.
 730 <%
 731   fig(
 732     'triple-slit-numerical',
 733     %q{%
 734       1. There is a bright central maximum. 2. At this point
 735       just off the central maximum, the path lengths traveled by the three waves have changed.
 736     },
 737     {
 738       'width'=>'fullpage'
 739     }
 740   )
 741 %>
 742
 743 Now let's compare with what would have happened if slit C
 744 had been covered, creating a plain old double slit. The
 745 waves coming from slits A and B would have been out of
 746 phase by 0.23 wavelengths, but this would not have caused
 747 very severe interference. The point in figure \subfigref{triple-slit-numerical}{2} would have
 748 been quite brightly lit up.
 749
 750 \enlargethispage{-\baselineskip}
 751
 752 To summarize, we have found that adding a third slit narrows
 753 down the central fringe dramatically. The same is true for
 754 all the other fringes as well, and since the same amount of
 755 energy is concentrated in narrower diffraction fringes, each
 756 fringe is brighter and easier to see, \figref{two-slits-and-five}.
 757
 758 This is an example of a more general fact about diffraction:
 759 if some feature of the diffracting object is repeated, the
 760 locations of the maxima and minima are unchanged, but
 761 they become narrower.
 762
 763 Taking this reasoning to its logical conclusion, a
 764 diffracting object with thousands of slits would produce
 765 extremely narrow fringes. Such an object is called a
 766 \index{diffraction grating}diffraction grating.
 767
 768 <% end_sec() %>
 769 <% begin_sec("Single-Slit Diffraction",0) %>\index{single-slit!diffraction}\index{diffraction!single-slit}
 770
 771 <% marg(m4_ifelse(__sn,1,[:0:],[:0:])) %>
 772 <%
 773   fig(
 774     'single-slit-water-waves',
 775     %q{Single-slit diffraction of water waves.}
 776   )
 777 %>
 778 \spacebetweenfigs
 779 <%
 780   fig(
 781     'single-slit',
 782     %q{Single-slit diffraction of red light. Note the double width of the central maximum.}
 783   )
 784 %>
 785 \spacebetweenfigs
 786 <%
 787   fig(
 788     'single-slit-simulated-with-three-sources',
 789     %q{%
 790       A pretty good simulation of the single-slit pattern of figure \figref{single-slit-water-waves}, made
 791       by using three motors to produce overlapping ripples from three neighboring points in the water.
 792     }
 793   )
 794 %>
 795
 796 <% end_marg %>
 797 If we use only a single slit, is there diffraction? If the
 798 slit is not wide compared to a wavelength of light, then we
 799 can approximate its behavior by using only a single set of
 800 Huygens ripples. There are no other sets of ripples to add
 801 to it, so there are no constructive or destructive
 802 interference effects, and no maxima or minima. The result
 803 will be a uniform spherical wave of light spreading out in
 804 all directions, like what we would expect from a tiny
 805 lightbulb. We could call this a diffraction pattern, but it
 806 is a completely featureless one, and it could not be used,
 807 for instance, to determine the wavelength of the light, as
 808 other diffraction patterns could.
 809
 810 All of this, however, assumes that the slit is narrow
 811 compared to a wavelength of light. If, on the other hand,
 812 the slit is broader, there will indeed be interference among
 813 the sets of ripples spreading out from various points along
 814 the opening. Figure \figref{single-slit-water-waves} shows an example with water waves,
 815 and figure \figref{single-slit} with light.
 816
 817 <% self_check('single-slit-wavelength',<<-'SELF_CHECK'
 818 How does the wavelength of the waves compare with the width
 819 of the slit in figure \figref{single-slit-water-waves}?
 820   SELF_CHECK
 821   ) %>
 822
 823 We will not go into the details of the analysis of
 824 single-slit diffraction, but let us see how its properties
 825 can be related to the general things we've learned about
 826 diffraction. We know based on scaling arguments that the
 827 angular sizes of features in the diffraction pattern must be
 828 related to the wavelength and the width, $a$, of the slit by
 829 some relationship of the form
 830 \begin{equation*}
 831         \frac{\lambda}{a} \leftrightarrow \theta \qquad .
 832 \end{equation*}
 833 This is indeed true, and for instance the angle between the
 834 maximum of the central fringe and the maximum of the next
 835 fringe on one side equals $1.5\lambda/a$. Scaling arguments will
 836 never produce factors such as the 1.5, but they tell us that
 837 the answer must involve $\lambda /a$, so all the familiar
 838 qualitative facts are true. For instance, shorter-wavelength
 839 light will produce a more closely spaced diffraction pattern.
 840
 841 \enlargethispage{-\baselineskip}
 842
 843 An important scientific example of single-slit diffraction
 844 is in telescopes. Images of individual stars, as in
 845 figure \figref{pleiades-closeup}, are a good way to examine diffraction effects,
 846 because all stars except the sun are so far away that no
 847 telescope, even at the highest magnification, can image
 848 their disks or surface features. Thus any features of a
 849 star's image must be due purely to optical effects such as
 850 diffraction. A prominent cross appears around the brightest
 851 star, and dimmer ones surround the dimmer stars. Something
 852 like this is seen in most telescope photos, and indicates
 853 that inside the tube of the telescope there were two
 854 perpendicular struts or supports. Light diffracted around
 855 these struts. You might think that diffraction could be
 856 eliminated entirely by getting rid of all obstructions in
 857 the tube, but the circles around the stars are diffraction
 858 effects arising from single-slit diffraction at the mouth of
 859 the \index{telescope}telescope's tube! (Actually we have not
 860 even talked about diffraction through a circular opening, but
 861 the idea is the same.) Since the angular sizes of the
 862 diffracted images depend on $\lambda $/a, the only way to
 863 improve the resolution of the images is to increase the
 864 diameter, $a$, of the tube. This is one of the main reasons
 865 (in addition to light-gathering power) why the best
 866 telescopes must be very large in diameter.
 867
 868 <% marg(131) %>
 869 <%
 870   fig(
 871     'pleiades-closeup',
 872     %q{%
 873       An image of the Pleiades star cluster.
 874       The circular rings around the bright stars are due to single-slit
 875       diffraction at the mouth of the telescope's tube.
 876     }
 877   )
 878 %>
 879
 880 <% end_marg %>
 881 <% self_check('radio-telescopes',<<-'SELF_CHECK'
 882 What would this imply about radio telescopes as compared
 883 with visible-light telescopes?
 884   SELF_CHECK
 885   ) %>
 886
 887 <% marg(30) %>
 888 <%
 889   fig(
 890     'very-large-array',
 891     %q{A radio telescope.}
 892   )
 893 %>
 894
 895 <% end_marg %>
 896
 897 m4_ifelse(__sn,1,[:\enlargethispage{\baselineskip}:],[::])
 898 Double-slit diffraction is easier to understand conceptually
 899 than single-slit diffraction, but if you do a double-slit
 900 diffraction experiment in real life, you are likely to
 901 encounter a complicated pattern like figure \subfigref{double-slit-fringes-realistic}{1},
 902 rather than the simpler one, 2, you were expecting.
 903 This is because the slits are fairly big compared to the
 904 wavelength of the light being used. We really have two
 905 different distances in our pair of slits: $d$, the distance
 906 between the slits, and $w$, the width of each slit. Remember
 907 that smaller distances on the object the light diffracts
 908 around correspond to larger features of the diffraction
 909 pattern. The pattern 1 thus has two spacings in it: a short
 910 spacing corresponding to the large distance $d$, and a long
 911 spacing that relates to the small dimension $w$.
 912
 913 <%
 914   fig(
 915     'double-slit-fringes-realistic',
 916     %q{%
 917       1. A diffraction pattern formed by a real double slit. The width of each slit is fairly big
 918       compared to the wavelength of the light. This is a real photo. 2. This idealized pattern is not likely to occur in real life. To get it,
 919       you would need each slit to be so narrow that its width was comparable to the wavelength of the light, but that's not usually possible. This is not a real
 920       photo. 3. A real photo of a single-slit diffraction pattern caused by a slit whose width is the same as the widths of the slits used to make the
 921       top pattern.
 922     },
 923     {
 924       'width'=>'fullpage'
 925     }
 926   )
 927 %>
 928
 929
 930 \startdq
 931
 932 \begin{dq}
 933 Why is it optically impossible for bacteria to evolve eyes
 934 that use visible light to form images?
 935 \end{dq}
 936
 937 <% end_sec() %>
 938 <% begin_sec("The Principle of Least Time",nil,'',{'optional'=>true,'calc'=>true}) %>
 939
 940 In m4_ifelse(__sn,1,[:subsection:],[:section:])
 941 __bare_subsection_or_section(least-time-reflection)
 942 and
 943 __bare_subsection_or_section(least-time-refraction),
 944 we saw how in the ray model of
 945 light, both refraction and reflection can be described in an
 946 elegant and beautiful way by a single principle, the
 947 principle of least time. We can now justify the principle of
 948 least time based on the wave model of light. Consider an
 949 example involving reflection, \figref{reflection-least-time}. Starting at point A,
 950 Huygens' principle for waves tells us that we can think of
 951 the wave as spreading out in all directions. Suppose we
 952 imagine all the possible ways that a ray could travel from A
 953 to B. We show this by drawing 25 possible paths, of which
 954 the central one is the shortest. Since the principle of
 955 least time connects the wave model to the ray model, we
 956 should expect to get the most accurate results when the
 957 wavelength is much shorter than the distances involved ---
 958 for the sake of this numerical example, let's say that a
 959 wavelength is 1/10 of the shortest reflected path from A to
 960 B. The table, 2, shows the distances traveled by the 25 rays.
 961 m4_ifelse(__sn,0,[:
 962 <% marg(0) %>
 963 <%
 964   fig(
 965     'reflection-least-time',
 966     %q{Light could take many different paths from A to B.}
 967   )
 968 %>
 969 <% end_marg %>
 970 :])
 971
 972
 973 Note how similar are the distances traveled by the group of
 974 7 rays, indicated with a bracket, that come closest to
 975 obeying the principle of least time. If we think of each one
 976 as a wave, then all 7 are again nearly in phase at point
 977 B. However, the rays that are farther from satisfying the
 978 principle of least time show more rapidly changing
 979 distances; on reuniting at point B, their phases are a
 980 random jumble, and they will very nearly cancel each other
 981 out. Thus, almost none of the wave energy delivered to point
 982 B goes by these longer paths. Physically we find, for
 983 instance, that a wave pulse emitted at A is observed at B
 984 after a time interval corresponding very nearly to the
 985 shortest possible path, and the pulse is not very ``smeared
 986 out'' when it gets there. The shorter the wavelength
 987 compared to the dimensions of the figure, the more accurate
 988 these approximate statements become.\index{least time, principle of}
 989
 990 Instead of drawing a finite number of rays, such 25, what
 991 happens if we think of the angle, $\theta $, of emission of
 992 the ray as a continuously varying variable? Minimizing the
 993 distance $L$ requires
 994 \begin{equation*}
 995         \frac{\der L}{\der\theta} = 0 \qquad .
 996 \end{equation*}
 997
 998 Because $L$ is changing slowly in the vicinity of the angle
 999 that satisfies the principle of least time, all the rays
1000 that come out close to this angle have very nearly the same
1001 $L$, and remain very nearly in phase when they reach B.
1002 This is the basic reason why the discrete table, \subfigref{reflection-least-time}{2}, turned
1003 out to have a group of rays that all traveled nearly the same distance.
1004 m4_ifelse(__sn,1,[:
1005 <% marg(150) %>
1006 <%
1007   fig(
1008     'reflection-least-time',
1009     %q{Light could take many different paths from A to B.}
1010   )
1011 %>
1012 <% end_marg %>
1013 :])
1014
1015 As discussed in __subsection_or_section(least-time-reflection), the principle of least time is
1016 really a principle of least \emph{or greatest} time. This
1017 makes perfect sense, since $\der L/\der \theta =0$ can in general
1018 describe either a minimum or a maximum
1019
1020 The principle of least time is very general. It does not
1021 apply just to refraction and reflection --- it can even be
1022 used to prove that light rays travel in a straight line
1023 through empty space, without taking detours! This general
1024 approach to wave motion was used by Richard Feynman, one of
1025 the pioneers who in the 1950's reconciled quantum mechanics
1026 with relativity. A very readable explanation is
1027 given in a book Feynman wrote for laypeople, QED: The
1028 Strange Theory of Light and Matter.
1029
1030 <% end_sec() %>