<?xml version="1.0" encoding="utf-8"?><?xml-stylesheet type="text/xsl" href="https://iancze.github.io/xml/base.min.xml"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Ian Czekala's homepage on Czekala Group</title><link>https://iancze.github.io/</link><description>Recent content in Ian Czekala's homepage on Czekala Group</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Wed, 27 Sep 2023 00:00:00 +0000</lastBuildDate><atom:link href="https://iancze.github.io/feed.xml" rel="self" type="application/rss+xml"/><item><title>Syllabus</title><link>https://iancze.github.io/courses/as5003/syllabus/</link><pubDate>Fri, 25 Aug 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/syllabus/</guid><description>



&lt;h2 id="objective-radio-astronomy-and-interferometric-imaging"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#objective-radio-astronomy-and-interferometric-imaging"&gt;Objective: radio astronomy and interferometric imaging&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Interferometric astrophysical observations are a vast and deep topic. The goal of this module component is to help students develop a &lt;strong&gt;practical understanding&lt;/strong&gt; of how radio interferometers (in particular arrays like the VLA or ALMA) &lt;strong&gt;observe an astrophysical source&lt;/strong&gt;, build a &lt;strong&gt;mathematical foundation&lt;/strong&gt; for working with &lt;strong&gt;complex-valued, Fourier-plane data&lt;/strong&gt;, and survey some of the many approaches that are used to &lt;strong&gt;investigate astrophysical phenomena&lt;/strong&gt;, focusing on &lt;strong&gt;forward-modeling&lt;/strong&gt; and &lt;strong&gt;regularised maximum likelihood&lt;/strong&gt; techniques.&lt;/p&gt;




&lt;h2 id="instructor"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#instructor"&gt;Instructor&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Dr. Ian Czekala (he/him/his)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Email&lt;/em&gt;: &lt;a href="mailto:ic95@st-andrews.ac.uk"&gt;ic95@st-andrews.ac.uk&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Office hours by appointment. J.F. Allen building, Rm. 308.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;If you are in any way feeling ill or suspect you might have been contact with an individual infected with COVID, &lt;em&gt;please&lt;/em&gt; stay home and seek medical care if necessary.&lt;/strong&gt; We plan on posting all lecture notes, and we will work with you to provide you with the course materials you need.&lt;/p&gt;




&lt;h2 id="course-grade"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#course-grade"&gt;Course Grade&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The course grade (100%) will be assessed via a 2 hour written examination.&lt;/p&gt;




&lt;h2 id="tutorials"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#tutorials"&gt;Tutorials&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;This module component will have 2 tutorial sessions, as listed in the &lt;a href="https://iancze.github.io/courses/as5003/schedule/"&gt;schedule&lt;/a&gt;. These in-class, interactive sessions will be used to build practical understanding of the module material and apply your knowledge to solving the types of problems frequently encountered in radio astronomy research.&lt;/p&gt;
&lt;p&gt;Tutorial problems will be distributed via Moodle approximately one week in advance. You are encouraged to collaborate and work through the tutorial problems together, in advance, with other members of the module. However, each student should be prepared to discuss their answers on their own during the tutorial session.&lt;/p&gt;




&lt;h2 id="programming"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#programming"&gt;Programming&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Some tutorial problems may require a small amount of programming. Students are encouraged to use whatever programming language they are most comfortable with. IC is most familiar with Python and Julia, so he will be more able to assist you with those.&lt;/p&gt;




&lt;h2 id="reference-materials"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#reference-materials"&gt;Reference Materials&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are many additional resources that will be helpful during this course (and beyond) and will be called out in the course at the appropriate juncture. Many of these resources are freely available online or through the &lt;a href="https://www.st-andrews.ac.uk/library/"&gt;University library&lt;/a&gt;.&lt;/p&gt;




&lt;h3 id="textbooks"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#textbooks"&gt;Textbooks&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom (online resource)&lt;/li&gt;
&lt;li&gt;&lt;a href="https://encore.st-andrews.ac.uk/iii/encore/record/C__Rb2084444__Stools%20of%20radio%20astronomy__Orightresult__U__X7?lang=eng&amp;amp;suite=def"&gt;Tools of Radio Astronomy&lt;/a&gt; by Rohlfs and Wilson (ebook)&lt;/li&gt;
&lt;li&gt;&lt;a href="https://encore.st-andrews.ac.uk/iii/encore/record/C__Rb2485438__Sinterferometry%20and%20synthesis%20__Orightresult__U__X6?lang=eng&amp;amp;suite=def"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson (ebook)&lt;/li&gt;
&lt;li&gt;&lt;a href="https://encore.st-andrews.ac.uk/iii/encore/record/C__Rb1693763__SBracewell%2C%20Ronald%20N.__Orightresult__U__X2?lang=eng&amp;amp;suite=def"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="courses"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#courses"&gt;Courses&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;18 NRAO Synthesis Imaging School &lt;a href="https://web.cvent.com/event/b7f82cf3-7126-4a71-a88b-7a93c66a4dc7/websitePage:6bbf1462-9f9f-4204-bd93-36032f793bc4"&gt;slides and lectures&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="videos"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/syllabus/#videos"&gt;Videos&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://web.cvent.com/event/1ecded89-273f-4f8a-80e8-f5eb2d40d8c5/websitePage:4967d7b9-05fe-4b32-820d-1f1587665ee3"&gt;Cells to Galaxies Speaker Series Archive&lt;/a&gt;, in particular talks by &lt;a href="https://vimeo.com/639606652/a927105526"&gt;Urvashi Rao&lt;/a&gt; and &lt;a href="https://vimeo.com/612017530/9fc70df0ef"&gt;Sanjay Bhatnagar&lt;/a&gt; (opening ~15 minutes)&lt;/li&gt;
&lt;li&gt;Lectures by David Wilner &lt;a href="https://www.youtube.com/watch?v=0TwnZhiEc3A&amp;amp;ab_channel=AnitaChapter"&gt;Part I&lt;/a&gt; and &lt;a href="https://www.youtube.com/watch?v=mRUZ9eckHZg&amp;amp;ab_channel=AnitaChapter"&gt;Part II&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Syllabus</title><link>https://iancze.github.io/courses/astro589/syllabus/</link><pubDate>Tue, 21 Jun 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/syllabus/</guid><description>



&lt;h2 id="objective-radio-astronomy-and-interferometric-imaging"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#objective-radio-astronomy-and-interferometric-imaging"&gt;Objective: radio astronomy and interferometric imaging&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Interferometric astrophysical observations are a vast and deep topic. The goal of this course is to help students develop a &lt;strong&gt;practical understanding&lt;/strong&gt; of how interferometers (in particular arrays like the VLA or ALMA) &lt;strong&gt;observe an astrophysical source&lt;/strong&gt;, build a &lt;strong&gt;mathematical foundation&lt;/strong&gt; for working with &lt;strong&gt;complex-valued, Fourier-plane data&lt;/strong&gt;, and survey some of the many approaches that are used to &lt;strong&gt;investigate astrophysical phenomena&lt;/strong&gt;, focusing on &lt;strong&gt;forward-modeling&lt;/strong&gt; and &lt;strong&gt;regularized maximum likelihood&lt;/strong&gt; techniques.&lt;/p&gt;




&lt;h2 id="instructor"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#instructor"&gt;Instructor&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Professor (Dr.) Ian Czekala (he/him/his)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Email&lt;/em&gt;: &lt;a href="mailto:ipc5094@psu.edu"&gt;ipc5094@psu.edu&lt;/a&gt; or &lt;a href="mailto:iczekala@psu.edu"&gt;iczekala@psu.edu&lt;/a&gt; (alias).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Office hours by appointment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;If you are in any way feeling ill or suspect you might have been contact with an individual infected with COVID, &lt;em&gt;please&lt;/em&gt; stay home and seek medical care if necessary.&lt;/strong&gt; We plan on posting all lecture notes, and we will work with you to provide you with the course materials you need.&lt;/p&gt;




&lt;h2 id="course-calendar-and-closure-policies"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#course-calendar-and-closure-policies"&gt;Course Calendar and Closure Policies&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;ASTRO 589 meets &lt;strong&gt;once a week on Wednesdays from 10:10am to 11:00am ET (prompt) in Davey Lab Room 538&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;For full information on lecture dates and topics, see the &lt;a href="https://iancze.github.io/courses/astro589/schedule/"&gt;Course Schedule&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;If campus should be closed (e.g. for a weather-related event or COVID precautions), the instructor will provide instructions via email on course lecture format (possibly remote, keeping the same schedule) and examinations/assignments (due dates to be rescheduled &lt;em&gt;no earlier than 48 hours&lt;/em&gt; after closure announcement).&lt;/p&gt;




&lt;h2 id="assignments-and-project"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#assignments-and-project"&gt;Assignments and Project&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="course-grade"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#course-grade"&gt;Course Grade&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The course grade will be based on&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;problem sets (55%)&lt;/li&gt;
&lt;li&gt;group project and presentation (45%)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="problem-sets"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#problem-sets"&gt;Problem sets&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Problem sets will be assigned approximately every 2-3 weeks.&lt;/p&gt;
&lt;p&gt;Please turn in homeworks in &lt;code&gt;.PDF&lt;/code&gt; format to Ian by email at &lt;a href="mailto:ipc5094@psu.edu"&gt;ipc5094@psu.edu&lt;/a&gt; with the subject line &lt;code&gt;ASTRO 589 problem set&lt;/code&gt;. Handwritten solutions are fine, but please digitize your submission using a department scanner or a smartphone scanner app (e.g., &lt;a href="https://acrobat.adobe.com/us/en/acrobat/mobile/scanner-app.html"&gt;Adobe Scan&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Due dates will be set to correspond to the beginning of a class period (i.e., 10:10am).&lt;/p&gt;




&lt;h4 id="programming"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#programming"&gt;Programming&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;For problem sets, students are encouraged to use whatever programming language they are most comfortable with.&lt;/p&gt;
&lt;p&gt;Some project choices, especially those involving CASA, may require basic familiarity with the Python programming language.&lt;/p&gt;




&lt;h4 id="problem-set-late-policy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#problem-set-late-policy"&gt;Problem set late policy&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;The following percentages will be deducted from your score&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;one day late: 5%&lt;/li&gt;
&lt;li&gt;two days late: 10%&lt;/li&gt;
&lt;li&gt;three days late: 25%&lt;/li&gt;
&lt;li&gt;more than 72 hours late: no credit&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If extenuating circumstances arise such that you will be unable to complete the homework on time, please contact Ian &lt;em&gt;before&lt;/em&gt; the homework deadline and we can most likely arrange an extension.&lt;/p&gt;




&lt;h4 id="problem-set-collaboration-policy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#problem-set-collaboration-policy"&gt;Problem set collaboration policy&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;You are encouraged to collaborate and work through the problem sets together. However, each student must complete the final write up on their own, i.e., no problem set should be duplicated verbatim between students.&lt;/p&gt;




&lt;h3 id="group-project-and-presentation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#group-project-and-presentation"&gt;Group project and presentation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Students will form groups of 2-3 for their course projects. Groups will propose a project concept around some aspect of interferometry and discuss in detail at least one major astrophysical application. The project idea may be based of the list of example projects (below) or may be an original idea proposed by the group.&lt;/p&gt;
&lt;p&gt;The project will culminate in a presentation to the class. This presentation will be a comprehensive lecture on your topic, which should take 35 minutes of our class period, followed by 15 minutes of detailed questions and answers. Students may submit any additional materials beyond their presentation for review, if so desired; however, no final report is required.&lt;/p&gt;
&lt;p&gt;Good presentations will:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Provide a cogent astrophysical and technical introduction&lt;/li&gt;
&lt;li&gt;Connect to course material covered (if applicable)&lt;/li&gt;
&lt;li&gt;Introduce the key supporting background material, including any relevant equations or seminal figures from related works&lt;/li&gt;
&lt;li&gt;Copious citations to and explanations of relevant literature&lt;/li&gt;
&lt;li&gt;Explain the key observational or theoretical instruments/methodologies (as applicable)&lt;/li&gt;
&lt;li&gt;Identify at least one major astrophysical application and discuss in detail&lt;/li&gt;
&lt;li&gt;Contain some new aspect of technical development or investigation by the group&lt;/li&gt;
&lt;li&gt;Finish the presentation within the allotted time (35 +/- 5 minutes)&lt;/li&gt;
&lt;li&gt;Adequately answer questions raised by the instructor and students&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Each group should provide the instructor with their proposed group by &lt;strong&gt;Wednesday, September 21st&lt;/strong&gt;. The instructor will then assign one presentation date to each group. You should notify the instructor of your project topic at least two weeks prior to your presentation date to confirm that it is an acceptable choice.&lt;/p&gt;
&lt;p&gt;It is a wise idea to practice your presentation from start to finish &amp;ldquo;live&amp;rdquo; to make sure your timing is correct. Groups whose talks are wildly under/over time will find it difficult to achieve full credit.&lt;/p&gt;




&lt;h4 id="example-project-ideas"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#example-project-ideas"&gt;Example Project Ideas&lt;/a&gt;
&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Cross-validation strategies for imaging (including for RML)&lt;/li&gt;
&lt;li&gt;How do MRI imagers work, and what are their fundamental relationships to radio interferometers?&lt;/li&gt;
&lt;li&gt;Optical interferometers and science results&lt;/li&gt;
&lt;li&gt;Download a dataset from the ALMA archive, image with CLEAN or RML techniques&lt;/li&gt;
&lt;li&gt;Calibration techniques for ALMA and EHT&lt;/li&gt;
&lt;li&gt;Self-calibration theory and application with CASA&lt;/li&gt;
&lt;li&gt;Spectral line capabilities of ALMA, image a molecular line from ALMA archive&lt;/li&gt;
&lt;li&gt;Wavelets: what are they, how have they been used in radio astronomy applications&lt;/li&gt;
&lt;li&gt;Design new ALMA observations using CASA/simobserve&lt;/li&gt;
&lt;li&gt;Expanding MPoL to do &lt;a href="https://github.com/MPoL-dev/MPoL/issues/33"&gt;model fitting with Pyro&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Expanding MPoL for &lt;a href="https://github.com/MPoL-dev/MPoL/issues/77"&gt;fake data generation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Expanding MPoL to use &lt;a href="https://github.com/MPoL-dev/MPoL/issues/17"&gt;nufft for gridding&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Expanding MPoL to do &lt;a href="https://github.com/MPoL-dev/MPoL/issues/24"&gt;self-calibration&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Expanding MPoL to do &lt;a href="https://github.com/MPoL-dev/MPoL/issues/23"&gt;primary beam corrections&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Wide-field imaging: what changes about the imaging assumptions, applications&lt;/li&gt;
&lt;li&gt;Briggs weighting: (introduction, Briggs&amp;rsquo;s thesis, application w/ CASA)&lt;/li&gt;
&lt;li&gt;Time and bandwidth smearing: theory, demonstrations with CASA and real ALMA data&lt;/li&gt;
&lt;li&gt;Non-parametric techniques for protoplanetary surface brightness profiles, including &lt;a href="https://ui.adsabs.harvard.edu/abs/2020MNRAS.495.3209J/abstract"&gt;frank&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Beam-forming and applications&lt;/li&gt;
&lt;/ul&gt;




&lt;h4 id="a-note-about-scope"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#a-note-about-scope"&gt;A note about scope&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;Radio astronomy is a vast and deep topic&amp;mdash;some of these project topics are considerable in scope and could constitute Ph.D. theses (some have). The primary purpose of the group project is to &lt;em&gt;educate yourself and your classmates on an advanced radio astronomy and/or imaging topic&lt;/em&gt;. To that end, I encourage you to focus most of your time on developing a solid understanding of the concept and designing a pedagogical presentation that clearly develops and introduces core practical and theoretical components.&lt;/p&gt;
&lt;p&gt;For example, if you were to choose to cover time and bandwidth smearing, I would expect at least half and potentially 2/3 of your presentation to be pedagogical, i.e., extending the mathematical formalism we discussed in class to arrive at why time and bandwidth smearing occur in an interferometric system, highlighting and presenting classical treatments on the topic from our class textbooks, and presenting an astrophysical observation (i.e., published journal article) that discussed their approach to mitigating these effects. You may also choose to cover historical development of the topic, if relevant.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;technical&amp;rdquo; component of the presentation is designed to be an opportunity for you to practically apply your knowledge, either by application to a real dataset or development of a new algorithm (e.g., for MPoL). It is meant as a &lt;em&gt;supplement&lt;/em&gt; to the aforementioned pedagogical treatment, not a substitute for it. Keeping with the time/bandwidth smearing example, an appropriate technical component to the project would be to download a dataset from the ALMA archive and use CASA to synthesize a range of images after various amounts of time and/or bandwidth averaging have been applied, and discuss if/how time or bandwidth smearing has compromised the image quality.&lt;/p&gt;
&lt;p&gt;Presentations that do not devote proper coverage to introductory/pedagogical material will find it difficult to achieve a satisfactory grade&amp;mdash;even those that have an impressive technical application. Conversely, group projects that fail to write a single line of new code but deliver a quality pedagogical component would likely still receive a passing grade.&lt;/p&gt;
&lt;p&gt;I strongly encourage all groups to keep in contact with me as you are developing your project ideas, so that I may provide feedback on scope.&lt;/p&gt;




&lt;h2 id="reference-materials"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#reference-materials"&gt;Reference Materials&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are many additional resources that will be helpful during this course (and beyond) and will be called out in the course at the appropriate juncture. Many of these resources are freely available online or through the &lt;a href="https://libraries.psu.edu/"&gt;University library&lt;/a&gt;.&lt;/p&gt;




&lt;h3 id="textbooks"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#textbooks"&gt;Textbooks&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2625688"&gt;Tools of Radio Astronomy&lt;/a&gt; by Rohlfs and Wilson&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/34517505"&gt;Fourier Analysis and Imaging&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="courses"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#courses"&gt;Courses&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;18 NRAO Synthesis Imaging School &lt;a href="https://web.cvent.com/event/b7f82cf3-7126-4a71-a88b-7a93c66a4dc7/websitePage:6bbf1462-9f9f-4204-bd93-36032f793bc4"&gt;slides and lectures&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="videos"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#videos"&gt;Videos&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://web.cvent.com/event/1ecded89-273f-4f8a-80e8-f5eb2d40d8c5/websitePage:4967d7b9-05fe-4b32-820d-1f1587665ee3"&gt;Cells to Galaxies Speaker Series Archive&lt;/a&gt;, in particular talks by &lt;a href="https://vimeo.com/639606652/a927105526"&gt;Urvashi Rao&lt;/a&gt; and &lt;a href="https://vimeo.com/612017530/9fc70df0ef"&gt;Sanjay Bhatnagar&lt;/a&gt; (opening ~15 minutes)&lt;/li&gt;
&lt;li&gt;Lectures by David Wilner &lt;a href="https://www.youtube.com/watch?v=0TwnZhiEc3A&amp;amp;ab_channel=AnitaChapter"&gt;Part I&lt;/a&gt; and &lt;a href="https://www.youtube.com/watch?v=mRUZ9eckHZg&amp;amp;ab_channel=AnitaChapter"&gt;Part II&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="masking-policy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#masking-policy"&gt;Masking Policy&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We encourage you to get COVID-19 vaccinated and follow University policies on masking, especially in indoor spaces. In consultation with background rates in Centre County, masks may be required part or all of the fall semester. Please consult &lt;a href="https://virusinfo.psu.edu/"&gt;PSU VirusInfo&lt;/a&gt; for the latest policy.&lt;/p&gt;




&lt;h2 id="academic-integrity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#academic-integrity"&gt;Academic Integrity&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Academic integrity is the pursuit of scholarly activity in an open, honest and responsible manner. Academic integrity is a basic guiding principle for all academic activity at The Pennsylvania State University, and all members of the University community are expected to act in accordance with this principle. Consistent with this expectation, the University’s Code of Conduct states that all students should act with personal integrity, respect other students’ dignity, rights and property, and help create and maintain an environment in which all can succeed through the fruits of their efforts.&lt;/p&gt;
&lt;p&gt;Academic integrity includes a commitment by all members of the University community not to engage in or tolerate acts of falsification, misrepresentation or deception. Such acts of dishonesty violate the fundamental ethical principles of the University community and compromise the worth of work completed by others.&lt;/p&gt;




&lt;h2 id="disability-accommodation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#disability-accommodation"&gt;Disability Accommodation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Penn State welcomes students with disabilities into the University’s educational programs. Student Disability Resources (SDR) website provides contact information for every Penn State campus (Links to an external site.). For further information, please visit &lt;a href="http://equity.psu.edu/student-disability-resources/"&gt;Student Disability Resources website&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;In order to receive consideration for reasonable accommodations, you must contact the appropriate disability services office at the campus where you are officially enrolled, participate in an intake interview, and provide documentation: See &lt;a href="http://equity.psu.edu/student-disability-resources/applying-for-services/documentation-guidelines"&gt;documentation guidelines&lt;/a&gt;. If the documentation supports your request for reasonable accommodations, your campus disability services office will provide you with an accommodation letter. Please share this letter with your instructors and discuss the accommodations with them as early as possible. You must follow this process for every semester that you request accommodations.&lt;/p&gt;




&lt;h2 id="counseling-and-psychological-services"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#counseling-and-psychological-services"&gt;Counseling and Psychological Services&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Many students at Penn State face personal challenges or have psychological needs that may interfere with their academic progress, social development, or emotional wellbeing. The university offers a variety of confidential services to help you through difficult times, including individual and group counseling, crisis intervention, consultations, online chats, and mental health screenings. These services are provided by staff who welcome all students and embrace a philosophy respectful of clients’ cultural and religious backgrounds, and sensitive to differences in race, ability, gender identity and sexual orientation.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Counseling and Psychological Services at University Park &lt;a href="http://studentaffairs.psu.edu/counseling/"&gt;CAPS&lt;/a&gt;: 814-863-0395&lt;/li&gt;
&lt;li&gt;&lt;a href="https://senate.psu.edu/faculty/counseling-services-at-commonwealth-campuses/"&gt;Counseling and Psychological Services at Commonwealth Campuses&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Penn State Crisis Line (24 hours/7 days/week): 877-229-6400&lt;/li&gt;
&lt;li&gt;Crisis Text Line (24 hours/7 days/week): Text LIONS to 741741&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="educational-equity-and-reporting-bias"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#educational-equity-and-reporting-bias"&gt;Educational Equity and Reporting Bias&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Penn State takes great pride to foster a diverse and inclusive environment for students, faculty, and staff. Acts of intolerance, discrimination, or harassment due to age, ancestry, color, disability, gender, gender identity, national origin, race, religious belief, sexual orientation, or veteran status are not tolerated and can be reported through Educational Equity via the &lt;a href="http://equity.psu.edu/reportbias/"&gt;Report Bias webpage&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Whom should I contact if I need additional assistance?&lt;/em&gt;
I encourage you to be in contact with your academic adviser for specific needs you might have outside this course. Academic adviser information and scheduling can be found at &lt;a href="https://sites.psu.edu/starfishinfo/"&gt;https://sites.psu.edu/starfishinfo/&lt;/a&gt;. There are also additional resources available at &lt;a href="https://keeplearning.psu.edu/student-support/"&gt;https://keeplearning.psu.edu/student-support/&lt;/a&gt;&lt;/p&gt;




&lt;h3 id="counseling--psychological-services-caps"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#counseling--psychological-services-caps"&gt;Counseling &amp;amp; Psychological Services (CAPS)&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://studentaffairs.psu.edu/caps-contact-form"&gt;https://studentaffairs.psu.edu/caps-contact-form&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;CAPS Phone: (814) 863-0395&lt;/li&gt;
&lt;li&gt;Penn State Crisis Line 1-877-229-6400&lt;/li&gt;
&lt;li&gt;Student Care and Advocacy: Email: &lt;a href="mailto:StudentCare@psu.edu"&gt;StudentCare@psu.edu&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Share a Concern: &lt;a href="https://cm.maxient.com/reportingform.php?PennState&amp;amp;layout_id=14"&gt;Share a Concern Form&lt;/a&gt; Phone: 814-863-2020 (voicemail)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="code-of-mutual-respect-and-cooperation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/syllabus/#code-of-mutual-respect-and-cooperation"&gt;Code of Mutual Respect and Cooperation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The Eberly College of Science (ECoS) &lt;a href="https://science.psu.edu/climate-and-diversity/code-mutual-respect-and-cooperation"&gt;Code of Mutual Respect and Cooperation&lt;/a&gt; embodies the values that we hope
faculty, staff, and students possess and will endorse to make ECoS a place where every individual feels respected and valued, as well as challenged and rewarded. Please review these principles, linked &lt;a href="https://science.psu.edu/climate-and-diversity/code-mutual-respect-and-cooperation"&gt;here&lt;/a&gt;.&lt;/p&gt;</description></item><item><title>Syllabus</title><link>https://iancze.github.io/courses/astro542/syllabus/</link><pubDate>Mon, 21 Jun 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/syllabus/</guid><description>



&lt;h2 id="instructor"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#instructor"&gt;Instructor&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Professor (Dr.) Ian Czekala (he/him/his)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Email&lt;/em&gt;: &lt;a href="mailto:ipc5094@psu.edu"&gt;ipc5094@psu.edu&lt;/a&gt; or &lt;a href="mailto:iczekala@psu.edu"&gt;iczekala@psu.edu&lt;/a&gt; (alias).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Office hours by appointment (remote only).&lt;/p&gt;




&lt;h2 id="objectives"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#objectives"&gt;Objectives&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The interstellar medium (ISM) is the space between the stars. This space is far from empty, however, and consists of large reservoirs of atomic and molecular gas and solids in the form of &amp;ldquo;dust&amp;rdquo; that mediate or contribute to many important astrophysical processes such as star formation, planet formation, stellar feedback, and stellar death/supernovae. Considered in aggregate, these processes are important on galactic- (and even intergalactic-) scales, influencing the distribution of giant molecular clouds and stellar populations. As such, Astro 542 is at once a course about &lt;em&gt;nothing&lt;/em&gt; and a course about &lt;em&gt;everything&lt;/em&gt;. We will learn the astrophysical processes that govern the interactions between the ISM and its many astrophysical interfaces. Students will also learn telescope proposal writing strategies, prepare a mock ALMA proposal, and simulate the dual-anonymous distributed peer review process.&lt;/p&gt;




&lt;h2 id="format"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#format"&gt;Format&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;ASTRO 542 meets three times a week on Monday, Wednesday, and Friday on Zoom in a &lt;em&gt;remote synchronous&lt;/em&gt; format from 9:05am to 9:55am ET (prompt). &lt;strong&gt;If you are in any way feeling ill or suspect you might have been contact with an individual infected with COVID, &lt;em&gt;please&lt;/em&gt; stay home and seek medical care.&lt;/strong&gt; We plan on recording and posting all lectures, and we will work with you to provide you with the course materials you need.&lt;/p&gt;




&lt;h3 id="masking-policy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#masking-policy"&gt;Masking Policy&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Though this course is remote synchronous, we encourage you to get vaccinated and follow University policies on masking, especially in indoor spaces.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;University policy&lt;/em&gt;: Penn State University requires everyone to wear a face mask in all university buildings, including classrooms, regardless of vaccination status. ALL STUDENTS MUST wear a mask appropriately (i.e., covering both your mouth and nose) while you are indoors on campus. This is to protect your health and safety as well as the health and safety of your classmates, instructor, and the university community. Anyone attending class without a mask will be asked to put one on or leave. Instructors may end class if anyone present refuses to appropriately wear a mask for the duration of class. Students who refuse to wear masks appropriately may face disciplinary action for Code of Conduct violations. If you feel you cannot wear a mask during class, please speak with your adviser immediately about your options for altering your schedule.&lt;/p&gt;




&lt;h2 id="textbook"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#textbook"&gt;Textbook&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;This course has one required textbook:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Title&lt;/strong&gt;: &lt;em&gt;Physics of the Interstellar and Intergalactic Medium&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Author&lt;/strong&gt;: Bruce Draine&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;ISBN-13&lt;/strong&gt; &lt;code&gt;978-0-691-12213-7&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It is available through through textbook sellers online and will be available in the PSU bookstore. We recommended that you secure access to this textbook for readings and reference during the course. The PAMS library (201 Davey Lab) also has a copy on course reserve.&lt;/p&gt;




&lt;h2 id="additional-reference-materials"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#additional-reference-materials"&gt;Additional Reference Materials&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are many additional resources that will be helpful during this course (and beyond) and will be called out in the course at the appropriate juncture. Many of these resources are freely available online or through the &lt;a href="https://libraries.psu.edu/"&gt;University library&lt;/a&gt;.&lt;/p&gt;




&lt;h3 id="course-materials-and-lecture-notes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#course-materials-and-lecture-notes"&gt;Course materials and lecture notes&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://arxiv.org/abs/1511.03457"&gt;Notes on Star Formation&lt;/a&gt; by Mark Krumholz&lt;/li&gt;
&lt;li&gt;Lecture notes on &lt;a href="https://www.ita.uni-heidelberg.de/~dullemond/lectures/radtrans_2012/index.shtml"&gt;Radiative Transfer in Astrophysics&lt;/a&gt; by C.P. Dullemond.&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="review-articles"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#review-articles"&gt;Review Articles&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2021PhR...893....1O/abstract"&gt;Astrochemistry and Compositions of Planetary Systems&lt;/a&gt; by Karin Oberg and Ted Bergin, Physics Reports 2021&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2013ARA%26A..51..269D/abstract"&gt;Stellar Multiplicity&lt;/a&gt; by Gaspard Duchene and Adam Kraus, ARA&amp;amp;A 2013&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2020ARA%26A..58..483A/abstract"&gt;Observations of Protoplanetary Disk Structures&lt;/a&gt; by Sean Andrews, ARA&amp;amp;A 2020&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2011ARA%26A..49..195A/abstract"&gt;Dynamics of Protoplanetary Disks&lt;/a&gt; by Phil Armitage, ARA&amp;amp;A 2011&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="textbooks"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#textbooks"&gt;Textbooks&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Interstellar and Intergalactic Medium&lt;/em&gt; by Barbara Ryden and Richard W. Pogge, The Ohio State Astrophysics Series, 2021&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Physics and Chemistry of the Interstellar Medium&lt;/em&gt; by Sun Kwok, University Science Books, 2007&lt;/li&gt;
&lt;li&gt;&lt;em&gt;The Origin of Stars&lt;/em&gt; by Michael D. Smith, Imperial College Press, 2004&lt;/li&gt;
&lt;li&gt;&lt;em&gt;The Physics and Chemistry of the Interstellar Medium&lt;/em&gt; by A.G.G.M Tielens, Cambridge University Press, 2005&lt;/li&gt;
&lt;li&gt;&lt;em&gt;The Formation of Stars&lt;/em&gt; by Steven W. Stahler and Francesco Palla, Wiley-VCH, 2004&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Protostars and Planets&lt;/em&gt; V, B. Reipurth, D. Jewitt, and K. Keil (eds.), University of Arizona Press, 2007 &lt;a href="http://adsabs.harvard.edu/cgi-bin/nph-abs_connect?bibcode=2007prpl.conf&amp;amp;db_key=ALL&amp;amp;sort=BIBCODE&amp;amp;nr_to_return=500&amp;amp;data_and=YES&amp;amp;toc_link=YES"&gt;Chapters available here&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="assignments-and-exams"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#assignments-and-exams"&gt;Assignments and Exams&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="course-grade"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#course-grade"&gt;Course Grade&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The course grade will be based on&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;problem sets (20%)&lt;/li&gt;
&lt;li&gt;a paper presentation (15%)&lt;/li&gt;
&lt;li&gt;three mid-term exams (40%)&lt;/li&gt;
&lt;li&gt;reviews from our mock ALMA TAC (5%)&lt;/li&gt;
&lt;li&gt;an ALMA proposal (20%)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="problem-sets"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#problem-sets"&gt;Problem sets&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Problem sets will be assigned approximately every 3-4 weeks.&lt;/p&gt;
&lt;p&gt;Please turn in homeworks in &lt;code&gt;.PDF&lt;/code&gt; format to Ian by email at &lt;a href="mailto:ipc5094@psu.edu"&gt;ipc5094@psu.edu&lt;/a&gt; with the subject line &lt;code&gt;ASTRO 542 problem set&lt;/code&gt;. Handwritten solutions are fine, but please digitize your submission using a department scanner or a smartphone scanner app (e.g., &lt;a href="https://acrobat.adobe.com/us/en/acrobat/mobile/scanner-app.html"&gt;Adobe Scan&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Due dates will be set to correspond to the beginning of a class period (i.e., 9:05am).&lt;/p&gt;




&lt;h4 id="problem-set-late-policy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#problem-set-late-policy"&gt;Problem set late policy&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;The following percentages will be deducted from your score&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;one day late: 5%&lt;/li&gt;
&lt;li&gt;two days late: 10%&lt;/li&gt;
&lt;li&gt;three days late: 25%&lt;/li&gt;
&lt;li&gt;more than 72 hours late: no credit&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If extenuating circumstances arise such that you will be unable to complete the homework on time, please contact Ian &lt;em&gt;before&lt;/em&gt; the homework deadline and we can most likely arrange an extension.&lt;/p&gt;




&lt;h4 id="problem-set-collaboration-policy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#problem-set-collaboration-policy"&gt;Problem set collaboration policy&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;You are welcome to collaborate and work through the problem sets. However, each student must complete the final write up on their own, i.e., no problem set should be duplicated verbatim between students.&lt;/p&gt;
&lt;p&gt;It should go without saying that students may not collaborate on exams.&lt;/p&gt;




&lt;h3 id="paper-presentation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#paper-presentation"&gt;Paper presentation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Each student will give one 15 minute presentation with 5 minutes of questions (equivalent to journal club talk) on an article from the last 1-2 years. Please select an article submitted to the Astrophysical Journal, Astronomical Journal, MNRAS, or A&amp;amp;A that has &lt;em&gt;completed the referee process&lt;/em&gt; (i.e., is available from the journal itself, or is labeled &amp;ldquo;accepted&amp;rdquo; on the arXiv). Chosen articles should fall under the subjects that have been covered by the intervening topics since the previous student talk. To select a paper, you can search for papers in &lt;a href="https://ui.adsabs.harvard.edu/#"&gt;ADS&lt;/a&gt; based in terms (or combinations of them) appearing in titles or abstracts (e.g., extinction, molecular cloud).&lt;/p&gt;
&lt;p&gt;Each student should provide the instructor with the two dates that they prefer by Monday, August 30th. The instructor will then assign one date to each student. You should notify the instructor of your paper selection at least one week prior to your presentation date to confirm that it is an acceptable choice.&lt;/p&gt;
&lt;p&gt;Good talks will&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Provide a cogent introduction to the subfield of the paper&lt;/li&gt;
&lt;li&gt;Introduce the key supporting background material, including any relevant equations or seminal figures from related works&lt;/li&gt;
&lt;li&gt;Explain the key observational or theoretical instruments/methodologies (as applicable)&lt;/li&gt;
&lt;li&gt;Discuss the scientific findings of the paper, and their implications for the broader astrophysical subfield&lt;/li&gt;
&lt;li&gt;Finish the presentation within the allotted time (15 +/- 2 minutes)&lt;/li&gt;
&lt;li&gt;Adequately answer questions raised by the instructor and students&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It is a wise idea to practice your presentation from start to finish &amp;ldquo;live&amp;rdquo; to make sure your timing is correct. Students whose talks are wildly under/over time will find it difficult to achieve full credit.&lt;/p&gt;




&lt;h3 id="midterm-exams"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#midterm-exams"&gt;Midterm Exams&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;There will be three midterm exams throughout the semester. Each exam is designed to assess your understanding of the topics covered in the previous ~4 weeks of lectures. Each exam will be scheduled during a class period (see the &lt;a href="https://iancze.github.io/courses/astro542/schedule/"&gt;Course Schedule&lt;/a&gt; for the precise date).&lt;/p&gt;
&lt;p&gt;Exam makeup: If you are unable to attend class on the date of an exam, please contact the instructor &lt;em&gt;before the exam date&lt;/em&gt; to schedule a makeup exam.&lt;/p&gt;
&lt;p&gt;There will be no final exam (the ALMA proposal + review plays this role).&lt;/p&gt;




&lt;h3 id="alma-mock-tac-and-proposal"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#alma-mock-tac-and-proposal"&gt;ALMA mock TAC and proposal&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Throughout the course, students will learn general telescope and grant proposal writing strategies, as well as strategies specific to the ALMA observatory. We will spend a few class periods describing the capabilities of the ALMA observatory, exploring how to write an ALMA proposal, and engaging in a mock TAC panel discussion using previously submitted (successful and unsuccessful) ALMA proposals.&lt;/p&gt;




&lt;h4 id="tac-reviews"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#tac-reviews"&gt;TAC reviews&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;Our class will simulate the dual-anonymous peer review process in a distributed fashion. In the actual ALMA review, for every proposal that the P.I. submits, they are sent (electronically) 10 proposals to review. Our review process will work as follows:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You will receive approximately 5 proposals related to course topics we have covered.&lt;/li&gt;
&lt;li&gt;You are responsible for providing a written report describing (at minimum) the strengths and weaknesses of each proposal, at least two paragraphs in length. The ALMA guidelines for reviewers are available &lt;a href="https://almascience.eso.org/proposing/alma-proposal-review/guidelines-for-reviewers"&gt;here&lt;/a&gt;. We also recommend reviewing the &lt;a href="https://jwst-docs.stsci.edu/jwst-opportunities-and-policies/james-webb-space-telescope-science-policies-group-and-review-information/panel-reports"&gt;JWST cycle 1&lt;/a&gt; review criteria, which are similar and provide another data point for &amp;ldquo;what makes a good review.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;You are responsible for evaluating the scientific merit of the proposal with a numerical score between 1 and 5 (1 being excellent, 5 being unsatisfactory).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The TAC reviews will be worth 5% of your final grade.&lt;/p&gt;




&lt;h4 id="alma-proposal"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#alma-proposal"&gt;ALMA proposal&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;As a final course project, students will be responsible for preparing their own ALMA observing proposal using ALMA Cycle 8 capabilities and materials as a baseline (worth 20% of your final grade). Proposals will be due during the final examination period.&lt;/p&gt;
&lt;p&gt;An overview of the ALMA proposal process is described in the &lt;a href="https://almascience.nrao.edu/proposing/proposers-guide"&gt;ALMA Cycle 8 Proposer&amp;rsquo;s Guide&lt;/a&gt;. If you have any questions about preparing an ALMA proposal, answers can most likely be found in this document.&lt;/p&gt;
&lt;p&gt;The following are some sections of the Proposer&amp;rsquo;s Guide that we recommend reading to understand this course project&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://almascience.nrao.edu/proposing/proposers-guide#section-4"&gt;1.2.1 Dual anonymous review&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://almascience.nrao.edu/proposing/proposers-guide#section-5"&gt;1.2.2 Distributed peer review&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://almascience.nrao.edu/proposing/proposers-guide#section-22"&gt;4. Proposal Planning&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://almascience.nrao.edu/proposing/proposers-guide#section-40"&gt;5. Proposal Preparation and submission&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The assignment will consist of preparing a Scientific Justification (&lt;a href="https://almascience.nrao.edu/proposing/proposers-guide#section-43"&gt;Section 5.3&lt;/a&gt;) PDF &lt;em&gt;only&lt;/em&gt;. You may need to install the observing tool (OT) for help with your resolution/sensitivity calculations, but your submitted assignment does not need a cover sheet, technical justification, or abstract.&lt;/p&gt;
&lt;p&gt;We will grade your proposal following the spirit of the actual ALMA review critera. Specifically, your Scientific Justification should explicitly include:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;a title&lt;/li&gt;
&lt;li&gt;a discussion of the overall scientific merit of the proposed investigations and their potential contribution to the advancement of scientific knowledge&lt;/li&gt;
&lt;li&gt;a clear description of the proposed observations. I.e.,
&lt;ul&gt;
&lt;li&gt;what targets will be observed&lt;/li&gt;
&lt;li&gt;at what angular resolution&lt;/li&gt;
&lt;li&gt;at what frequency (including continuum/spectral line)&lt;/li&gt;
&lt;li&gt;to what sensitivity (Jy for point sources, and Jy/beam or Jy/arcsec² for resolved sources)&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;a robust data analysis plan&lt;/li&gt;
&lt;li&gt;a discussion of why the capabilities of ALMA are required to carry out this science&lt;/li&gt;
&lt;li&gt;figures and tables supporting the proposal, as necessary&lt;/li&gt;
&lt;li&gt;references&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Your Scientific Justification should follow all rules for page limits and fonts, especially the requirements for 4 single-spaced pages (A4 or US letter size) in 12pt font. We suggest that you allow two pages for the science case and two pages for figures, tables, and references; however, you may allocate space as best you see fit.&lt;/p&gt;
&lt;p&gt;Your Scientific Justification should follow all guidelines for &lt;a href="https://almascience.nrao.edu/proposing/alma-proposal-review/dual-anonymous"&gt;dual-anonymous peer review&lt;/a&gt; such that your PDF &lt;em&gt;does not contain&lt;/em&gt; any information that could be easily used to identify the proposer.&lt;/p&gt;




&lt;h2 id="course-calendar-and-closure-policies"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#course-calendar-and-closure-policies"&gt;Course Calendar and Closure Policies&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;For full information, see the &lt;a href="https://iancze.github.io/courses/astro542/schedule/"&gt;Course Schedule&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;If campus should be closed (e.g. for a weather-related event or COVID precautions), the instructor will provide instructions via email on course lecture format (possibly remote, keeping the same schedule) and examinations/assignments (due dates to be rescheduled &lt;em&gt;no earlier than 48 hours&lt;/em&gt; after closure announcement).&lt;/p&gt;




&lt;h2 id="academic-integrity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#academic-integrity"&gt;Academic Integrity&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Academic integrity is the pursuit of scholarly activity in an open, honest and responsible manner. Academic integrity is a basic guiding principle for all academic activity at The Pennsylvania State University, and all members of the University community are expected to act in accordance with this principle. Consistent with this expectation, the University’s Code of Conduct states that all students should act with personal integrity, respect other students’ dignity, rights and property, and help create and maintain an environment in which all can succeed through the fruits of their efforts.&lt;/p&gt;
&lt;p&gt;Academic integrity includes a commitment by all members of the University community not to engage in or tolerate acts of falsification, misrepresentation or deception. Such acts of dishonesty violate the fundamental ethical principles of the University community and compromise the worth of work completed by others.&lt;/p&gt;




&lt;h2 id="disability-accommodation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#disability-accommodation"&gt;Disability Accommodation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Penn State welcomes students with disabilities into the University’s educational programs. Student Disability Resources (SDR) website provides contact information for every Penn State campus (Links to an external site.). For further information, please visit &lt;a href="http://equity.psu.edu/student-disability-resources/"&gt;Student Disability Resources website&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;In order to receive consideration for reasonable accommodations, you must contact the appropriate disability services office at the campus where you are officially enrolled, participate in an intake interview, and provide documentation: See &lt;a href="http://equity.psu.edu/student-disability-resources/applying-for-services/documentation-guidelines"&gt;documentation guidelines&lt;/a&gt;. If the documentation supports your request for reasonable accommodations, your campus disability services office will provide you with an accommodation letter. Please share this letter with your instructors and discuss the accommodations with them as early as possible. You must follow this process for every semester that you request accommodations.&lt;/p&gt;




&lt;h2 id="counseling-and-psychological-services"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#counseling-and-psychological-services"&gt;Counseling and Psychological Services&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Many students at Penn State face personal challenges or have psychological needs that may interfere with their academic progress, social development, or emotional wellbeing. The university offers a variety of confidential services to help you through difficult times, including individual and group counseling, crisis intervention, consultations, online chats, and mental health screenings. These services are provided by staff who welcome all students and embrace a philosophy respectful of clients’ cultural and religious backgrounds, and sensitive to differences in race, ability, gender identity and sexual orientation.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Counseling and Psychological Services at University Park &lt;a href="http://studentaffairs.psu.edu/counseling/"&gt;CAPS&lt;/a&gt;: 814-863-0395&lt;/li&gt;
&lt;li&gt;&lt;a href="https://senate.psu.edu/faculty/counseling-services-at-commonwealth-campuses/"&gt;Counseling and Psychological Services at Commonwealth Campuses&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Penn State Crisis Line (24 hours/7 days/week): 877-229-6400&lt;/li&gt;
&lt;li&gt;Crisis Text Line (24 hours/7 days/week): Text LIONS to 741741&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="educational-equity-and-reporting-bias"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#educational-equity-and-reporting-bias"&gt;Educational Equity and Reporting Bias&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Penn State takes great pride to foster a diverse and inclusive environment for students, faculty, and staff. Acts of intolerance, discrimination, or harassment due to age, ancestry, color, disability, gender, gender identity, national origin, race, religious belief, sexual orientation, or veteran status are not tolerated and can be reported through Educational Equity via the &lt;a href="http://equity.psu.edu/reportbias/"&gt;Report Bias webpage&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Whom should I contact if I need additional assistance?&lt;/em&gt;
I encourage you to be in contact with your academic adviser for specific needs you might have outside this course. Academic adviser information and scheduling can be found at &lt;a href="https://sites.psu.edu/starfishinfo/"&gt;https://sites.psu.edu/starfishinfo/&lt;/a&gt;. There are also additional resources available at &lt;a href="https://keeplearning.psu.edu/student-support/"&gt;https://keeplearning.psu.edu/student-support/&lt;/a&gt;&lt;/p&gt;




&lt;h3 id="counseling--psychological-services-caps"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#counseling--psychological-services-caps"&gt;Counseling &amp;amp; Psychological Services (CAPS)&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://studentaffairs.psu.edu/caps-contact-form"&gt;https://studentaffairs.psu.edu/caps-contact-form&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;CAPS Phone: (814) 863-0395&lt;/li&gt;
&lt;li&gt;Penn State Crisis Line 1-877-229-6400&lt;/li&gt;
&lt;li&gt;Student Care and Advocacy: Email: &lt;a href="mailto:StudentCare@psu.edu"&gt;StudentCare@psu.edu&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Share a Concern: &lt;a href="https://cm.maxient.com/reportingform.php?PennState&amp;amp;layout_id=14"&gt;Share a Concern Form&lt;/a&gt; Phone: 814-863-2020 (voicemail)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="code-of-mutual-respect-and-cooperation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/syllabus/#code-of-mutual-respect-and-cooperation"&gt;Code of Mutual Respect and Cooperation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The Eberly College of Science (ECoS) &lt;a href="https://science.psu.edu/climate-and-diversity/code-mutual-respect-and-cooperation"&gt;Code of Mutual Respect and Cooperation&lt;/a&gt; embodies the values that we hope
faculty, staff, and students possess and will endorse to make ECoS a place where every individual feels respected and valued, as well as challenged and rewarded. Please review these principles, linked &lt;a href="https://science.psu.edu/climate-and-diversity/code-mutual-respect-and-cooperation"&gt;here&lt;/a&gt;.&lt;/p&gt;</description></item><item><title>Class Schedule: Martinmas (Fall 2023)</title><link>https://iancze.github.io/courses/as5003/schedule/</link><pubDate>Fri, 25 Aug 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/schedule/</guid><description>&lt;p&gt;AS5003 meets three times a week. See the Moodle for class times and meeting locations.&lt;/p&gt;
&lt;p&gt;The following is the anticipated schedule for course lectures and activities. Lecture topics/dates may shift slightly depending on course progress.&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Week&lt;/th&gt;
 &lt;th&gt;Date&lt;/th&gt;
 &lt;th&gt;Activity&lt;/th&gt;
 &lt;th&gt;Topic&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;1&lt;/td&gt;
 &lt;td&gt;T 12 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Intro and Overview
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;1&lt;/td&gt;
 &lt;td&gt;W 13 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 The Fourier Transform: Analytical
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;1&lt;/td&gt;
 &lt;td&gt;F 15 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 The Fourier Transform: Numerical
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;2&lt;/td&gt;
 &lt;td&gt;T 19 Sep&lt;/td&gt;
 &lt;td&gt;Tutorial&lt;/td&gt;
 &lt;td&gt;The Fourier Transform and Digital Signal Processing&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;2&lt;/td&gt;
 &lt;td&gt;W 20 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Interferometry in Practice
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;2&lt;/td&gt;
 &lt;td&gt;F 22 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Making Images: PSFs, Gridding, and Dirty Images
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;3&lt;/td&gt;
 &lt;td&gt;T 26 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Image Plane Deconvolution (CLEAN)
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;3&lt;/td&gt;
 &lt;td&gt;W 27 Sep&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Regularized Maximum Likelihood (RML)
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;3&lt;/td&gt;
 &lt;td&gt;F 29 Sep&lt;/td&gt;
 &lt;td&gt;Tutorial&lt;/td&gt;
 &lt;td&gt;Imaging in Practice, Concept Review&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;4&lt;/td&gt;
 &lt;td&gt;T 3 Oct&lt;/td&gt;
 &lt;td&gt;Peer Study&lt;/td&gt;
 &lt;td&gt;&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;</description></item><item><title>Fall 2022 Class Schedule</title><link>https://iancze.github.io/courses/astro589/schedule/</link><pubDate>Tue, 21 Jun 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/schedule/</guid><description>&lt;p&gt;ASTRO 589 meets &lt;strong&gt;once a week on Wednesdays from 10:10am to 11:00am ET (prompt) in Davey Lab Room 538&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;Please be aware that we will not be meeting the week of the Thanksgiving Holiday (Nov 21 - 25), following PSU&amp;rsquo;s &lt;a href="https://www.registrar.psu.edu/academic-calendars/2022-23.cfm"&gt;Fall 2022 academic calendar&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The following is the anticipated schedule for course lectures and activities. Lecture topics/dates may shift slightly depending on course progress.&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Date&lt;/th&gt;
 &lt;th&gt;Activity&lt;/th&gt;
 &lt;th&gt;Topic&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Aug 24&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Intro and Overview
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Aug 31&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 The Fourier Transform I
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Sep 7&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 The Fourier Transform II
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Sep 14&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 The Fast Fourier Transform and Numerical Implementation
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Sep 21&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Interferometry in Practice
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Sep 28&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Making Images: PSFs, Gridding, and Dirty Images
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Oct 5&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Bayesian Inference and Model Fitting
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Oct 12&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Image Plane Deconvolution (CLEAN)
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Oct 19&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Regularized Maximum Likelihood (RML) I
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Oct 26&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Regularized Maximum Likelihood (RML) II
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Nov 2&lt;/td&gt;
 &lt;td&gt;Project presentation&lt;/td&gt;
 &lt;td&gt;TBD&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Nov 9&lt;/td&gt;
 &lt;td&gt;Project presentation&lt;/td&gt;
 &lt;td&gt;TBD&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Nov 16&lt;/td&gt;
 &lt;td&gt;Project presentation&lt;/td&gt;
 &lt;td&gt;TBD&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Nov 30&lt;/td&gt;
 &lt;td&gt;Project presentation&lt;/td&gt;
 &lt;td&gt;TBD&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Dec 7&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Correlation in ALMA data
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;</description></item><item><title>Fall 2021 Class Schedule</title><link>https://iancze.github.io/courses/astro542/schedule/</link><pubDate>Mon, 21 Jun 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/schedule/</guid><description>&lt;p&gt;ASTRO 542 meets three times a week (9:05am to 9:55am) on Monday, Wednesday, and Friday in a &lt;em&gt;remote synchronous&lt;/em&gt; format over Zoom. The Zoom link is provided to enrolled students via email for security purposes. If you would like to audit a course, please email the instructor.&lt;/p&gt;
&lt;p&gt;Please be aware that we will not be meeting on Labor Day (Sep 6) or over Thanksgiving Holiday (Nov 22 - 26), following PSU&amp;rsquo;s &lt;a href="https://www.registrar.psu.edu/academic-calendars/2021-22.cfm"&gt;Fall 2021 academic calendar&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The following is the anticipated schedule for course lectures and activities. Lecture topics/dates may shift slightly depending on course progress. Exam and presentation dates will remain fixed.&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Date&lt;/th&gt;
 &lt;th&gt;Activity&lt;/th&gt;
 &lt;th&gt;Topic&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Aug 23&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/&gt;Introduction and Course Overview&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Aug 25&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/&gt;Collisional Processes&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Aug 27&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/03-stat-mech/&gt;Statistical Mechanics and Thermodynamic Equilibrium&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Aug 30&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/&gt;Energy Levels of Atoms, Ions, and Molecules&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Sep 1&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/&gt;Spontaneous Emission, Stimulated Emission, and Absorption&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Sep 3&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/&gt;Radiative Transfer&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Sep 8&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/&gt;H I 21-cm Emission and Absorption&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Sep 10&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/&gt;Absorption Lines and Curve of Growth&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Sep 13&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/&gt;Emission and Absorption by a Thermal Plasma&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Sep 15&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/&gt;Propagation of Radio Waves through the ISM&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Sep 17&lt;/td&gt;
 &lt;td&gt;Presentation by Laura Duffy (#1)&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2019MNRAS.485L..93R/abstract"&gt;H I 21 cm mapping of the host galaxy of AT2018cow: a fast-evolving luminous transient within a ring of high column density gas&lt;/a&gt;&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Sep 20&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/&gt;Introduction to Radio Astronomy and ALMA&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Sep 22&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;Exam 1&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Lectures up to Sep 13 (incl)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Sep 24&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/&gt;Interstellar Radiation Fields, Ionization Processes, and Recombination of Ions with Electrons&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Sep 27&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/&gt;Photoionized Gas (HII regions)&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Sep 29&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/&gt;Ionization in Predominantly Neutral Regions (cool &amp;#43; warm HI regions)&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Oct 1&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/&gt;Collisional Excitation&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Oct 4&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/&gt;Nebular Diagnostics&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Oct 6&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/&gt;Radiative Trapping&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Oct 8&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/18-dust/&gt;Interstellar Dust&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Oct 11&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/&gt;Scattering and Absorption by Small Particles&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Oct 13&lt;/td&gt;
 &lt;td&gt;Presentation by Malinda Baer (#2)&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Malinda+Baer+Presentation/1_jlz3to67"&gt;Tracing PAH Size in Prominent Nearby Mid-Infrared Environments&lt;/a&gt;&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Oct 15&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/&gt;Telescope Proposals and Time Allocation Committees&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Oct 18&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Dust Composition
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Oct 20&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;Exam 2&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Lectures up to Oct 11 (inclusive)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Oct 25&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/&gt;Grain Temperatures, Physics, and Dynamics&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Oct 27&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/&gt;Heating and Cooling of HII regions&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Oct 29&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/&gt;HI Clouds: Observations, Heating and Cooling&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Nov 1&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/&gt;Molecular Hydrogen&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Nov 3&lt;/td&gt;
 &lt;td&gt;Mock TAC discussion pt 1&lt;/td&gt;
 &lt;td&gt;&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Nov 5&lt;/td&gt;
 &lt;td&gt;Mock TAC discussion pt 2&lt;/td&gt;
 &lt;td&gt;&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Nov 8&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/&gt;Molecular Clouds, Observations, Chemistry, and Ionization&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Nov 10&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/&gt;Fluids and Shocks&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Nov 12&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/&gt;Supernovae Remnants and the Hot Ionized Medium&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Nov 15&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/&gt;Gravitational Collapse and Star Formation: Theory&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Nov 17&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/&gt;Star Formation: Observations&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Nov 19&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 Protostars to PPDs
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Nov 29&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/32-circumstellar-disks/&gt;Circumstellar Disks&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Dec 1&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;Exam 3&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Lectures up to TBD&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Dec 3&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/33-exoplanets-planet-formation/&gt;Exoplanets and Planet Formation&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Mon Dec 6&lt;/td&gt;
 &lt;td&gt;Presentation by Megan Delamer (#3)&lt;/td&gt;
 &lt;td&gt;TBD&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Wed Dec 8&lt;/td&gt;
 &lt;td&gt;Lecture&lt;/td&gt;
 &lt;td&gt;

 
 

 
 
 

 
 &lt;a href=https://iancze.github.io/courses/astro542/lectures/35-astrochemistry/&gt;Astrochemistry&lt;/a&gt;
 &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Fri Dec 10&lt;/td&gt;
 &lt;td&gt;Workshop&lt;/td&gt;
 &lt;td&gt;ALMA proposals&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;</description></item><item><title>Regularized Maximum Likelihood (RML)</title><link>https://iancze.github.io/courses/as5003/lectures/08-rml/</link><pubDate>Wed, 27 Sep 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/08-rml/</guid><description>



&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19551280"&gt;Data Analysis: a Bayesian Tutorial&lt;/a&gt; by Sivia and Skilling&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2010arXiv1008.4686H/abstract"&gt;Data Analysis Recipes: Fitting a Model to Data&lt;/a&gt;, by Hogg et al.&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2012arXiv1205.4446H/abstract"&gt;Data analysis recipes: Probability calculus for inference&lt;/a&gt; by Hogg&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19499523"&gt;Machine Learning: A Probabilistic Perspective&lt;/a&gt; by Murphy, Chapter 10&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/3405468"&gt;Pattern Recognition and Machine Learning&lt;/a&gt; by Bishop, Chapter 8&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/32525864"&gt;Probabilistic Graphical Models&lt;/a&gt; by Sucar, especially Chapters 7, 8&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#last-time"&gt;Last time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;general interferometer with north-south and east-west baselines&lt;/li&gt;
&lt;li&gt;arrays with multiple antennas and Earth aperture synthesis&lt;/li&gt;
&lt;li&gt;point spread functions (PSFs) and their relationship to array configuration&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="todays-lecture"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#todays-lecture"&gt;Today&amp;rsquo;s lecture&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now that we&amp;rsquo;ve covered how interferometers work to observe a source and the Fourier transform theory behind that, we&amp;rsquo;re going to focus on the data products (called &amp;ldquo;visibilities&amp;rdquo;) and Bayesian inference techniques for analyzing the data in its natural space. In subsequent lectures we will talk about how we might use the visibilities and Fourier inversion techniques to synthesize images of the source, but this lecture occupies an important intermediate (and foundational) step, where we are treating the visibilities as the &amp;ldquo;raw&amp;rdquo; data product and thinking about how we bring our analysis techniques into that space.&lt;/p&gt;
&lt;p&gt;The topics we will cover include:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Bayesian inference&lt;/li&gt;
&lt;li&gt;Forward-modeling with a &lt;em&gt;generative&lt;/em&gt; model&lt;/li&gt;
&lt;li&gt;Complex-valued noise and measurement (weights)&lt;/li&gt;
&lt;li&gt;Statistical weight and relationship to point source uncertainty&lt;/li&gt;
&lt;li&gt;Forward-modeling visibility data&lt;/li&gt;
&lt;li&gt;Missing spatial frequencies (model constraints)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="probability-calculus-and-bayesian-inference"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#probability-calculus-and-bayesian-inference"&gt;Probability calculus and Bayesian Inference&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;A good reference for this section is &lt;a href="https://ui.adsabs.harvard.edu/abs/2012arXiv1205.4446H/abstract"&gt;Data analysis recipes: Probability calculus for inference&lt;/a&gt; by Hogg&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Generally, we write probability distributions like \(p(a)\). The probability distribution is a function describing the probability of the variable \(a\) having some value.&lt;/p&gt;
&lt;p&gt;Probability functions are normalized.
$$
\int_{-\infty}^\infty p(a)\,\mathrm{d}a = 1
$$&lt;/p&gt;
&lt;p&gt;Say that \(a\) represents the height of an individual from the US population, measured in meters.&lt;/p&gt;
&lt;p&gt;TODO: draw a bell curve&lt;/p&gt;
&lt;p&gt;Probability functions &lt;strong&gt;have units&lt;/strong&gt;. In this case, \(p(a)\) has units of \(a^{-1}\), or \(\mathrm{m}^{-1}\).&lt;/p&gt;
&lt;p&gt;If selected an individual from the US population and we wanted to know the probability that their height was between 1.7 and 1.9 meters, we could do an integral over this range&lt;/p&gt;
&lt;p&gt;$$
\int_{1.7\,\mathrm{m}}^{1.9\,\mathrm{m}} p(a)\,\mathrm{d}a
$$&lt;/p&gt;




&lt;h3 id="conditional-probabilities"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#conditional-probabilities"&gt;Conditional probabilities&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;It&amp;rsquo;s very common that we will talk about multiple parameters at the same time. For example, we can continue our example and let \(b\) be the age of an individual drawn from the US population. We can talk about the probability of an individual drawn from the US population having a certain height &lt;em&gt;given&lt;/em&gt; that we know they are 20 years old&lt;/p&gt;
&lt;p&gt;$$
p(a | b = 20\,\mathrm{yr}).
$$&lt;/p&gt;
&lt;p&gt;What are the units of this probability distribution? It&amp;rsquo;s actually the same as before, it&amp;rsquo;s \(a^{-1}\), or \(\mathrm{m}^{-1}\), because this probability distribution must obey the same normalization&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty p(a | b = 20\,\mathrm{yr})) \,\mathrm{d}a = 1
$$&lt;/p&gt;
&lt;p&gt;You can&amp;rsquo;t do the integral
$$
\int p(a | b ) \,\mathrm{d}b
$$
because this integrand now has units of \(a^{-1}b\), which is nonsensical.&lt;/p&gt;




&lt;h3 id="factorizing-probabilities"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#factorizing-probabilities"&gt;Factorizing probabilities&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s consider the probability distribution
$$
p(a,b)
$$
this is a two-dimensional probability distribution. It has units of \((ab)^{-1}\), or in our previous example \(\mathrm{m}^{-1}\, \mathrm{yr}^{-1}\). You can read this as the probability of an individual having \(a\) value of height &lt;em&gt;and&lt;/em&gt; \(b\) value of age, i.e., this is a &lt;em&gt;joint&lt;/em&gt; probability distribution.&lt;/p&gt;
&lt;p&gt;The same normalization rules apply, only now these need to be done over two dimensions.&lt;/p&gt;
&lt;p&gt;You can take &lt;em&gt;any&lt;/em&gt; joint probability distribution and factor it into conditional distributions. So, we could write \(p(a,b)\) in two different ways
$$
p(a,b) = p(a) p(b | a)
$$
or
$$
p(a,b) = p(a|b) p(b).
$$
So, in words, we can say that the probability of \(a\) &lt;em&gt;and&lt;/em&gt; \(b\) (the left hand side) is equal to the probability of \(a\) &lt;em&gt;times&lt;/em&gt; the probability of \(b\) given \(a\).&lt;/p&gt;
&lt;p&gt;As I said, this factorization can apply to any joint probability distribution.&lt;/p&gt;
&lt;p&gt;Side note that &lt;em&gt;if&lt;/em&gt;
$$
p(b | a) = p(b)
$$
we would say that \(a\) and \(b\) are independent variables, and so we would write
$$
p(a, b) = p(a) p(b),
$$
but this isn&amp;rsquo;t true in the general case, only if the variables are independent.&lt;/p&gt;
&lt;p&gt;We can put the two factorization equations together to arrive at another relationship
$$
p(a | b) = \frac{p(b|a) p(a)}{p(b)}
$$
which is called Bayes&amp;rsquo;s theorem.&lt;/p&gt;




&lt;h3 id="marginalization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#marginalization"&gt;Marginalization&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;One amazing thing you can do with probability distributions is &lt;em&gt;marginalization&lt;/em&gt;. Say that I told you the joint distribution \(p(a, b)\). As we talked about, we said this distribution had units of \((ab)^{-1}\) or \(\mathrm{m}^{-1}\, \mathrm{yr}^{-1}\). But let&amp;rsquo;s say you only cared about \(p(a)\), the distribution of heights. We can marginalize away the variable we don&amp;rsquo;t want by integration
$$
p(a) = \int p(a, b)\,\mathrm{d}a.
$$&lt;/p&gt;
&lt;p&gt;As we&amp;rsquo;ll see in a moment, this has huge implications when it comes time to do inference.&lt;/p&gt;




&lt;h2 id="likelihood-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#likelihood-functions"&gt;Likelihood functions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now, let&amp;rsquo;s revisit Bayes&amp;rsquo;s rule and rewrite it like
$$
p(\mathrm{hypothesis} | \mathrm{data}) \propto p(\mathrm{data} | \mathrm{hypothesis}) \times p(\mathrm{hypothesis}).
$$
We&amp;rsquo;re omitting a constant of proportionality, commonly called the Bayesian evidence.&lt;/p&gt;
&lt;p&gt;The term on the left hand side is called a posterior distribution and is really a wonderful thing to report at the end of your analysis. Say you collected many years of measurements on the positions (orbits) of Jupiter, Saturn, and their moons, and then used those measurements to infer the mass of Saturn (as Laplace famously did). The posterior you would be most interested in would be a 1D distribution of the probability of the mass of Saturn and would represent your degree of belief that Saturn truly had that particular mass. This would be conditional on all of the measurements you made.&lt;/p&gt;
&lt;p&gt;The term on the very right hand side is a prior probability distribution and expresses your belief about the mass of Saturn in the absence of data. For example, we might rightly say that the mass needed to be greater than zero, and less than the mass of the Sun. A simple prior would then ascribe equal probabilities to all values in between (or perhaps equal probabilities to the &lt;em&gt;logarithm&lt;/em&gt; of the mass of Saturn).&lt;/p&gt;
&lt;p&gt;The remaining term, \(p(\mathrm{data} | \mathrm{hypothesis}) \) is called a likelihood function, and it is really where the rubber meets the road in most statistical analyses. Simply put, the likelihood function is how probable the observed data is for a given setting of the hypothesis. So, what is the probability of obtaining the observed positional measurements of Jupiter, Saturn, and their moons &lt;em&gt;if&lt;/em&gt; the mass of Saturn were \(5\times 10^{26}\,\mathrm{kg}\), for example.&lt;/p&gt;
&lt;p&gt;A quick note that likelihood functions show up in frequentist analysis all the time, too. However, the interpretations of probability are different.&lt;/p&gt;




&lt;h3 id="fitting-a-line"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#fitting-a-line"&gt;Fitting a line&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s dive into a quick example with some real data to make these concepts clearer.&lt;/p&gt;
&lt;p&gt;Typically, when astronomers fit a model to some dataset, such as a line \(y = m x + b\) to a collection of \(\boldsymbol{X} = {x_1, x_2, \ldots, x_N}\) and \(\boldsymbol{Y} = {y_1, y_2, \ldots, y_N}\) points, we require a likelihood function. Simply put, the likelihood function specifies the probability of the data, given a model, and encapsulates our assumptions about the data and noise generating processes.&lt;/p&gt;
&lt;p&gt;TODO: draw a bunch of points for putting a line through.&lt;/p&gt;
&lt;p&gt;For most real-world datasets, we don&amp;rsquo;t measure the &amp;ldquo;true&amp;rdquo; \(y\) value of the line (i.e., \(mx + b\)), but rather make a measurement which has been partially corrupted by some &amp;ldquo;noise.&amp;rdquo; In that case, we say that each \(y_i\) data point is actually generated by&lt;/p&gt;
&lt;p&gt;$$
y_i = m x_i + b + \epsilon
$$&lt;/p&gt;
&lt;p&gt;where \(\epsilon\) is a noise realization from a standard &lt;a href="https://en.wikipedia.org/wiki/Normal_distribution"&gt;normal distribution&lt;/a&gt; with standard deviation \(\sigma\), i.e.,&lt;/p&gt;
&lt;p&gt;$$
\epsilon \sim \mathcal{N}(0, \sigma).
$$&lt;/p&gt;
&lt;p&gt;This information about the data and noise generating process means that we can write down a likelihood function to calculate the probability that we observe the data that we do, given a set of model parameters. The likelihood function is \(p(\boldsymbol{Y} |\boldsymbol{\theta})\). Sometimes it is written as \(\mathcal{L}(\boldsymbol{Y} |\boldsymbol{\theta})\), and frequently, when employed in computation, we&amp;rsquo;ll use the logarithm of the likelihood function, or &amp;ldquo;log-likelihood,&amp;rdquo; \(\ln \mathcal{L}\) to avoid numerical under/overflow issues.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s call \(\boldsymbol{\theta} = {m, b}\) and \(M(x_i |, \boldsymbol{\theta}) = m x_i + b\). This is a very simple example here, but we would still call \(M\) a forward or &lt;em&gt;generative&lt;/em&gt; model. By that, we mean our model is sophisticated enough that we can use it (and some noise model) to fully replicate the dataset, or alternative sets of data indistinguishable from the measured data.&lt;/p&gt;
&lt;p&gt;The probability of observing each datum is a Gaussian (normal distribution) centered on the model value, evaluated at the \(y_i\) value. So, the full likelihood function for this line problem is just the multiplication of all of these probability distributions&lt;/p&gt;
&lt;p&gt;$$
\mathcal{L}(\boldsymbol{Y} |\boldsymbol{\theta}) = \prod_i^N \frac{1}{\sqrt{2 \pi} \sigma} \exp \left [ - \frac{(y_i - M(x_i |\boldsymbol{\theta}))^2}{2 \sigma^2}\right ].
$$&lt;/p&gt;
&lt;p&gt;The logarithm of the likelihood function is&lt;/p&gt;
&lt;p&gt;$$
\ln \mathcal{L}(\boldsymbol{Y} |,\boldsymbol{\theta}) = -N \ln(\sqrt{2 \pi} \sigma) - \frac{1}{2} \sum_i^N \frac{(y_i - M(x_i |\boldsymbol{\theta}))^2}{\sigma^2}.
$$&lt;/p&gt;
&lt;p&gt;You may recognize the right hand term looks similar to the \(\chi^2\) metric,&lt;/p&gt;
&lt;p&gt;$$
\chi^2(\boldsymbol{Y} |\boldsymbol{\theta}) = \sum_i^N \frac{(y_i - M(x_i |\boldsymbol{\theta}))^2}{\sigma^2}
$$&lt;/p&gt;
&lt;p&gt;Assuming that the uncertainty (\(\sigma\)) on each data point is known (and remains constant), the first term in the log likelihood expression remains constant, and we have&lt;/p&gt;
&lt;p&gt;$$
\ln \mathcal{L}(\boldsymbol{Y} |\boldsymbol{\theta}) = - \frac{1}{2} \chi^2 (\boldsymbol{Y} |\boldsymbol{\theta}) + C
$$&lt;/p&gt;
&lt;p&gt;where \(C\) is a constant with respect to the model parameters. It is common to use shorthand to say that &amp;ldquo;the likelihood function is \(\chi^2\)&amp;rdquo; to indicate situations where the data uncertainties are Gaussian. Very often, we (or others) are interested in the parameter values \(\boldsymbol{\theta}_\mathrm{MLE}\) which maximize the likelihood function. Unsurprisingly, these parameters are called the &lt;em&gt;maximum likelihood estimate&lt;/em&gt; (or MLE), and usually they represent something like a &amp;ldquo;best-fit&amp;rdquo; model.&lt;/p&gt;
&lt;p&gt;When it comes time to do parameter inference, however, it&amp;rsquo;s important to keep in mind&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;the simplifying assumptions we made about the noise uncertainties being constant with respect to the model parameters. If we were to &amp;ldquo;fit for the noise&amp;rdquo; in a hierarchical model, for example, we would need to use the full form of the log-likelihood function, including the \(-N \ln \left (\sqrt{2 \pi} \sigma \right)\) term.&lt;/li&gt;
&lt;li&gt;that in order to maximize the likelihood function we want to &lt;em&gt;minimize&lt;/em&gt; the \(\chi^2\) function.&lt;/li&gt;
&lt;li&gt;that constants of proportionality (e.g., the 1/2 in front of the \(\chi^2\)) can matter when combining likelihood functions with prior distributions for Bayesian parameter inference.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;To be specific, \(\chi^2\) is not the end of the story when we&amp;rsquo;d like to perform Bayesian parameter inference. To do so, we need the posterior probability distribution of the model parameters given the dataset, \(p(\boldsymbol{\theta}|,\boldsymbol{Y})\).&lt;/p&gt;




&lt;h2 id="visibility-data"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#visibility-data"&gt;Visibility Data&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now that we have reviewed likelihood functions, let&amp;rsquo;s turn back to radio astronomy and go into further detail &lt;em&gt;how&lt;/em&gt; the visibility function is sampled. Recall that the visibility domain is the Fourier transform of the image sky brightness \(\mathcal{V} \leftrightharpoons I\).&lt;/p&gt;
&lt;p&gt;The visibility function is complex-valued, and each measurement of it (denoted by \(V_i\)) is made in the presence of noise&lt;/p&gt;
&lt;p&gt;$$
V_i = \mathcal{V}(u_i, v_i) + \epsilon.
$$&lt;/p&gt;
&lt;p&gt;Here \(\epsilon\) represents a noise realization from a &lt;a href="https://en.wikipedia.org/wiki/Complex_normal_distribution"&gt;complex normal&lt;/a&gt; (Gaussian) distribution. Thankfully, most interferometric datasets &lt;em&gt;do not&lt;/em&gt; exhibit significant covariance between the real and imaginary noise components &lt;em&gt;and&lt;/em&gt; the distributions of the values are similar, so we could equivalently say that the real and imaginary components of the noise are separately generated by draws from normal distributions characterized by standard deviation \(\sigma\)&lt;/p&gt;
&lt;p&gt;$$
\epsilon_\Re \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;$$
\epsilon_\Im \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;where \(\sigma\) is a real-valued quantity. If the units of the visibility function are Janskys, then the units of \(\sigma\) are also Janskys.&lt;/p&gt;
&lt;p&gt;The full complex noise-draw is given by
$$
\epsilon = \epsilon_\Re + i \epsilon_\Im.
$$&lt;/p&gt;
&lt;p&gt;Radio interferometers will commonly represent the uncertainty on each visibility measurement by a &amp;ldquo;weight&amp;rdquo; \(w_i\), where&lt;/p&gt;
&lt;p&gt;$$
w_i = \frac{1}{\sigma_i^2}.
$$&lt;/p&gt;
&lt;p&gt;Like \(\sigma\), the weight itself is a real quantity, in this case having units of \(1/\mathrm{Jy}^2\).&lt;/p&gt;
&lt;p&gt;A full interferometric dataset is a collection of visibility measurements, which we represent by&lt;/p&gt;
&lt;p&gt;$$
\boldsymbol{V} = \{V_1, V_2, \ldots V_N\}_{i=1}^N
$$&lt;/p&gt;
&lt;p&gt;each one having a corresponding \(u_i, v_i\) coordinate. For reference, a typical ALMA dataset might contain a half-million individual visibility samples, acquired over a range of spatial frequencies.&lt;/p&gt;




&lt;h3 id="likelihood-functions-for-fourier-data"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#likelihood-functions-for-fourier-data"&gt;Likelihood functions for Fourier data&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we&amp;rsquo;ve introduced likelihood functions in general and the specifics of Fourier data, let&amp;rsquo;s talk about likelihood functions for inference with Fourier data. As before, our statement about the data generating process&lt;/p&gt;
&lt;p&gt;$$
V_i = \mathcal{V}(u_i, v_i) + \epsilon
$$&lt;/p&gt;
&lt;p&gt;leads us to the formulation of the likelihood function.&lt;/p&gt;
&lt;p&gt;First, let&amp;rsquo;s assume we have some model that we&amp;rsquo;d like to fit to our dataset. To be a forward model, it should be able to predict the value of the visibility function for any spatial frequency, i.e., we need to be able to calculate&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u, v) = M_\mathcal{V}(u, v |, \boldsymbol{\theta}).
$$&lt;/p&gt;
&lt;p&gt;Following the discussion about how the complex noise realization \(\epsilon\) is generated, this leads to a log likelihood function&lt;/p&gt;
&lt;p&gt;$$
\ln \mathcal{L}(\boldsymbol{V}|,\boldsymbol{\theta}) = - \frac{1}{2} \chi^2(\boldsymbol{V}|,\boldsymbol{\theta}) + C
$$&lt;/p&gt;
&lt;p&gt;Because the data and model are complex-valued, \(\chi^2\) is evaluated as&lt;/p&gt;
&lt;p&gt;$$
\chi^2(\boldsymbol{V}|,\boldsymbol{\theta}) = \sum_i^N \frac{|V_i - M_\mathcal{V}(u_i, v_i |,\boldsymbol{\theta})|^2}{\sigma_i^2}
$$&lt;/p&gt;
&lt;p&gt;where \(| |\) denotes the modulus squared. Equivalently, the calculation can be broken up into sums over the real and imaginary components of the visibility data and model&lt;/p&gt;
&lt;p&gt;$$
\chi^2(\boldsymbol{V}|,\boldsymbol{\theta}) = \sum_i^N \frac{(V_{\Re,i} - M_\mathcal{V,\Re}(u_i, v_i |,\boldsymbol{\theta}))^2}{\sigma_i^2} + \sum_i^N \frac{(V_{\Im,i} - M_\mathcal{V,\Im}(u_i, v_i |,\boldsymbol{\theta}))^2}{\sigma_i^2}.
$$&lt;/p&gt;
&lt;p&gt;Because images of the sky are real, therefore the real part of the visibility function must always be even and the imaginary part odd. The visibility function is Hermitian. This means that&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u, v) = \mathcal{V}^{*}(-u, -v).
$$&lt;/p&gt;
&lt;p&gt;So, if you make a measurement of \(\mathcal{V}(u, v)\), this means you have also made the same measurement of \(\mathcal{V}^{*}(-u, -v)\). If you are doing forward-modeling of the visibilities as we just described, you only need to use one of the Hermitian pairs, otherwise you will double count your measurements (this only turns out to be a scale factor in the likelihood for most analysis, so it&amp;rsquo;s technically OK). If you are gridding the visibilities to then image them, however, you will certainly want to include the Hermitian pairs. Otherwise, your image will not turn out to be real!&lt;/p&gt;




&lt;h3 id="point-source-sensitivity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#point-source-sensitivity"&gt;Point source sensitivity&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We&amp;rsquo;ll use our forward modeling formalism to fit for the flux of a point source.&lt;/p&gt;
&lt;p&gt;Se also &lt;a href="https://ui.adsabs.harvard.edu/abs/2022MNRAS.513.5790C/abstract"&gt;Casussus and Carcamo&lt;/a&gt; 2022, appendix A4.&lt;/p&gt;




&lt;h2 id="more-complex-visibility-models"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#more-complex-visibility-models"&gt;More complex visibility models&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;It&amp;rsquo;s difficult to reason about all but the simplest models directly in the Fourier plane, so usually models are constructed in the image plane \(M_I(l,m |,\boldsymbol{\theta})\) and then Fourier transformed (either analytically, or via the FFT) to construct visibility models&lt;/p&gt;
&lt;p&gt;$$
M_\mathcal{V}(u, v |, \boldsymbol{\theta}) \leftrightharpoons M_I(l,m |,\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;For marginally resolved sources, it&amp;rsquo;s common to fit simple models like a 2D Gaussian. We can write down an image plane model and then calculate its Fourier transform analytically.&lt;/p&gt;
&lt;p&gt;But, these could be more complicated models. For example, these models could be channel maps of carbon monoxide emission from a rotating protoplanetary disk (as in &lt;a href="https://ui.adsabs.harvard.edu/abs/2015ApJ...806..154C/abstract"&gt;Czekala et al. 2015&lt;/a&gt;, where \(\boldsymbol{\theta}\) contains parameters setting the structure of the disk), or rings of continuum emission from a protoplanetary disk (as in &lt;a href="https://ui.adsabs.harvard.edu/abs/2018ApJ...869L..48G/abstract"&gt;Guzmán et al. 2018&lt;/a&gt;, where \(\boldsymbol{\theta}\) contains parameters setting the sizes and locations of the rings).&lt;/p&gt;
&lt;p&gt;With the likelihood function specified, we can add prior probability distributions \(p(\boldsymbol{\theta})\), and calculate and explore the posterior probability distribution of the model parameters using algorithms like Markov Chain Monte Carlo. In this type of Bayesian inference, we&amp;rsquo;re usually using forward models constructed with a small to medium number of parameters (e.g., 10 - 30), like in the protoplanetary disk examples of &lt;a href="https://ui.adsabs.harvard.edu/abs/2015ApJ...806..154C/abstract"&gt;Czekala et al. 2015&lt;/a&gt; or &lt;a href="https://ui.adsabs.harvard.edu/abs/2018ApJ...869L..48G/abstract"&gt;Guzmán et al. 2018&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;All of these type of models would be called &lt;em&gt;parametric&lt;/em&gt; models, because we can represent the model using a finite set of parameters. E.g., for the Gaussian model, we have width in the major and minor axes, rotation angle, and 2D position. So these parameters fully represent the model. One thing you need to be concerned with inference using parametric models is whether you have the right model! If your source is actually a ring instead of a Gaussian, the posterior distribution of your parameters can be rendered meaningless.&lt;/p&gt;
&lt;p&gt;Forward modeling with (simple) parametric models can be very useful for understanding&lt;/p&gt;




&lt;h2 id="discussion-about-model-mis-specification-and-unsampled-visibilities-model-constraints"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#discussion-about-model-mis-specification-and-unsampled-visibilities-model-constraints"&gt;Discussion about model mis-specification and unsampled visibilities (model constraints)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Could use u / v undersampling to constrain width or size, say.&lt;/p&gt;




&lt;h2 id="references"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#references"&gt;References&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19499523"&gt;Machine Learning: A Probabilistic Perspective&lt;/a&gt; by Murphy, Chapter 10&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/3405468"&gt;Pattern Recognition and Machine Learning&lt;/a&gt; by Bishop, Chapter 8&lt;/li&gt;
&lt;li&gt;The fourth paper in the 2019 &lt;a href="https://ui.adsabs.harvard.edu/abs/2019ApJ...875L...4E/abstract"&gt;Event Horizon Telescope Collaboration series&lt;/a&gt; describing the imaging principles&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/1986ARA%26A..24..127N/abstract"&gt;Maximum entropy image restoration in astronomy&lt;/a&gt; AR&amp;amp;A by Narayan and Nityananda 1986&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2018A%26C....22...16C/abstract"&gt;Multi-GPU maximum entropy image synthesis for radio astronomy&lt;/a&gt; by Cárcamo et al. 2018&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2022arXiv220911813Z/abstract"&gt;Regularized Maximum Likelihood Techniques for ALMA Observations&lt;/a&gt; by Zawadzki, Czekala, et al.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="last-time-1"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#last-time-1"&gt;Last time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Discussed \(u,v\) coverage and sampling (weights)&lt;/li&gt;
&lt;li&gt;Introduced the &amp;ldquo;dirty image&amp;rdquo; as the inverse Fourier transform of the visibility samples&lt;/li&gt;
&lt;li&gt;Introduced the CLEAN image deconvolution procedure&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#this-time"&gt;This time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Review parametric vs. non-parametric models&lt;/li&gt;
&lt;li&gt;Introduce Regularized Maximum Likelihood (RML) imaging&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="parametric-vs-non-parametric-models"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#parametric-vs-non-parametric-models"&gt;Parametric vs. Non-Parametric Models&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Recall our discussion on Bayesian inference and what it means to forward-model a dataset and how to calculate a likelihood function.&lt;/p&gt;
&lt;p&gt;Say we have a model that we are fitting to some data. In &amp;ldquo;Machine Learning: A Probabilistic Perspective,&amp;rdquo; Murphy defines a parametric model as one that has a fixed number of parameters, and a non-parametric one as one where the number of parameters grows with the size of the data.&lt;/p&gt;
&lt;p&gt;Something like the line we discussed in our Bayesian Modeling lecture is a parametric model. It has two parameters, a slope and an intercept. If we were observing a source and we wanted to fit a 2D Gaussian to the visibility function, that visibility model would also be a parametric model. Its parameters would be the width of the Gaussian, the position of the source, and the amplitude of the source.&lt;/p&gt;
&lt;p&gt;TODO: draw example of Gaussian function and label parameters&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;ve ever fit a spline to a bunch of data, then you&amp;rsquo;ve used a non-parametric model. A Gaussian process would also a non-parametric model. In these models there in fact are parameters (like the number or type of splines/GP kernels), but these are usually nuisances to the problem. You wouldn&amp;rsquo;t necessarily fit a spline model to determine the exact number of spline position parameters, but you &lt;em&gt;are&lt;/em&gt; interested in the approximation to \(f(x)\) that the spline has enabled you.&lt;/p&gt;
&lt;p&gt;TODO: draw points and a spline or GP drawn through them&lt;/p&gt;
&lt;p&gt;I would consider a CLEAN model to be a type of non-parametric model too. Through the CLEANing process, you create a model of the source emission from a collection of \(\delta\) functions. Each of these \(\delta\) functions technically has parameters, but those are mostly nuisance parameters in pursuit of their aggregate representation of the model. In general, non-parametric models have the ability to be more expressive than parametric models, but sometimes at the expense of interpretability.&lt;/p&gt;




&lt;h2 id="rml-images-as-non-parametric-models"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#rml-images-as-non-parametric-models"&gt;RML images as non-parametric models&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now, let me introduce a set of techniques that have been grouped under the banner &amp;ldquo;Regularized Maximum Likelihood Imaging&amp;rdquo; or RML Imaging for short. With RML imaging, we&amp;rsquo;re trying to come up with a model that will fit the dataset. But rather than using a parametric model like a protoplanetary disk structure model or a series of Gaussian rings, we&amp;rsquo;re using a non-parametric model of &lt;em&gt;the image itself&lt;/em&gt;. This could be as simple as parameterizing the image using the intensity values of the pixels themselves, i.e.,&lt;/p&gt;
&lt;p&gt;$$
\boldsymbol{\theta} = {I_1, I_2, \ldots, I_{N^2} }
$$&lt;/p&gt;
&lt;p&gt;assuming we have an \(N \times N\) image.&lt;/p&gt;
&lt;p&gt;A flexible image model for RML imaging is mostly analogous to using a spline or Gaussian process to fit a series of \(\boldsymbol{X} = {x_1, x_2, \ldots, x_N}\) and \(\boldsymbol{Y} = {y_1, y_2, \ldots, y_N}\) points&amp;mdash;the model will nearly always have enough flexibility to capture the structure that exists in the dataset. The most straightforward formulation of a non-parametric image model is the pixel basis set, but we could also use more sophisticated basis sets like a set of wavelet coefficients, or even more exotic basis sets constructed from trained neural networks. These may have some serious advantages when it comes to the &amp;ldquo;regularizing&amp;rdquo; part of &amp;ldquo;regularized maximum likelihood&amp;rdquo; imaging. But first, let&amp;rsquo;s talk about the &amp;ldquo;maximum likelihood&amp;rdquo; part.&lt;/p&gt;
&lt;p&gt;Given some image parameterization (e.g., a pixel basis set of \(N \times N\) pixels, with each pixel &lt;code&gt;cell_size&lt;/code&gt; in width), we would like to find the maximum likelihood image \(\boldsymbol{\theta}_\mathrm{MLE}\). Fortunately, because the Fourier transform is a linear operation, we can analytically calculate the maximum solution (the same way we might find the best-fit slope and intercept for the line example). This maximum likelihood solution is called (in the radio astronomy world) the dirty image, and its associated point spread function is called the dirty beam.&lt;/p&gt;
&lt;p&gt;In the construction of the dirty image, all unsampled spatial frequencies are set to zero power. This means that the dirty image will only contain spatial frequencies about which we have at least some data. This assumption, however, rarely translates into good image fidelity, especially if there are many unsampled spatial frequencies which carry significant power. It&amp;rsquo;s also important to recognize that dirty image is only &lt;em&gt;one&lt;/em&gt; out of a set of &lt;em&gt;many&lt;/em&gt; images that could maximize the likelihood function. From the perspective of the likelihood calculation, we could modify the unsampled spatial frequencies of the dirty image to whatever power we might like, and, because they are &lt;em&gt;unsampled&lt;/em&gt;, the value of the likelihood calculation won&amp;rsquo;t change, i.e., it will still remain maximal.&lt;/p&gt;
&lt;p&gt;When synthesis imaging is described as an &amp;ldquo;ill-posed inverse problem,&amp;rdquo; this is what is meant. There is a (potentially infinite) range of images that could &lt;em&gt;exactly&lt;/em&gt; fit the dataset, and without additional information we have no way of discriminating which is best. As you might suspect, this is now where the &amp;ldquo;regularization&amp;rdquo; part of &amp;ldquo;regularized maximum likelihood&amp;rdquo; imaging comes in.&lt;/p&gt;




&lt;h2 id="regularization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#regularization"&gt;Regularization&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are a number of different ways to talk about regularization. If one wants to be Bayesian about it, one would talk about specifying &lt;em&gt;priors&lt;/em&gt;, i.e., we introduce terms like \(p(\boldsymbol{\theta})\) such that we might calculate the maximum a posteriori (MAP) image \(\boldsymbol{\theta}_\mathrm{MAP}\) using the posterior probability distribution&lt;/p&gt;
&lt;p&gt;$$
p(\boldsymbol{\theta} |\, \boldsymbol{V}) \propto \mathcal{L}(\boldsymbol{V} |\, \boldsymbol{\theta}) \, p(\boldsymbol{\theta}).
$$&lt;/p&gt;
&lt;p&gt;For computational reasons related to numerical over/underflow, we would most likely use the logarithm of the posterior probability distribution&lt;/p&gt;
&lt;p&gt;$$
\ln p(\boldsymbol{\theta} |\, \boldsymbol{V}) \propto \ln \mathcal{L}(\boldsymbol{V} |\, \boldsymbol{\theta}) + \ln p(\boldsymbol{\theta}).
$$&lt;/p&gt;
&lt;p&gt;One could accomplish the same goal without necessarily invoking the Bayesian language by simply talking about which parameters \(\boldsymbol{\theta}\) optimize some objective function.&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ll adopt the perspective that we have some objective &amp;ldquo;cost&amp;rdquo; function that we&amp;rsquo;d like to &lt;em&gt;minimize&lt;/em&gt; to obtain the optimal parameters \(\hat{\boldsymbol{\theta}}\). The machine learning community calls this a &amp;ldquo;loss&amp;rdquo; function \(L(\boldsymbol{\theta})\), and so we&amp;rsquo;ll borrow that terminology here. For an unregularized fit, an acceptable loss function is just the negative log likelihood (&amp;ldquo;nll&amp;rdquo;) term,&lt;/p&gt;
&lt;p&gt;$$
L(\boldsymbol{\theta}) = L_\mathrm{nll}(\boldsymbol{\theta}) = - \ln \mathcal{L}(\boldsymbol{V}|\,\boldsymbol{\theta}) = \frac{1}{2} \chi^2(\boldsymbol{V}|\,\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;If we&amp;rsquo;re only interested in \(\hat{\boldsymbol{\theta}}\), it doesn&amp;rsquo;t matter whether we include the 1/2 prefactor in front of \(\chi^2\), the loss function will still have the same optimum. However, when it comes time to add additional terms to the loss function, these prefactors matter in controlling the relative strength of each term.&lt;/p&gt;
&lt;p&gt;When phrased in the terminology of function optimization, additional terms can be described as regularization penalties. To be specific, let&amp;rsquo;s add a term that regularizes the sparsity of an image.&lt;/p&gt;
&lt;p&gt;$$
L_\mathrm{sparsity}(\boldsymbol{\theta}) = \sum_i |I_i|
$$&lt;/p&gt;
&lt;p&gt;In short, the L1 norm promotes sparse solutions (solutions where many pixel values are zero). The combination of these two terms leads to a new loss function&lt;/p&gt;
&lt;p&gt;$$
L(\boldsymbol{\theta}) = L_\mathrm{nll}(\boldsymbol{\theta}) + \lambda_\mathrm{sparsity} L_\mathrm{sparsity}(\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;Where we control the relative &amp;ldquo;strength&amp;rdquo; of the regularization via the scalar prefactor \(\lambda_\mathrm{sparsity}\). If \(\lambda_\mathrm{sparsity} = 0\), no sparsity regularization is applied. Non-zero values of \(\lambda_\mathrm{sparsity}\) will add in regularization that penalizes non-sparse \(\boldsymbol{\theta}\) values. How strong this penalization is depends on the strength relative to the other terms in the loss calculation.&lt;/p&gt;
&lt;p&gt;We can equivalently specify this using Bayesian terminology, such that&lt;/p&gt;
&lt;p&gt;$$
p(\boldsymbol{\theta} |\,\boldsymbol{V}) = \mathcal{L}(\boldsymbol{V}|,\boldsymbol{\theta}) \, p(\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;where&lt;/p&gt;
&lt;p&gt;$$
p(\boldsymbol{\theta}) = C \exp \left (-\lambda_\mathrm{sparsity} \sum_i | I_i| \right)
$$&lt;/p&gt;
&lt;p&gt;and \(C\) is a normalization factor. When working with the logarithm of the posterior, this constant term is irrelevant.&lt;/p&gt;




&lt;h2 id="the-mpol-package-for-regularized-maximum-likelihood-imaging"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#the-mpol-package-for-regularized-maximum-likelihood-imaging"&gt;The MPoL package for Regularized Maximum Likelihood imaging&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;Million Points of Light&lt;/em&gt; or &amp;ldquo;MPoL&amp;rdquo; is a Python package that is used to perform regularized maximum likelihood imaging. By that we mean that the package provides the building blocks to create flexible image models and optimize them to fit interferometric datasets. The package is developed completely in the open on &lt;a href="https://github.com/MPoL-dev/MPoL"&gt;Github&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We strive to&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;create an open, welcoming, and supportive community for new users and contributors (see our &lt;code&gt;code of conduct &amp;lt;https://github.com/MPoL-dev/MPoL/blob/main/CODE_OF_CONDUCT.md&amp;gt;&lt;/code&gt;&lt;strong&gt;and &lt;code&gt;developer documentation &amp;lt;developer-documentation.html&amp;gt;&lt;/code&gt;&lt;/strong&gt;)&lt;/li&gt;
&lt;li&gt;support well-tested (|Tests badge|) and stable releases (i.e., &lt;code&gt;pip install mpol&lt;/code&gt;) that run on all currently-supported Python versions, on Linux, MacOS, and Windows&lt;/li&gt;
&lt;li&gt;maintain up-to-date &lt;code&gt;API documentation &amp;lt;api.html&amp;gt;&lt;/code&gt;__&lt;/li&gt;
&lt;li&gt;cultivate tutorials covering real-world applications&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We also recommend checking out several other excellent packages for RML imaging:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://github.com/astrosmili/smili"&gt;SMILI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/achael/eht-imaging"&gt;eht-imaging&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/miguelcarcamov/gpuvmem"&gt;GPUVMEM&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;There are a few things about MPoL that we believe make it an appealing platform for RML modeling.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Built on PyTorch&lt;/strong&gt;: Many of MPoL&amp;rsquo;s exciting features stem from the fact that it is built on top of a rich computational library that supports autodifferentiation and construction of complex neural networks. Autodifferentiation libraries like &lt;a href="https://github.com/aesara-devs/aesara"&gt;Theano/Aesara&lt;/a&gt;, &lt;a href="https://www.tensorflow.org/"&gt;Tensorflow&lt;/a&gt;, &lt;a href="https://pytorch.org/"&gt;PyTorch&lt;/a&gt;, and &lt;a href="https://jax.readthedocs.io/"&gt;JAX&lt;/a&gt; have revolutionized the way we compute and optimize functions. For now, PyTorch is the library that best satisfies our needs, but we&amp;rsquo;re keeping a close eye on the Python autodifferentiation ecosystem should a more suitable framework arrive. If you are familiar with scientific computing with Python but haven&amp;rsquo;t yet tried any of these frameworks, don&amp;rsquo;t worry, the syntax is easy to pick up and quite similar to working with numpy arrays.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Autodifferentiation&lt;/strong&gt;: PyTorch gives MPoL the capacity to autodifferentiate through a model. The &lt;em&gt;gradient&lt;/em&gt; of the objective function is exceptionally useful for finding the &amp;ldquo;downhill&amp;rdquo; direction in a large parameter space (such as the set of image pixels). Traditionally, these gradients would have needed to been calculated analytically (by hand) or via finite-difference methods which can be noisy in high dimensions. By leveraging the autodifferentiation capabilities, this allows us to rapidly formulate and implement complex prior distributions which would otherwise be difficult to differentiate by hand.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimization&lt;/strong&gt;: PyTorch provides a full-featured suite of research-grade &lt;a href="https://pytorch.org/docs/stable/optim.html"&gt;optimizers&lt;/a&gt; designed to train deep neural networks. These same optimizers can be employed to quickly find the optimum RML image.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;GPU acceleration&lt;/strong&gt;: PyTorch wraps CUDA libraries, making it seamless to take advantage of (multi-)GPU acceleration to optimize images. No need to use a single line of CUDA.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Model composability&lt;/strong&gt;: Rather than being a monolithic program for single-click RML imaging, MPoL strives to be a flexible, composable, RML imaging &lt;em&gt;library&lt;/em&gt; that provides primitives that can be used to easily solve your particular imaging challenge. One way we do this is by mimicking the PyTorch ecosystem and writing the RML imaging workflow using &lt;a href="https://pytorch.org/tutorials/beginner/nn_tutorial.html"&gt;PyTorch modules&lt;/a&gt;. This makes it easy to mix and match modules to construct arbitrarily complex imaging workflows. We&amp;rsquo;re working on tutorials that describe these ideas in depth, but one example would be the ability to use a single latent space image model to simultaneously fit single dish and interferometric data.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A bridge to the machine learning/neural network community&lt;/strong&gt;: MPoL will happily calculate RML images for you using &amp;ldquo;traditional&amp;rdquo; image priors, lest you are the kind of person that turns your nose up at the words &amp;ldquo;machine learning&amp;rdquo; or &amp;ldquo;neural network.&amp;rdquo; However, if you are the kind of person that sees opportunity in these tools, because MPoL is built on PyTorch, it is straightforward to take advantage of them for RML imaging. For example, if one were to train a variational autoencoder on protoplanetary disk emission morphologies, the latent space + decoder architecture could be easily plugged in to MPoL and serve as an imaging basis set.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="last-time-2"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#last-time-2"&gt;Last time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Recap of (parametric) forward modeling in a Bayesian context&lt;/li&gt;
&lt;li&gt;Recap of the CLEAN procedural image deconvolution algorithm&lt;/li&gt;
&lt;li&gt;Introduction of RML process as a non-parametric model&lt;/li&gt;
&lt;li&gt;Discussion of regularization, in the context of priors&lt;/li&gt;
&lt;li&gt;Discussion of loss function space (defined by probability distribution) vs. the optimization engineering that helps you navigate it&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#today"&gt;Today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Overarching question&amp;mdash;how do you assess whether something is good? Forays into Machine Learning&lt;/li&gt;
&lt;li&gt;Deeper dive into future RML applications and opportunities&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="model-comparison"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#model-comparison"&gt;Model comparison&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Last time we talked about the difference between parametric and non-parametric models, i.e., the difference between fitting a line with slope and intercept vs. fitting a spline or Gaussian process. And we made the simple distinction that a parametric model has a fixed number of parameters, whereas a non-parametric model generally has parameters that grow with your number of data points. The truth is that in several contexts these exist as part of a continuum.&lt;/p&gt;
&lt;p&gt;Today we&amp;rsquo;re going to take a journey along this continuum and examine some of the failure modes that can come about. The discussion will first be general and applicable to &lt;em&gt;many&lt;/em&gt; problems, but then we&amp;rsquo;ll zero in on the case of interferometric imaging (both CLEAN and RML) specifically.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s think of a polynomial basis. If you recall back to one of our first lectures, where I asked you to draw a function through a set of discrete samples. Let&amp;rsquo;s narrow in on the specific case of a polynomial basis set of degree \(N\), where \(N\) is the number of terms. I&amp;rsquo;m going to write it like this&lt;/p&gt;
&lt;p&gt;$$
y = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \ldots.
$$
but similar arguments apply to Legendre polynomials or Chebyshev polynomials, etc, and in practice it&amp;rsquo;s better to use on of those for your actual fitting problem.&lt;/p&gt;
&lt;p&gt;But lets say we have 10 data points. We can fit a 0th order polynomial, 1st order, etc.., using say a \(\chi^2\) likelihood function. And then we can get to a polynomial of degree ten. Who has heard the common critique &amp;ldquo;your model has more parameters than data, it&amp;rsquo;s so flexible&amp;rdquo;. This situation is where the common advice against using a model with more parameters than your data comes from, and when people utter this critique, I think this is the situation that they are referring to.&lt;/p&gt;
&lt;p&gt;What are the criticisms of this model? On one hand, it has fit the data &lt;em&gt;perfectly&lt;/em&gt;. It&amp;rsquo;s done what we&amp;rsquo;ve asked. On the other hand, it doesn&amp;rsquo;t seem to do what we want. If we were to get new data, our model probably wouldn&amp;rsquo;t be that useful.&lt;/p&gt;
&lt;p&gt;What can we do? Well, the common wisdom would have us stick to models with fewer parameters. But, this is being a little shortsighted. Instead, we can add a &lt;em&gt;regularization&lt;/em&gt; that discourages the fit from taking on large amplitudes in many of the terms. One type of regularization is &amp;ldquo;ridge regression,&amp;rdquo; also called Tikhonov or L2 regularization. It adds an extra term to the fit metric that says&lt;/p&gt;
&lt;p&gt;$$
\lambda \sum_{i=0}^{N-1}|a_i|^2
$$&lt;/p&gt;
&lt;p&gt;then we find that the amplitudes of those higher order terms (which might not be necessary) will be diminished. If you recall from very early in the semester, where we talked about the concept of band-limited signals, this regularization is related. In the limit we let \(N\rightarrow \infty\), we arrive at a Gaussian process, and the autocovariance of the function (the Gaussian process kernel) is related to the power spectrum of the signal. If we say the signal is band-limited, then that puts a &lt;em&gt;cap&lt;/em&gt; on the number of higher order terms that we can actually use.&lt;/p&gt;
&lt;p&gt;This math is fully equivalent to the RML imaging problem we introduced last week, and it also raises the same problem: how do we set the regularizer strength? What is the best choice?&lt;/p&gt;




&lt;h2 id="cross-validation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/08-rml/#cross-validation"&gt;Cross validation&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Useful thoughts from &lt;a href="https://biometry.github.io/APES/LectureNotes/2017-Resampling/CrossValidationLecture.html"&gt;https://biometry.github.io/APES/LectureNotes/2017-Resampling/CrossValidationLecture.html&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The idea is to test the &lt;em&gt;predictive power&lt;/em&gt; of your model. In this case, the model would be your setup of your. In the RML case, the model would be the settings of your image pixelization,&lt;/p&gt;
&lt;p&gt;If we have &lt;em&gt;the right model&lt;/em&gt;, we will generalize perfectly to new data. The problem is that our training data are always limited and will usually always have some noise.&lt;/p&gt;
&lt;p&gt;The problem of non-independence of your random hold-outs. When the data are small, it is possible to overfit your cross validation. This is a hard place to be in, especially when getting new data is expensive.&lt;/p&gt;</description></item><item><title>Image Plane Deconvolution (CLEAN)</title><link>https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/</link><pubDate>Tue, 19 Sep 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/</guid><description>



&lt;h3 id="references"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#references"&gt;References&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://leo.phys.unm.edu/~gbtaylor/astr423/s98book.pdf"&gt;Synthesis Imaging in Radio Astronomy II&lt;/a&gt;: Lecture 7: Imaging by Briggs, Schwab, and Sramek and Lecture 8: Deconvolution by Cornwell, Braun, and Briggs&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2021ApJS..257....2C/abstract"&gt;Molecules with ALMA at Planet-forming Scales (MAPS). II. CLEAN Strategies for Synthesizing Images of Molecular Line Emission in Protoplanetary Disks&lt;/a&gt; by Czekala et al. 2021&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="outline"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#outline"&gt;Outline&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Recap of visibility datasets and the sampling function&lt;/li&gt;
&lt;li&gt;Image plane implications of sampling&amp;ndash;the dirty image&lt;/li&gt;
&lt;li&gt;Noise and &amp;ldquo;weighting&amp;rdquo;&lt;/li&gt;
&lt;li&gt;The CLEAN image deconvolution algorithm&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="visibility-datasets"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#visibility-datasets"&gt;Visibility datasets&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Recall from last time that the visibility function is the Fourier transform of the sky brightness distribution&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u,v) \leftrightharpoons I(l,m)
$$&lt;/p&gt;
&lt;p&gt;and that each baseline (pair of antennas) of an interferometric array corresponds to a sample of the visibility function at a specific \(u,v\) point. The \(u,v\) point corresponds to the length of the &lt;em&gt;projected&lt;/em&gt; baseline in multiples of the observing wavelength and is the &lt;em&gt;spatial frequency&lt;/em&gt; of the image plane that is being sampled.&lt;/p&gt;
&lt;p&gt;For a large array with &amp;gt; 50 antennas, like ALMA, you get nearly 1000 unique &lt;em&gt;instantaneous&lt;/em&gt; baselines, from each pairwise combination of antennas in the array. As the earth rotates, you can quickly acquire new projected baseline samples.&lt;/p&gt;




&lt;h2 id="sampling-function"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#sampling-function"&gt;Sampling function&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If you do radio interferometry, you will very often see the baseline distributions plotted as a series of \(\delta\) functions on the \(u,v\) plane.&lt;/p&gt;
&lt;p&gt;TODO: draw approximate plot with points&lt;/p&gt;
&lt;p&gt;This is called the sampling function&lt;/p&gt;
&lt;p&gt;$$
S(u,v) = \sum_{k=1}^M \delta(u - u_k, v - v_k).
$$&lt;/p&gt;
&lt;p&gt;And, we can write down the sampling of the visibility function as&lt;/p&gt;
&lt;p&gt;$$
S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;If you recall from our Fourier transform distribution, this sampling function is also called the &lt;em&gt;transfer function&lt;/em&gt;. It allows certain spatial frequencies through the interferometric system. Both terminologies are used in the radio astronomy community.&lt;/p&gt;




&lt;h3 id="image-plane-implications"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#image-plane-implications"&gt;Image plane implications&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now, let&amp;rsquo;s discuss the image-plane ramifications of the sampling operation. We started with \(I(l,m)\), the &amp;ldquo;true&amp;rdquo; sky brightness, i.e., the one you would observe if you had a perfectly sensitive telescope with infinite resolving power.&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u,v) \leftrightharpoons I(l,m)
$$&lt;/p&gt;
&lt;p&gt;but we&amp;rsquo;ve &lt;em&gt;multiplied&lt;/em&gt; it by the sampling distribution&lt;/p&gt;
&lt;p&gt;$$
S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;Remember how we talked about interferometers as &lt;em&gt;spatial filters&lt;/em&gt;? We just showed how this conceptually works in the Fourier domain, the \(u,v\) coverage provided by the antenna spacings &lt;em&gt;is&lt;/em&gt; the transfer function.&lt;/p&gt;
&lt;p&gt;We can also show that interferometers are spatial filters by considering the image plane implications. This same operation implies that the true sky brightness is &lt;em&gt;convolved&lt;/em&gt; by something in the image plane
$$
I(l, m) * B_D(l, m) \leftrightharpoons S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;The quantity \(B_D(l,m,)\) is called the dirty beam. We&amp;rsquo;ll soon talk about the CLEAN algorithm, so the dirty/clean terminology will soon make sense. But first, let&amp;rsquo;s just think about this for a second. This dirty beam is the same thing we showed in previous lectures, and can also be thought of as the sum of the fringe functions.&lt;/p&gt;
&lt;p&gt;If you take an image, convolve it with the dirty beam, and then take its Fourier transform, you&amp;rsquo;ll see that you will have visibility samples only at the spatial frequencies corresponding to your baselines.&lt;/p&gt;
&lt;p&gt;Another way to think of the dirty beam is as the &lt;em&gt;impulse response&lt;/em&gt; of the interferometric system. Let&amp;rsquo;s assume we are observing a point source \(I(l,m) = \delta(l,m)\). The visibility function corresponding to a point source is a constant, so we have&lt;/p&gt;
&lt;p&gt;$$
\delta(l,m) * B_D(l,m) = S(u,v) \times \mathrm{constant}
$$&lt;/p&gt;
&lt;p&gt;$$
B_D(l,m) \leftrightharpoons S(u,v).
$$&lt;/p&gt;
&lt;p&gt;So, another way we can say the same thing is that the dirty beam is the &lt;em&gt;point spread function&lt;/em&gt; (PSF) of the interferometer, and it is given by the Fourier transform of the sampling function, which is set by the configuration of the baselines within the array.&lt;/p&gt;




&lt;h2 id="the-dirty-image"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#the-dirty-image"&gt;The Dirty Image&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In last week&amp;rsquo;s lecture, we talked a bit about the actual visibility data products, a discrete set of (noisy) visibility samples&lt;/p&gt;
&lt;p&gt;$$
\mathbf{V} = \{V_1, V_2, \ldots, V_M\}^M_{k=1}.
$$&lt;/p&gt;
&lt;p&gt;The idea is that each was sampled with some (complex) noise draw&lt;/p&gt;
&lt;p&gt;$$
V_i = \mathcal{V}(u_i, v_i) + \epsilon
$$&lt;/p&gt;
&lt;p&gt;such that
$$
\epsilon_\Re \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;$$
\epsilon_\Im \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;$$
\epsilon = \epsilon_\Re + i \epsilon_\Im.
$$&lt;/p&gt;
&lt;p&gt;And then we said that radio interferometers commonly represent the uncertainty on each visibility measurement by a &amp;ldquo;weight&amp;rdquo; \(w_i\), where&lt;/p&gt;
&lt;p&gt;$$
w_i = \frac{1}{\sigma_i^2}.
$$&lt;/p&gt;




&lt;h3 id="more-details-about-the-sampling-function"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#more-details-about-the-sampling-function"&gt;More details about the sampling function&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The fact that each visibility measurement is made in the presence of noise means that we should be taking this into account in our sampling function. So a more sophisticated sampling function looks like&lt;/p&gt;
&lt;p&gt;$$
S(u, v) = \sum_{k=1}^M T_k D_k w_k \delta(u - u_k, v - v_k).
$$&lt;/p&gt;
&lt;p&gt;In addition to weighting each sample by its inverse variance \(w_k\) (something you might do to take a statistical average), there are other factors you might fiddle with, like a &amp;ldquo;taper&amp;rdquo; \(T_k\) and density weight \(D_k\). For now, you can just think of them as equal to 1.0.&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s think about how we would take our sampled visibilities \(\mathbf{V} = \{V_1, V_2, \ldots, V_M\}^M_{k=1}\) and make an image from them. Well, the way to do this looks like an inverse Fourier transform&lt;/p&gt;
&lt;p&gt;$$
I_D(l, m) = C \sum_{k=1}^{N^\dagger} T_k D_k w_k V_k \exp \{2 \pi i (u_k l + v_k m) \}.
$$
with normalization constant
$$
C = 1 / \sum_{k=1}^{N^\dagger} T_k D_k w_k.
$$&lt;/p&gt;
&lt;p&gt;where \(N^\dagger = 2 N\) is the set of visibilities that includes their complex conjugates such that the sampling is Hermitian. This image that results is called the &lt;em&gt;dirty image&lt;/em&gt;, and we denote it with a &amp;ldquo;D&amp;rdquo; subscript.&lt;/p&gt;
&lt;p&gt;Continuing with the dirty beam concept, we also have&lt;/p&gt;
&lt;p&gt;$$
I_D(l,m) = I(l, m) * B_D(l, m) \leftrightharpoons S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;




&lt;h3 id="a-quick-note-about-the-ill-defined-imaging-problem-and-weighting-choices"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#a-quick-note-about-the-ill-defined-imaging-problem-and-weighting-choices"&gt;A quick note about the &amp;ldquo;ill-defined&amp;rdquo; imaging problem and weighting choices&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Making images from Fourier samples is generally an ill-defined inverse process, which is only complicated in the presence of noise. What do we mean by ill-defined?&lt;/p&gt;
&lt;p&gt;In the forward process, we collect samples of the visibility function at specific \(u,v\) values \(S(u, v) \times \mathcal{V}(u, v)\). To make a dirty image, we take the inverse Fourier transform of those values to produce &lt;em&gt;an&lt;/em&gt; image. This is the inverse process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The ill-defined nature of imaging&lt;/strong&gt; (even in the presence of noise): Think about what&amp;rsquo;s happening in the forward process. If the visibility function had non-zero amplitudes at some \(u,v\) values that we didn&amp;rsquo;t sample, then these never enter our dataset (obviously). The inverse process itself doesn&amp;rsquo;t include these values (obviously), and by their omission, assumes that their amplitudes are equal to zero.&lt;/p&gt;
&lt;p&gt;So, as far as our interferometer is concerned, it can&amp;rsquo;t distinguish between a set of degenerate image brightness distributions on the sky so long as they have the exact same \(\mathcal{V}(u,v)\) values at the sampled \(u,v\) points. The unsampled \(\mathcal{V}(u,v)\) locations can take on arbitrary values and still result in the exact same dataset.&lt;/p&gt;
&lt;p&gt;Another way of saying the same thing is to think of an interferometer as a &lt;em&gt;spatial filter&lt;/em&gt;, i.e., its transfer function (the array configuration) only allows measurements of certain spatial frequencies to enter the dataset. But most images contain power at many spatial frequencies, including those that have been filtered out. So, if you just try to make an image with the spatial frequencies in your dataset, your image will most likely be missing some spatial frequencies that would be there in actuality.&lt;/p&gt;




&lt;h4 id="weighting-choices"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#weighting-choices"&gt;Weighting choices&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;The whole discussion about the ill-defined inverse problem applies &lt;em&gt;even if&lt;/em&gt; we sampled the visibility function perfectly with no noise, so long as there are still unsampled \(u,v\) points that contain significant visibility &amp;ldquo;power.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;The problem gets even more complicated when we consider measurement noise and the fact that array configurations usually sample some parts of \(u,v\) space better than others. In my opinion, this is really why various weighting schemes are as popular as they are. The common way to do this is by tuning the \(D_k\) and \(T_k\) terms. As we&amp;rsquo;ll show later in the slides, tuning the \(D_k\) terms provide a tradeoff between:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;natural weighting&lt;/strong&gt; maximizing point source sensitivity at the cost of spatial resolution (broader beam)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;uniform weighting&lt;/strong&gt; maximizing spatial resolution (narrow beam) at the cost of point source sensitivity (higher RMS noise floor in the image)&lt;/li&gt;
&lt;li&gt;&amp;ldquo;Briggs&amp;rdquo; &lt;strong&gt;robust weighting&lt;/strong&gt; a tradeoff between these two regimes, ranging from (-2 to 2). The tradeoff is non-linear, so coming from natural weighting, for example, good spatial resolution can be gained with only modest sacrifices in point source sensitivity. Or, vice versa, coming from uniform weighting, good sensitivity to point sources can be gained with only modest losses in resolution.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Typically, a reasonable starting point with ALMA observations is to use some in-between value of Briggs weighting like -0.5, 0.0, or 0.5. Different weighting choices can change your sensitivities on different spatial scales.&lt;/p&gt;
&lt;p&gt;The \(T_k\) terms are for applying a &lt;em&gt;taper&lt;/em&gt;, whereby one downweights longer baseline observations.&lt;/p&gt;




&lt;h2 id="clean"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#clean"&gt;CLEAN&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We&amp;rsquo;ve talked about how&lt;/p&gt;
&lt;p&gt;$$
I_D(l,m) = I(l, m) * B_D(l, m) \leftrightharpoons S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;Specifically, the &lt;em&gt;dirty image&lt;/em&gt; is the &lt;em&gt;convolution&lt;/em&gt; of the true sky brightness \(I\) with the dirty beam \(B_D\). We know what the dirty beam is to high precision.&lt;/p&gt;
&lt;p&gt;First, it&amp;rsquo;s important to note that convolution is a &lt;em&gt;lossy&lt;/em&gt; procedure, you (irrevocably) lose information. For example, consider applying a Gaussian blur to an image. The high resolution information in that image has been lost.&lt;/p&gt;
&lt;p&gt;CLEAN is an &lt;em&gt;image deconvolution&lt;/em&gt; algorithm. We just said that convolution is a lossy procedure, so, how does the algorithm get that information back? What follows are my own opinions about the CLEAN algorithm, its use cases I&amp;rsquo;m most familiar with in the protoplanetary disk community, and its limitations.&lt;/p&gt;
&lt;p&gt;TODO: draw a 1D cut of the dirty beam&lt;/p&gt;
&lt;p&gt;The short answer is that CLEAN can help restore an image &lt;em&gt;up to a point&lt;/em&gt;. The thing that CLEAN is best at is removing the effect of those nasty sidelobes from a dirty beam, and replacing them with a more Gaussian beam response that is usually easier to work with. CLEAN will not give you &amp;ldquo;super-resolution&amp;rdquo; access to lost spatial frequencies that you have lost, but it can help you make better looking images, and ones that have better dynamic range.&lt;/p&gt;




&lt;h3 id="iterative-processes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#iterative-processes"&gt;Iterative processes&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;CLEAN is a procedure that iteratively builds up a model image. To carry this out on the whiteboard, I&amp;rsquo;m going to do things in 1D. In a moment we&amp;rsquo;ll show an example with 2D images in the slides.&lt;/p&gt;
&lt;p&gt;TODO: draw in 1D a dirty image of a few point sources, a representation of the dirty beam, and a blank model image&lt;/p&gt;
&lt;p&gt;Before you start, we&amp;rsquo;ll define a quantity called the CLEAN beam. It&amp;rsquo;s usually chosen to be a Gaussian fit to the main lobe of the beam.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;ol&gt;
&lt;li&gt;First, we identify the &lt;em&gt;peak&lt;/em&gt; location in the dirty image.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;2a. Then, we subtract some fraction of the flux times the dirty beam from this location. This dirty image becomes a &amp;ldquo;residual image&amp;rdquo; now.&lt;/li&gt;
&lt;li&gt;2b. At the same time, we add a \(\delta\) function at corresponding location in the model image with the same amplitude as the flux we subtracted. So, if we subtracted 0.1 Jy of flux in the dirty beam, then we would add a \(\delta\) function with amplitude of 0.1 Jy in the model image.&lt;/li&gt;
&lt;li&gt;2c. You can think of these steps as equivalent, because a \(\delta\) function times the dirty beam gives you back the dirty beam. These steps are &lt;em&gt;also&lt;/em&gt; equivalently carried out in the visibility domain.&lt;/li&gt;
&lt;li&gt;
&lt;ol start="3"&gt;
&lt;li&gt;Go back to step 1, and repeat with the next-highest &lt;em&gt;peak&lt;/em&gt; location. Continue this loop until the peak flux in the image drops below some threshold&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start="4"&gt;
&lt;li&gt;Once this threshold is reached, the CLEANing is done. The final step is to put everything back together. The model image is convolved with the CLEAN beam to form the restored image. This &amp;ldquo;smooths out&amp;rdquo; the model image to some resolution limit, and hides imperfections on smaller scales.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start="5"&gt;
&lt;li&gt;The remainder of the residual image is added back to the restored image to give a sense of the &amp;ldquo;noise&amp;rdquo; in the image.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="limitations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/07-image-plane-deconvolution/#limitations"&gt;Limitations&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;CLEAN is &lt;em&gt;procedural&lt;/em&gt;. What this means is that you set parameters that guide the above process and then carry on until some termination criterion is reached. This could also be part of an interactive process. There is no guarantee that the CLEANed image is unique, either.&lt;/p&gt;
&lt;p&gt;In my opinion, CLEAN is best at removing the sidelobe effects of the dirty beam, improving the dynamic range of your image, and possibly detecting fainter (point-like) sources that would have been hidden by the sidelobes of other brighter point sources.&lt;/p&gt;
&lt;p&gt;In the above example, we said that we would use a \(\delta\) function to build up a model image. You may have already identified this choice of basis set or &amp;ldquo;CLEAN component&amp;rdquo; as a potential limitation. This works great for fields of point sources, but what about extended sources? It turns out that it actually works &lt;em&gt;OK&lt;/em&gt; for extended sources, so long as you have many of them. This adds to the computational time, though, and is why I think we&amp;rsquo;re now currently in an interesting place, approaching the limitations of CLEAN.&lt;/p&gt;
&lt;p&gt;There are other extensions to CLEAN (called multi-scale), which use CLEAN components of varying sizes, like little Gaussian blobs. This can help substantially over using just \(\delta\) functions. For very resolved structures, though, you still run into the problem that your components aren&amp;rsquo;t sufficiently like the morphology of the source you&amp;rsquo;re trying to deconvolve.&lt;/p&gt;</description></item><item><title>2D Interferometry, PSFs, Gridding, and Dirty Images</title><link>https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/</link><pubDate>Tue, 19 Sep 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/</guid><description>&lt;ul&gt;
&lt;li&gt;Slides [&lt;a href="beams-and-arrays.pdf"&gt;PDF&lt;/a&gt;] [&lt;a href="beams-and-arrays.key"&gt;Keynote&lt;/a&gt; (better for movies)]&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="review-of-last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#review-of-last-time"&gt;Review &lt;em&gt;of last time&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;We talked about R.A., Dec., and direction cosines as coordinates on the sky&lt;/li&gt;
&lt;li&gt;We introduced a two-element interferometer with multiply and add correlator backend&lt;/li&gt;
&lt;li&gt;We introduced the fringe pattern \(F(l)\) of a two-antenna interferometer, which is a cosine wave on the sky&lt;/li&gt;
&lt;li&gt;We discussed how the frequency of this fringe pattern (i.e., the &lt;em&gt;spatial frequency&lt;/em&gt;) changes as a function of baseline length between the two antennas&lt;/li&gt;
&lt;li&gt;We introduced the visibility function \(\mathcal{V}(u)\) as the Fourier transform of the sky-brightness \(I(l)\)&lt;/li&gt;
&lt;li&gt;And discussed how the output from the interferometer changes in response to a point source and an extended source&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#this-time"&gt;This time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Extend our discussion to a general interferometer with north-south and east-west baselines&lt;/li&gt;
&lt;li&gt;Discuss arrays with multiple antennas&lt;/li&gt;
&lt;li&gt;Earth aperture synthesis&lt;/li&gt;
&lt;li&gt;\(u,v\) sampling distributions for various arrays&lt;/li&gt;
&lt;li&gt;point spread functions (PSFs) and their relationship to the sampling distribution&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="wrapping-up-1d"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#wrapping-up-1d"&gt;Wrapping up 1D&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Last time, we had said the response \(R(l)\) of the interferometer to some sky distribution \(I(l)\) is to act as a convolution
$$
R(l) = \cos(2 \pi u l) * I(l).
$$
In this case, you can think of the fringe function as a &lt;em&gt;point spread function&lt;/em&gt; (a terrible one), that is convolving out the true sky brightness distribution.&lt;/p&gt;
&lt;p&gt;Another way to look at this, though, is through our old friend the multiplication/convolution algorithm. Let \(u_0\) be the (fixed, instantaneous) baseline distance between two antennas, measured in multiples of the observing wavelength
$$
u = \frac{D \cos \theta}{\lambda}
$$
where \(\theta\) is the angle from zenith.&lt;/p&gt;
&lt;p&gt;Then, the Fourier pair of the fringe function is
$$
\cos(2 \pi u_0 l) \leftrightharpoons \frac{1}{2} [\delta(u + u_0) + \delta(u - u_0)].
$$
These are two delta functions situated at \(\pm u_0\).&lt;/p&gt;
&lt;p&gt;The Fourier pair of the sky brightness distribution is called the &lt;em&gt;visibility function&lt;/em&gt;
$$
I(l) \leftrightharpoons \mathcal{V}(u).
$$&lt;/p&gt;
&lt;p&gt;In the Fourier plane, the response of the interferometer is a multiplication
$$
\cos(2 \pi u l) * I(l) \leftrightharpoons \frac{1}{2} [\delta(u + u_0) + \delta(u - u_0)] \times \mathcal{V}(u)
$$
i.e., the interferometer has &lt;em&gt;sampled&lt;/em&gt; the visibility function at locations \(\pm u_0\) corresponding to the baseline distance of the two antennas.&lt;/p&gt;
&lt;p&gt;Another way to think about this is that the interferometer acts as a &lt;em&gt;spatial filter&lt;/em&gt; that only responds to the two spatial frequencies \(\pm u_0\). \(\mathcal{V}(u)\) represents the amplitude and phase of the sinusoidal component of the intensity distribution with spatial frequency \(u\) cycles per radian. The negative spatial frequency doesn&amp;rsquo;t have a physical meaning but is a mathematical convenience. Because the intensity distribution on the sky is a real quantity, the visibility function itself is symmetric about the origin in a Hermitian sense, meaning it has real even parts and odd imaginary parts.&lt;/p&gt;




&lt;h2 id="moving-on-to-the-general-case"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#moving-on-to-the-general-case"&gt;Moving on to the general case&lt;/a&gt;
&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;We&amp;rsquo;re going to take our two-element interferometer and re-derive the same relationships using a general vector formalism&lt;/li&gt;
&lt;li&gt;Then, we&amp;rsquo;re going to introduce a general 3D cartesian coordinate set to the problem, which is used by most interferometers&lt;/li&gt;
&lt;/ol&gt;




&lt;h3 id="coordinates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#coordinates"&gt;Coordinates&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s consider a generic situation of a two-element interferometer observing (tracking) a source on the sky with phase center \(\mathbf{s}_0\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/tms-general-coord.png"
 alt="TMS Fig 3.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;TMS Fig 3.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;&lt;strong&gt;Antenna power pattern&lt;/strong&gt;: An element of the source with solid angle \(d\Omega\) at some position \(\mathbf{s} = \mathbf{s}_0 + \mathbf{\sigma}\) will contribute an element of power \( \frac{1}{2} A(\sigma) I(\sigma) \Delta \nu d\Omega\), where \(A\) is some (normalized) power pattern of a single antenna. For now, you can consider it to be a directionally smooth function that is effectively constant over the field of view of interest.&lt;/p&gt;
&lt;p&gt;From what we just talked about for the 1D example, the component of the correlator output will be equal to the received power and to the fringe term \(\cos(2 \pi \nu \tau_g)\).&lt;/p&gt;
&lt;p&gt;Let \(\mathbf{D}_\lambda\) be a baseline vector which points from the central antenna to the other one, and specifies the baseline length in multiples of the observing wavelength. Then&lt;/p&gt;
&lt;p&gt;$$
\nu \tau_g = \mathbf{D}_\lambda \cdot \mathbf{s} = \mathbf{D} \cdot (\mathbf{s}_0 + \mathbf{\sigma}).
$$&lt;/p&gt;
&lt;p&gt;To calculate the output from the correlator, we need to integrate over the spatial distribution of the source&lt;/p&gt;
&lt;p&gt;$$
r(D_\lambda, s_0) = \Delta \nu \int_{4 \pi} A(\mathbf{\sigma}) I(\sigma) \cos [2 \pi D_\lambda \cdot (s_0 + \mathbf{\sigma})]\,\mathrm{d}\Omega
$$&lt;/p&gt;
&lt;p&gt;Here we see an opportunity to use our sine/cosine difference angle formulae again to split this up into sine and cosine components and then use Euler&amp;rsquo;s formula to put it back together.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s define the complex visibility as
$$
\mathcal{V} = \int_{4 \pi} A_N(\sigma) I(\sigma) e^{-i 2 \pi D_\lambda \cdot \sigma}\,\mathrm{d}\Omega
$$
which I hope you agree looks suspiciously like a Fourier transform.&lt;/p&gt;
&lt;p&gt;When an interferometer observes a source, it is sampling the visibility function at these points (corresponding to a spatial frequency of \(\pm \mathbf{D}_\lambda \cdot \mathbf{s}_0 \)). You can think of this measurement as just recording the real and imaginary values of the visibility function, or alternatively, some &lt;em&gt;visibility amplitude&lt;/em&gt; and some &lt;em&gt;visibility phase&lt;/em&gt;
$$
\mathcal{V} = |\mathcal{V}|e^{i \phi}.
$$&lt;/p&gt;




&lt;h3 id="3d-coordinates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#3d-coordinates"&gt;3D coordinates&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we&amp;rsquo;ve introduced how the visibility function comes about in a general vector formalism, let&amp;rsquo;s get concrete with respect to coordinates on Earth and in the sky.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/tms-uv-curved.png"
 alt="Credit: TMS Fig 3.2. Note that the \(l\) and \(m\) coordinates technically index the flat image plane tangent to \(\mathbf{s_0}\), not curved as they are shown here."&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: TMS Fig 3.2. Note that the \(l\) and \(m\) coordinates technically index the &lt;em&gt;flat&lt;/em&gt; image plane tangent to \(\mathbf{s_0}\), not curved as they are shown here.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Let&amp;rsquo;s focus in on the \(u,v, w\) coordinate system in the bottom of the figure. The plane is centered at the location of one of the antennas \(u=0,v=0, w=0\). We can also draw our baseline vector \(\mathbf{D}_\lambda\) in this coordinate system, pointing to the other antenna. The \(w\) unit vector is pointing towards phase center \(\mathbf{s}_0\) (i.e., the direction of the source).&lt;/p&gt;
&lt;p&gt;Another way of saying this is
$$
\mathbf{D}_\lambda \cdot \mathbf{s}_0 = w.
$$
So, you can think of the \(u,v\) plane \(w = 0\) as oriented orthogonal to the vector pointing towards phase center. But, N.B. that the baseline vector &lt;em&gt;itself&lt;/em&gt; does not necessarily live in the \(w = 0\) plane, it can have a non-zero \(w\) component.&lt;/p&gt;
&lt;p&gt;Using this coordinate system, let&amp;rsquo;s focus on rewriting the \(\mathbf{D}_\lambda \cdot \mathbf{s}\) term. Recall that the dot product between two vectors is&lt;/p&gt;
&lt;p&gt;$$
\mathbf{a} \cdot \mathbf{b} = ||a|| \; ||b|| \cos \theta.
$$
Therefore we have
$$
\mathbf{D}_\lambda \cdot \mathbf{s} = \left ( ul + vm + wn \right).
$$
Wow, that was simple. Hopefully now you see why it was convenient to use \(l, m\) as direction cosines!&lt;/p&gt;
&lt;p&gt;\(n\) is the third direction cosine and is w.r.t. the \(w\) axis. It is not independent of \(l, m\) and can be written in terms of them as
$$
n = \sqrt{1 - l^2 - m^2}.
$$&lt;/p&gt;
&lt;p&gt;So we would normally write
$$
\mathbf{D}_\lambda \cdot \mathbf{s} = \left ( ul + vm + w\sqrt{1 - l^2 - m^2} \right).
$$&lt;/p&gt;
&lt;p&gt;This factor also appears in the solid angle differential. As we move from phase center, the solid angle is changed by a factor
$$
d \Omega = \frac{\mathrm{d}l\; \mathrm{d} m}{\sqrt{1 - l^2 - m^2}}.
$$
This is adjusting for the fact that the solid angle is something on the celestial sphere, but we are measuring it using the direction cosines on the tangent plane.&lt;/p&gt;
&lt;p&gt;With these relationships in hand, we can rewrite the visibility function as
$$
\mathcal{V}(u, v, w) = \int_{-\infty}^\infty \int_{-\infty}^\infty A_N(l,m) I(l,m) \exp \left \{ -i 2 \pi \left [ ul + vm + w \left ( \sqrt{1 - l^2 - m^2} - 1 \right )\right ] \right \} \;\frac{\mathrm{d}l\;\mathrm{d}m}{\sqrt{1 - l^2 - m^2}}.
$$&lt;/p&gt;
&lt;p&gt;The factor in the exponential comes about from the measurement of angular position with respect to phase center (\(\mathbf{D}_\lambda \cdot \mathbf{s})\), as we saw in the &amp;ldquo;general coordinates&amp;rdquo; example.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;If&lt;/em&gt; all of the measurements could be made with the antennas in a plane normal to the \(w\) direction such that \(w=0\), then we would turn this equation into an exact 2D transform. But this isn&amp;rsquo;t usually the case and we need to make approximations.&lt;/p&gt;
&lt;p&gt;So long as we are in the small-field regime and \(l\) and \(m\) are small enough such that the term
$$
\left ( \sqrt{1 - l^2 - m^2} - 1 \right ) w
$$
can be neglected (\(\simeq - \frac{1}{2}(l^2 + m^2)w\) in this regime), then we have
$$
\mathcal{V}(u, v, w) \simeq \mathcal{V}(u, v, 0) = \int_{-\infty}^\infty \int_{-\infty}^\infty \frac{A_N(l,m) I(l,m)}{\sqrt{1 - l^2 - m^2}} \exp \left \{ -i 2 \pi \left [ ul + vm \right ] \right \} \;\mathrm{d}l\;\mathrm{d}m.
$$&lt;/p&gt;
&lt;p&gt;OK! So we&amp;rsquo;ve arrived at the result that I told you about at the beginning of last week&amp;rsquo;s class, that the visibility function is the Fourier transform of the sky brightness (modified by the primary beam of each antenna, which we can mostly ignore for this discussion as a constant). The approximation we made for the \(w\) term places a limit on the maximum size of the field that we can image (at once). There are approaches designed to overcome this scenario, but at least in the context of this course we will restrict our discussion to those that don&amp;rsquo;t require it. This is generally the case for all images made with VLA or ALMA for a single pointing (i.e., imaging the full primary beam). If you use multiple pointings of the array (generally called &amp;ldquo;mosaicing&amp;rdquo;) to make an even larger image, you&amp;rsquo;ll need to take into account the effects of the \(w\) term.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revisiting &amp;ldquo;slightly extended source&lt;/strong&gt;: A &amp;ldquo;slightly extended source&amp;rdquo; is something that is larger than the dirty beam but smaller than the primary beam of each telescope. Instantaneous field of view of an interferometer is the same as the primary beam of each telescope, treated as a single dish (see previous section). Each single dish antenna is still seeing the same thing as before, it&amp;rsquo;s just that we have a correlator backend that&amp;rsquo;s doing things with the signals, allowing us to create a &lt;em&gt;synthesized beam&lt;/em&gt; that is considerably smaller than the size of the primary beam. For example, at 220 GHz (band 6), ALMA has a primary beam of about 20 arcseconds in diameter. However, it&amp;rsquo;s common to make synthesized beams on the size of 0.1 arcseconds or smaller.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s also briefly discuss what the \(u,v\) coordinate plane looks like, now that we are in 2D:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/2D-uv-plane.png"
 alt="Credit: TMS Fig 2.7"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: TMS Fig 2.7&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="units-of-mathcalv"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/06-psfs-and-gridding/#units-of-mathcalv"&gt;Units of \(\mathcal{V}\)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;What are the units of \(\mathcal{V}\) itself? We can get at this by looking at the units of \(I_\nu(l,m)\) and how we carried out the Fourier transform integral (in its simplified form).&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u, v) = \int_{-\infty}^\infty \int_{-\infty}^\infty I(l,m) \exp \left \{ -i 2 \pi \left [ ul + vm \right ] \right \} \;\mathrm{d}l\;\mathrm{d}m.
$$&lt;/p&gt;
&lt;p&gt;If we parameterized our image using \(\mathrm{Jy} / \mathrm{arcsec}^2\) and we integrated over \( \mathrm{d}l\, \mathrm{d}m\) (both assuming they had units of arcsec), then \(\mathcal{V}\) must have units of Jy. I.e., you can think of it sort of like the flux being observed at that angular scale. The visibility function is complex-valued, so if you want to discuss the &amp;ldquo;power&amp;rdquo; of an image at some angular scale then you should consider \(|\mathcal{V}|^2\).&lt;/p&gt;</description></item><item><title>Interferometry in Practice</title><link>https://iancze.github.io/courses/as5003/lectures/05-interferometry/</link><pubDate>Tue, 19 Sep 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/05-interferometry/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+5/1_2zi2doy2"&gt;Video Recording&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#this-time"&gt;This time&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The punchline of today is that the complex-valued visibility function \(\mathcal{V}\) is the 2D Fourier transform of an image on the celestial sphere (with a small field of view)&lt;/p&gt;
&lt;p&gt;$$
I(l, m) \leftrightharpoons \mathcal{V}(u, v)
$$&lt;/p&gt;
&lt;p&gt;and it is the visibility function that interferometers measure directly. The values of \(u, v\) for which interferometers are able to measure the visibility function depend on how the array of antennas is laid out and the spacing between them.&lt;/p&gt;
&lt;p&gt;Most of today&amp;rsquo;s lecture will follow Chapters 2 and 3 of &lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson. First, we will introduce a two-element interferometer and it&amp;rsquo;s response to a point source. Then, we&amp;rsquo;ll complexify this a bit to talk about an extended source (but still in 1D). Then, we&amp;rsquo;ll move on to discuss intensity distributions and the visibility function in the general case and then derive the relationship between \(I(l, m) \leftrightharpoons \mathcal{V}(u, v)\).&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ll first develop the geometry and math of this relationship for small fields of view, so that you understand the result, at least in an abstract manner. Then we&amp;rsquo;ll spend the latter part of the course working through how a radio interferometer like the VLA or ALMA works to actually sample the visibility function.&lt;/p&gt;




&lt;h2 id="ra-and-dec"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#ra-and-dec"&gt;R.A. and Dec&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s review our coordinates for images on the celestial sphere.&lt;/p&gt;
&lt;p&gt;Declination&amp;rsquo;s the easier one, in my opinion. No matter where you are, if you move in declination, you move along a great circle (i.e., a circle that actually traces the circumference of the celestial sphere). We measure this in terms of 0 (celestial equator) to +90 degrees at the north celestial pole and -90 degrees at the southern celestial pole. You can split a degree into 60 arcminutes and an arcminute into 60 arcseconds.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://skyandtelescope.org/wp-content/uploads/RA-Dec-wiki-Tom-RuenCC-BY-SA-3.0.jpg"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/05-interferometry/celestial-sphere.jpg"
 alt="Credit: Sky and Telescope"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Sky and Telescope&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Right Ascension is the one that sometimes trips me up. Because the sky rotates, we have this system of marking it using sidereal &lt;em&gt;time&lt;/em&gt;: 24 &lt;em&gt;hours&lt;/em&gt; of right ascension, sometimes broken up into 60 &lt;em&gt;minutes&lt;/em&gt;, and 60 &lt;em&gt;seconds&lt;/em&gt;. Although they are still in multiples of 60, these &lt;em&gt;minutes&lt;/em&gt; and &lt;em&gt;seconds&lt;/em&gt; we use for right ascension do not have the same angular size as arcminutes and arcseconds (even if we are on the celestial equator). For example, let&amp;rsquo;s say we have two points on the sky.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;p1 = '00h42m00s', '+41d12m'
p2 = '00h42m01s', '+41d12m'
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Same declination, but their R.A. values differ by one second. Can anyone guess how many &lt;em&gt;arcseconds&lt;/em&gt; separate these points? The answer is about 11.3 arcseconds.&lt;/p&gt;
&lt;p&gt;Usually, when we are talking about observing a smaller source (like a protoplanetary disk, or galaxy), we point in some direction towards that object and then define a small little postage stamp, commonly in units of \(\Delta \delta \) and \(\Delta \alpha \cos \delta\), in which case, the units describing the image are arcseconds. We&amp;rsquo;re still talking about spherical astronomy though, so this isn&amp;rsquo;t necessarily limited to small fields of view. We could have \(\Delta \delta\) be several degrees, for example.&lt;/p&gt;
&lt;p&gt;TODO: Figure: point relative to center, and then Delta directions coming off of it.&lt;/p&gt;




&lt;h3 id="direction-cosines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#direction-cosines"&gt;Direction cosines&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;In a moment, we&amp;rsquo;re going talk about the mechanics of interferometers observing the celestial sphere and how these relate to Fourier transforms. Before we talk about that, though, there&amp;rsquo;s a concept I want to introduce while we&amp;rsquo;re still talking about units for images.&lt;/p&gt;
&lt;p&gt;Practically speaking, most images we might make with ALMA or the VLA will have a small field of view (&amp;lt; 1 arcminute). In this regime, it simplifies a lot to talk about &amp;ldquo;flat&amp;rdquo; images, i.e., image planes that are tangent to the field center. In 1D, it would look like this&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/05-interferometry/direction-cosine.png"
 alt="The concept of the direction cosine. Credit: Fig 3.3 TMS"&gt;&lt;figcaption&gt;
 &lt;p&gt;The concept of the direction cosine. Credit: Fig 3.3 TMS&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The direction cosines are
$$
l = \sin(\Delta \alpha \cos \delta)
$$
and
$$
m = \sin(\Delta \delta)
$$
relative to the phase center. You can see from the figure that it is the direction cosine that is actually tracking the position on the tangent plane, where we are defining our image. I know I just wrote down \(\sin\) but I called these direction cosines. The term comes about from the way you&amp;rsquo;d set up the problem in two dimensions, where you might use cosine of the complementary angle instead, but it&amp;rsquo;s just a matter of convention.&lt;/p&gt;
&lt;p&gt;Because they are outputs from trigonometric functions, they are technically unitless. Though l and m are technically unitless and measures of &lt;em&gt;linear&lt;/em&gt; distance, for small angular extent, they could also be considered to have units of radians. So it will be common that we refer to
$$
l = \sin(\Delta \alpha \cos \delta) \approx \Delta \alpha \cos \delta
$$
and
$$
m = \sin(\Delta \delta) \approx \Delta \delta.
$$&lt;/p&gt;
&lt;p&gt;This probably sounds pointless, since we just arrived back at the same units we started with. Hopefully the reasons why we might wish to use these units will become apparent after we cover more about the interferometer. And remember that you can always &lt;em&gt;exactly&lt;/em&gt; convert from direction cosine back to angular usits (e.g. \(\Delta \alpha \cos \delta\)) by doing \(\sin^{-1}\), it&amp;rsquo;s just that for most small angles we&amp;rsquo;ll be dealing with, this operation is essentially an identity function. All of this goes out the window when we consider wide-field imaging (which we won&amp;rsquo;t have time to talk about in this course, unfortunately).&lt;/p&gt;
&lt;hr&gt;




&lt;h2 id="introduction-to-a-2-element-interferometer"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#introduction-to-a-2-element-interferometer"&gt;Introduction to a 2-element interferometer&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider this geometric situation&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/05-interferometry/elementary-interferometer.png"
 alt="Credit: TMS Fig 2.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: TMS Fig 2.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The antennas are spaced directly east-west, and they are observing a source in the &lt;em&gt;far-field&lt;/em&gt;, i.e., the radiation from a distant cosmic source appears as a plane wave. First, we will consider the case of a &lt;em&gt;point-source&lt;/em&gt;; later we will extend this formalism to spatially extended sources. We&amp;rsquo;ll assume the primary beam of each antenna is large, such that they can observe radiation from a source located a wide range of \(\theta\) angles. We&amp;rsquo;ll assume that we&amp;rsquo;re observing in a narrow slice of frequency around \(\nu\), essentially monochromatic.&lt;/p&gt;
&lt;p&gt;The wavefront from the source arrives at the right antenna some time
$$
\tau_g = \frac{D}{c} \sin \theta
$$
before it reaches the left one. This is called the &lt;em&gt;geometric time delay&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Each antenna has its own signal voltage stream:
$$
V_1 = \sin 2 \pi \nu t
$$
and
$$
V_2 = \sin 2 \pi \nu (t - \tau_g).
$$&lt;/p&gt;
&lt;p&gt;These streams are multiplied together in a &lt;em&gt;correlator&lt;/em&gt; and then time-averaged over some interval. The output of the correlator is proportional to
$$
F(t, \tau_g) = \sin (2 \pi \nu t) \sin 2 \pi \nu (t - \tau_g),
$$
which we can expand using our trig sum identities for sine to
$$
F(t, \tau_g) = \sin^2(2 \pi \nu t) \cos(2 \pi \nu \tau_g) - \sin(2 \pi \nu t)\cos(2 \pi \nu t) \sin(2 \pi \nu \tau_g).
$$&lt;/p&gt;
&lt;p&gt;We can simplify this equation based on our knowledge that the correlator multiplies and then adds (integrates, typically for a few seconds).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The central frequency \(\nu\) is on the order of 10s of MHz to nearly a THz&lt;/li&gt;
&lt;li&gt;\(\theta\) (baked into \(\tau_g\)) is rotating at the Earth&amp;rsquo;s rotational velocity, which is \(10^{-4}\;\mathrm{rad\,s}^{-1}\).&lt;/li&gt;
&lt;li&gt;\(D\) must be smaller than \(10^7\;\)m for terrestrial baselines&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This means that the rate of variation of \(\nu \tau_g \ll \nu t\) by several orders of magnitude.&lt;/p&gt;
&lt;p&gt;So long as our averaging interval is \(T \gg 1/\nu\) (which is satisfied by a typical multi-second integration),
$$
\langle \sin^2 (2 \pi \nu t) \rangle = 1/2
$$
and
$$
\langle \sin(2 \pi \nu t)\cos(2 \pi \nu t) \rangle = 0
$$
so we&amp;rsquo;re left with
$$
F \propto \cos (2 \pi \nu \tau_g).
$$
We can also define \(l = \sin \theta\) and then we can write
$$
F \propto \cos (2 \pi \nu \tau_g) = \cos \left (\frac{2 \pi D l}{\lambda} \right ).
$$&lt;/p&gt;
&lt;p&gt;This is called the &lt;em&gt;fringe function&lt;/em&gt; of a two-element interferometer&lt;/p&gt;
&lt;p&gt;TODO: draw as a linear relationship vs. \(l\), i.e., an oscillating sine wave
TODO: then include the fringe plot itself&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/05-interferometry/fringe.png"
 alt="The fringe function (plotted here as \(|F|\)) can be thought of as the directional power pattern of the interferometer in the case the antennas are isotropic. Credit: TMS Fig 2.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;The fringe function (plotted here as \(|F|\)) can be thought of as the directional power pattern of the interferometer in the case the antennas are isotropic. Credit: TMS Fig 2.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;So, you see that the 2-element interferometer has a sine/cosine sensitivity to the sky along the east-west axis. It has no sensitivity along the north-south direction.&lt;/p&gt;




&lt;h2 id="2-element-interferometer-for-a-spatially-resolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#2-element-interferometer-for-a-spatially-resolved-source"&gt;2-element interferometer for a spatially resolved source&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now we&amp;rsquo;ll consider a slightly more complex interferometer.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The antennas are (somewhat) directional&lt;/li&gt;
&lt;li&gt;They track the source as it moves across the sky, from the rotation of the Earth. This introduces an &lt;em&gt;instrumental&lt;/em&gt; time delay&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;$$
\tau_i = \frac{D}{c} \sin \theta_0.
$$
I.e., this keeps the waveforms in sync so long as we&amp;rsquo;re looking directly at \(\theta_0\).&lt;/p&gt;
&lt;p&gt;The direction the antennas are pointed is called the &lt;em&gt;phase reference position&lt;/em&gt;, which we&amp;rsquo;ll denote as \(\theta_0\) and tracks the source as it rotates across the sky.&lt;/p&gt;
&lt;p&gt;TODO: make a figure showing \(\Delta \theta\) offset.&lt;/p&gt;
&lt;p&gt;And let us consider radiation from a direction \(\theta_0 - \Delta \theta\), where \(\Delta \theta\) is a small angle. As before, the fringe response term is
$$
\cos (2 \pi \nu \tau) = \cos \left \{ 2 \pi \nu \left [ \frac{D}{c} \sin (\theta_0 - \Delta \theta) - \tau_i \right ] \right \}.
$$
Using the sine formulas for difference and simplifying with \(\cos \Delta \simeq 1\) for small angles, we have
$$
\cos (2 \pi \nu \tau) \simeq \cos \left [ 2 \pi \nu \frac{D}{c} \sin \Delta \theta \cos \theta_0 \right].
$$&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s stare at this equation a bit more. Assuming we&amp;rsquo;re holding observing frequency fixed, the angular resolution of the fringes is determined by the projected length of the baseline orthogonal to the direction of the source, which is \(D \cos \theta_0\). This is a pretty ordinary physical measurement of a distance, i.e., we would measure it to be something like 50 &lt;em&gt;meters&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Of course, observing frequency also makes a difference. If we&amp;rsquo;re observing at higher frequencies (shorter \(\lambda\)), the fringe resolution is going to better (this is just another form of the \(\lambda/D\) resolution relationship for telescopes showing up).&lt;/p&gt;
&lt;p&gt;So, we can define a new variable for this projected baseline length
$$
u = \frac{D \cos \theta_0}{\lambda} = \frac{\nu_0 D \cos \theta_0}{c}.
$$
\(u\) is &lt;em&gt;the number of wavelengths&lt;/em&gt; (at that observing wavelength) that are needed to span the projected baseline length. It is measured in multiples of &amp;ldquo;\(\lambda\),&amp;rdquo; i.e., you might see a baseline length described as \(u = \) 300 kilolambda. \(u\) is called a &lt;em&gt;spatial frequency&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Now we will redefine our sky coordinate variable \(l = \sin \Delta \theta\), and we find that we can write the fringe response as
$$
F(l) \propto \cos (2 \pi \nu_0 \tau) \propto \cos (2 \pi u l).
$$&lt;/p&gt;
&lt;p&gt;If \(u\) gets larger (either by moving the antennas further apart and increasing the projected baseline, or by observing at a higher frequency), then the spatial resolution (spacing of the fringes) will get better.&lt;/p&gt;
&lt;p&gt;We see that the quantity \((ul)\) appears inside of a trigonometric function, so this means the quantity must be dimensionless.&lt;/p&gt;
&lt;p&gt;This motivates the many different ways we can think about these variables in the image plane and the visibility domain.&lt;/p&gt;




&lt;h3 id="unitless"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#unitless"&gt;Unitless&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The first way is to recognize that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(l\) is technically unitless, since it is \(\sin(\Delta \theta)\)&lt;/li&gt;
&lt;li&gt;\(u\) itself is also technically unitless, it&amp;rsquo;s just the baseline length measured in a number of wavelengths (e.g., kilolambda)&lt;/li&gt;
&lt;li&gt;Inside this \(\cos\) term, though, \(u\) plays the role of a &lt;em&gt;frequency&lt;/em&gt;, i.e., &amp;ldquo;cycles per unit \(l\)&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Both of these variables do correspond to actual distances, the interferometer does have some baseline, which corresponds to its ability to resolve a source of some actual size on the sky. It&amp;rsquo;s just with this way of thinking about it, we measured both of those sizes using dimensionless units (and that&amp;rsquo;s OK)!&lt;/p&gt;




&lt;h3 id="unitful"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#unitful"&gt;Unitful&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We can put a little bit more sense to this by bringing back the small angle approximation, and saying that because \(\Delta \theta\) is small, then \(l = \sin \Delta \theta \approx \Delta \theta\), and so it is as if we measured \(l\) in some angular unit, like radians or arcseconds.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;If \(l\) is small, then we would say that it can be measured in radians (which can then be converted to arcseconds)&lt;/li&gt;
&lt;li&gt;Then \(u\) is measured in cycles/radian or cycles/arcsec&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Because of this small angle approximation for the interferometer geometry, it allows us to equate &amp;ldquo;multiples of lambda&amp;rdquo; to &amp;ldquo;cycles per angle&amp;rdquo; as a spatial frequency.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s walk through an example, and say that we&amp;rsquo;re observing a&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;do small angle approximation to say l in units of radians&lt;/li&gt;
&lt;li&gt;convert to u&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;TODO: redraw waveform. This makes a lot of sense if you just draw the waveform up on the sky.&lt;/p&gt;
&lt;p&gt;Compared to last time, we now assumed some directional sensitivity for each antenna, such as this power pattern&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/05-interferometry/primary-beam.png"
 alt="Credit: Tools of Radio Astronomy, Fig 7.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Tools of Radio Astronomy, Fig 7.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="http://www.aoc.nrao.edu/events/synthesis/2022/slides/Fundamentals-2022.pdf"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/05-interferometry/fringe-beam.png"
 alt="The fringe pattern modified by the antenna power pattern. Credit: Rick Perley&amp;rsquo;s slides, NRAO summer synthesis imaging school 2022."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The fringe pattern modified by the antenna power pattern. Credit: Rick Perley&amp;rsquo;s slides, NRAO summer synthesis imaging school 2022.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Recap: so now we&amp;rsquo;ve redefined the fringe function to talk about the response to a spatially resolved source, as a function of projected baseline length. And, we&amp;rsquo;ve introduced the concept of spatial frequency.&lt;/p&gt;




&lt;h3 id="fourier-transform-relationship"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/05-interferometry/#fourier-transform-relationship"&gt;Fourier transform relationship&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We just derived and drew the fringe function as the response of the interferometer &lt;em&gt;on the sky&lt;/em&gt; and examined how it changes as we change the position of the antennas on the ground. The essential response \(R(l)\) of the interferometer to some sky distribution \(I(l)\) is to act as a convolution
$$
R(l) = \cos(2 \pi u l) * I(l)
$$&lt;/p&gt;
&lt;p&gt;Another way to look at this is to consider the Fourier pair of the fringe function when the antennas are at a fixed (instantaneous) baseline distance \(u_0\),
$$
\cos(2 \pi u_0 l) \leftrightharpoons \frac{1}{2} [\delta(u + u_0) + \delta(u - u_0)].
$$&lt;/p&gt;
&lt;p&gt;Let us define the &lt;em&gt;visibility function&lt;/em&gt; \(\mathcal{V}(u)\) as the Fourier transform of the sky brightness distribution (the true one)&lt;/p&gt;
&lt;p&gt;$$
I(l) \leftrightharpoons \mathcal{V}(u).
$$
\(\mathcal{V}(u)\) represents the amplitude and phase of the sinusoidal component of the intensity distribution with spatial frequency \(u\) cycles per radian.&lt;/p&gt;
&lt;p&gt;Then, we can bring out our old friend the multiplication/convolution algorithm.
$$
\cos(2 \pi u l) * I(l) \leftrightharpoons \frac{\mathcal{V}(u)}{2} [\delta(u + u_0) + \delta(u - u_0)].
$$&lt;/p&gt;</description></item><item><title>The Fourier Transform II</title><link>https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/</link><pubDate>Wed, 13 Sep 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/</guid><description>



&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/34517505"&gt;Fourier Analysis and Imaging&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampling_theorem"&gt;Wikipedia on Nyqist-Shannon sampling theorem&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="review-of-last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#review-of-last-time"&gt;Review &lt;em&gt;of last time&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Defined Fourier transform, inverse
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Note that because \(x\) and \(s\) appear in the argument of the exponential, their product must be dimensionless. This means that they will also have inverse units, e.g., &amp;ldquo;seconds&amp;rdquo; and &amp;ldquo;cycles per second.&amp;rdquo;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Introduced convolution, impluse symbol, and theorems&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="where-are-we-headed-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#where-are-we-headed-today"&gt;Where are we headed &lt;em&gt;today&lt;/em&gt;?&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Finish up Fourier transform theorems&lt;/li&gt;
&lt;li&gt;Nyquist sampling theorem&lt;/li&gt;
&lt;li&gt;Discrete Fourier Transform&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="continuing-from-last-time-fourier-transform-theorems"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#continuing-from-last-time-fourier-transform-theorems"&gt;Continuing from last time: &lt;em&gt;Fourier transform theorems&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="convolutionmultiplication"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#convolutionmultiplication"&gt;Convolution/multiplication&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The convolution of two functions corresponds to the multiplication of their Fourier transforms.&lt;/p&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;and&lt;/p&gt;
&lt;p&gt;$$
g(x) \leftrightharpoons G(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f(x) * g(x) \leftrightharpoons F(s)G(s).
$$&lt;/p&gt;
&lt;p&gt;This is an &lt;em&gt;extremely&lt;/em&gt; useful theorem. At least in my career, this, and concepts related to sampling, have been the ones I have used the most often. You may have already used this theorem (numerically) if you&amp;rsquo;ve ever carried out a convolution operation using &lt;a href="https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.fftconvolve.html"&gt;scipy.signal.fftconvolve&lt;/a&gt; in Python, which can be dramatically faster than directly implementing the convolution, at least for certain array sizes.&lt;/p&gt;




&lt;h3 id="rayleighs-theorem-parsevals-theorem-for-fourier-series"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#rayleighs-theorem-parsevals-theorem-for-fourier-series"&gt;Rayleigh&amp;rsquo;s theorem (Parseval&amp;rsquo;s theorem for Fourier Series)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The amount of energy in a system is the same whether you calculate it in the time domain or in the frequency domain.&lt;/p&gt;
&lt;p&gt;The integral of the mod-squared of a function is equal to the integral of the mod-squared of its spectrum&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty |f(x)|^2\,\mathrm{d}x = \int_{-\infty}^\infty |F(s)|^2\,\mathrm{d}s.
$$&lt;/p&gt;




&lt;h3 id="autocorrelation-theorem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#autocorrelation-theorem"&gt;Autocorrelation theorem&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The autocorrelation of a function is&lt;/p&gt;
&lt;p&gt;$$
f(x) * f(x) = \int_{-\infty}^\infty f^{*}(u) f(u + x)\,\mathrm{d}u
$$&lt;/p&gt;
&lt;p&gt;and it has the Fourier transform
$$
f(x) * f(x) \leftrightharpoons |F(s)|^2.
$$&lt;/p&gt;
&lt;p&gt;Thus, the power spectrum is the Fourier transform of the autocorrelation function. It can also be computed directly by taking the &amp;ldquo;mod-squared&amp;rdquo; of \(F(s)\),&lt;/p&gt;
&lt;p&gt;$$
|F|^2 = F F^*.
$$&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;ve ever worked with (stationary) Gaussian processes (e.g., squared-exponential, Matern, etc&amp;hellip;), you might recognize this relationship between the autocorrelation (the kernel function) and the power spectrum of the Gaussian process.&lt;/p&gt;




&lt;h3 id="the-derivative-theorem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#the-derivative-theorem"&gt;The derivative theorem&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f^\prime(x) \leftrightharpoons i 2 \pi s F(s).
$$&lt;/p&gt;




&lt;h3 id="using-the-transform-pairs"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#using-the-transform-pairs"&gt;Using the transform pairs&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we have a few basic transform pairs, and some of the transform theorems, you can mix and match these to build up a library of new transform pairs. You will explore this in the problem set.&lt;/p&gt;




&lt;h3 id="definite-integral"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#definite-integral"&gt;Definite integral&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The zero-valued frequency of a Fourier transform is equal to the definite integral of a function over all space&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty f(x)\,\mathrm{d}x = F(0).
$$&lt;/p&gt;
&lt;p&gt;I.e., to compute the area under the curve, you can just read off the zero-frequency value of the Fourier transform.&lt;/p&gt;




&lt;h2 id="smoothness-and-compactness"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#smoothness-and-compactness"&gt;Smoothness and compactness&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In general, the smoother a function is in the time domain, the more compact its Fourier transform will be in the frequency domain.&lt;/p&gt;
&lt;p&gt;Smoother functions will have a larger number of continuous derivatives. Something like the Gaussian envelope \(\exp(-\pi x^2)\) is &amp;ldquo;as smooth as possible&amp;rdquo; and therefore its Fourier transform (also a Gaussian envelope) is as compact as possible.&lt;/p&gt;




&lt;h2 id="filters-and-transfer-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#filters-and-transfer-functions"&gt;Filters and transfer functions&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="time-domain"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#time-domain"&gt;Time domain&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We can say that we have some (electrical) waveform
$$
V_1(t) = A \cos (2 \pi f t)
$$ which is a single-valued function of time. You can think of this as a voltage time-series or another physical quantity. By definition, the waveform is real.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s put on our electrical/acoustical/mechanical engineering hats for a moment and consider that a &lt;em&gt;filter&lt;/em&gt; is a physical system with an input an and output, e.g., something that is transmitting vibrations or oscillations, like our waveform.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/filter.png"
 alt="How a filter changes the amplitude and phase of an waveform. Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;How a filter changes the amplitude and phase of an waveform. Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If we feed our waveform into a &lt;em&gt;linear&lt;/em&gt; filter, we get output
$$
V_2(t) = B \cos (2 \pi f t + \phi).
$$&lt;/p&gt;
&lt;p&gt;The output is still a waveform, but its amplitude and its phase have changed. These changes are likely frequency dependent, too.&lt;/p&gt;
&lt;p&gt;We can specify the filter by a frequency-dependent quantity \(T(f)\) called the &lt;em&gt;transfer function&lt;/em&gt;. It is a complex-valued function (having both an amplitude and a phase) and is given by
$$
T(f) = \frac{B}{A}e^{i \phi}.
$$&lt;/p&gt;
&lt;p&gt;Interesting, perhaps, but maybe not immediately obviously useful. Let&amp;rsquo;s introduce the &lt;em&gt;spectrum&lt;/em&gt; and then circle back to the transfer function.&lt;/p&gt;




&lt;h3 id="obtaining-v_2-using-the-frequency-domain"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#obtaining-v_2-using-the-frequency-domain"&gt;Obtaining \(V_2\) using the frequency domain&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The &lt;em&gt;spectrum&lt;/em&gt; of the waveform is the Fourier transform of \(V(t)\), which we&amp;rsquo;ll call \(S(f)\) in this section. We&amp;rsquo;ve broken slightly from our \(f \leftrightharpoons F\) notation, but \(V \leftrightharpoons S\) is a classic in the signal processing and electrical engineering fields, so we&amp;rsquo;ll at least build familiarity with it in this example.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;spectrum&amp;rdquo; here is just the Fourier transform quantity, it can definitely be complex-valued.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;spectra&amp;rdquo; that we typically talk about in astrophysics are measurements of the electromagnetic spectrum&amp;mdash;you&amp;rsquo;ve probably never come across as one that&amp;rsquo;s complex-valued, right? What&amp;rsquo;s going on here?&lt;/p&gt;
&lt;p&gt;Consider the &lt;em&gt;units&lt;/em&gt; of a flux measurement \(F_\nu\) of the electromagnetic spectrum. In lecture 1, we covered that the cgs units of flux are
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}.
$$&lt;/p&gt;
&lt;p&gt;The clue is in the \(\mathrm{ergs}\;\mathrm{s}^{-1}\) part, which we could also write in terms of &amp;ldquo;watts&amp;rdquo; if we wanted to be strictly S.I. about it. When we are measuring the electromagnetic spectrum, we are actually measuring the &lt;strong&gt;power&lt;/strong&gt; spectral density, \(|F(\nu)|^2\). The absolute squared means the quantity \(|F(\nu)|^2\) is real-valued, and is the reason why you never hear about measurements of the electromagnetic spectrum containing imaginary values!&lt;/p&gt;
&lt;p&gt;In this course, at least, we&amp;rsquo;ll try to be explicit about which spectrum we&amp;rsquo;re referring to. When we actually mean power spectrum, we&amp;rsquo;ll try to call it as such. Otherwise, &amp;ldquo;spectrum&amp;rdquo; will refer to a quantity like \(S\).&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s revisit our filter example, where we had input and output \(V_1(t)\) and \(V_2(t)\) respectively. From our discussion, we also have&lt;/p&gt;
&lt;p&gt;$$
V_1 \leftrightharpoons S_1
$$
and
$$
V_2 \leftrightharpoons S_2.
$$&lt;/p&gt;
&lt;p&gt;The transfer function concept is &lt;em&gt;especially&lt;/em&gt; useful when we think about the &lt;em&gt;spectrum&lt;/em&gt; of the waveforms, because we have
$$
S_2(f) = T(f) S_1(f).
$$&lt;/p&gt;
&lt;p&gt;I.e., the spectrum of the output waveform is simply the spectrum of the input waveform &lt;em&gt;multiplied&lt;/em&gt; by the transfer function.&lt;/p&gt;
&lt;p&gt;Once you have \(S_2\), then you can get \(V_2(t)\) from
$$
V_2 \leftrightharpoons S_2.
$$&lt;/p&gt;
&lt;p&gt;Two examples are low-pass and high-pass filters.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/Filter_%28signal_processing"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/filter-pass.png"
 alt="Examples of different filter transfer functions \(T(f)\). Credit: Wikipedia/SpinningSpark"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Examples of different filter transfer functions \(T(f)\). Credit: Wikipedia/SpinningSpark&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="obtaining-v_2-using-the-time-domain"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#obtaining-v_2-using-the-time-domain"&gt;Obtaining \(V_2\) using the time domain&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s think of digital signal processing, where you wanted to practically apply a filter to some waveform to produce a new waveform. As we just outlined, you could acquire \(V_1(t)\), Fourier transform it to access its spectrum \(S_1(f)\), multiply by the transfer function \(T_(f)\), and then do the inverse Fourier transform to get \(V_2(t)\). Is there a way to do this &lt;em&gt;directly&lt;/em&gt; in the time domain? What if you don&amp;rsquo;t have the complete waveform all at once?&lt;/p&gt;
&lt;p&gt;The answer is provided by the convolution theorem for Fourier transforms. Since the transfer function is applied via a multiplication in the Fourier domain, we could equivalently carry out the same operation by a convolution in the time domain.&lt;/p&gt;
&lt;p&gt;The convolutional kernel would be
$$
I(t) \leftrightharpoons T(f)
$$
and we&amp;rsquo;d have
$$
V_2(t) = I(t) * V_1(t).
$$&lt;/p&gt;
&lt;p&gt;To summarize,
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/square.png"
 alt="Credit Bracewell, Chapter 9."&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit Bracewell, Chapter 9.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;




&lt;h3 id="determining-it"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#determining-it"&gt;Determining \(I(t)\)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If you had some system already in place, and you wanted to determine \(I(t)\) experimentally, what is one way you could do it? What waveform could you send the system?&lt;/p&gt;
&lt;p&gt;One simple option would be to send
$$
V_1(t) = \delta(t),
$$
then
$$
V_2(t) = I(t) * \delta(t) = I(t).
$$&lt;/p&gt;




&lt;h2 id="nyquist-shannon-sampling"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#nyquist-shannon-sampling"&gt;Nyquist-Shannon sampling&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=FcXZ28BX-xE&amp;amp;ab_channel=SteveBrunton"&gt;Youtube/SteveBrunton&lt;/a&gt; on The Sampling Theorem&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Thus far we have been talking about continuous functions. As astrophysicists, though, we&amp;rsquo;re frequently dealing with discrete data points, which are presumed to be &lt;em&gt;samples&lt;/em&gt; of some unknown function. Maybe you&amp;rsquo;re the one designing the experiment to capture these data points, or maybe you&amp;rsquo;ve just been handed some dataset.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/lines.png"
 alt="Say you are given a set of (noisless) samples that look like this. What do you think the function should look like in between the points? Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;Say you are given a set of (noisless) samples that look like this. What do you think the function should look like in between the points? Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Concisely put, the &lt;em&gt;sampling theorem&lt;/em&gt; states that under a certain condition, a function can be &lt;em&gt;completely&lt;/em&gt; reconstructed from a set of discrete samples&amp;mdash;without information loss. I.e., the set of discrete samples is &lt;em&gt;fully equivalent&lt;/em&gt; to having access to the full set of function values. Today, this sampling theorem is known as the Nyquist-Shannon sampling theorem, the Whittaker–Nyquist–Shannon theorem, or simply &amp;ldquo;the sampling theorem.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;If we were to try to use these data points to actually reconstruct a function, then what sort of constraint would we need to impose on the function? We&amp;rsquo;d want to place some constraint on its spectrum, i.e., that there are no higher frequency components oscillating around faster than our sampling points.&lt;/p&gt;
&lt;p&gt;Before we dive into the derivation of the sampling theorem, let&amp;rsquo;s first take another look at what can go wrong when you undersample a time series.&lt;/p&gt;




&lt;h3 id="aliasing"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#aliasing"&gt;Aliasing&lt;/a&gt;
&lt;/h3&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/sine-wave.png"
 alt="If the true signal is given by the solid line, but we undersample it, then the sine-wave we naively reconstruct from the samples would have the wrong frequency. Here we would say that the higher frequency signal has been aliased into the lower frequency range. Credit: Wikipedia Pluke"&gt;&lt;figcaption&gt;
 &lt;p&gt;If the true signal is given by the solid line, but we &lt;em&gt;undersample&lt;/em&gt; it, then the sine-wave we naively reconstruct from the samples would have the wrong frequency. Here we would say that the higher frequency signal has been aliased into the lower frequency range. Credit: Wikipedia Pluke&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;This is also called the Stroboscopic effect. There are some nice examples online:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=dNVtMmLlnoE&amp;amp;ab_channel=SmarterEveryDay"&gt;Youtube/SmarterEveryDay&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=smDpCsVVgPA&amp;amp;ab_channel=Edyourself"&gt;Stationary helicopter&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Exoplanet transits! &lt;a href="https://ui.adsabs.harvard.edu/abs/2010ApJ...722..937D/abstract"&gt;Dawson and Fabrycky 2010&lt;/a&gt; find a shorter period for the exoplanet 55 Cnc e, previously confounded by the timing of the RV observations.&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="derivation-of-sampling-theorem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#derivation-of-sampling-theorem"&gt;Derivation of sampling theorem&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now, let&amp;rsquo;s use our understanding of Fourier transforms and the sampling/replicating function to develop a precise formulation of the sampling theorem.&lt;/p&gt;
&lt;p&gt;Recall that the &amp;ldquo;shah&amp;rdquo; function is an infinite series of delta functions spaced a unit dimension apart, and that it is its own Fourier transform
$$
\mathrm{shah}(x) \leftrightharpoons \mathrm{shah}(s).
$$&lt;/p&gt;
&lt;p&gt;Via the similarity theorem, if the delta functions of the shah get closer in the \(x\) domain, then they spread out in the Fourier domain, and vice-versa.&lt;/p&gt;
&lt;p&gt;We can adjust the spacing of the samples by dilating or shrinking the shah function by some factor. Here, we&amp;rsquo;ll write this as the sampling interval \(\Delta x = \tau\) or the sampling frequency \(1/\tau\).&lt;/p&gt;
&lt;p&gt;According to the similarity theorem, adjusting the sampling frequency in the \(x\) domain has the following effect in the frequency domain
$$
\mathrm{shah}(x/\tau) \leftrightharpoons \tau \mathrm{shah}(\tau s).
$$&lt;/p&gt;
&lt;p&gt;For example, if \(\tau = 0.2\), then we have&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/panel-b.png"
 alt="The shah function is its own Fourier transform. Via the similarity theorem, if we compress the shah function in the time domain (left), we expand it in the Fourier domain (right). Credit: Bracewell, Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;The shah function is its own Fourier transform. Via the similarity theorem, if we compress the shah function in the time domain (left), we expand it in the Fourier domain (right). Credit: Bracewell, Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now let&amp;rsquo;s consider a function and its Fourier transform&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/function.png"
 alt="A generic function (left) and its Fourier transform (right). We say that this function is &amp;lsquo;band-limited&amp;rsquo; because its Fourier transform is 0 for all frequencies above some cutoff frequency \(|s| &amp;gt; s_c\). Credit: Bracewell Fig 10.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;A generic function (left) and its Fourier transform (right). We say that this function is &amp;lsquo;band-limited&amp;rsquo; because its Fourier transform is 0 for all frequencies above some &lt;em&gt;cutoff frequency&lt;/em&gt; \(|s| &amp;gt; s_c\). Credit: Bracewell Fig 10.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;As before, we will use multiplication by the shah function to represent sampling of the function.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/sampled-function.png"
 alt="Left: the sampled version of \(f\), which has the Fourier transform on the right. So long as the sampling frequency exceeds twice the cutoff frequency, the Fourier transform &amp;lsquo;islands&amp;rsquo; do not overlap (top two rows). Credit: Bracewell Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Left: the sampled version of \(f\), which has the Fourier transform on the right. So long as the sampling frequency exceeds twice the cutoff frequency, the Fourier transform &amp;lsquo;islands&amp;rsquo; do not overlap (top two rows). Credit: Bracewell Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The non-overlappingness of the &amp;lsquo;islands&amp;rsquo; is the key to properly sampling a function, and we&amp;rsquo;ll see why in a moment when we talk about reconstruction. But first, let&amp;rsquo;s make a quantitative statement of the sampling theorem (Bracewell):&lt;/p&gt;
&lt;p&gt;If \(s_c\) is the cutoff frequency defining the band-limited nature of the signal, then so long as the function is sampled at equal intervals not exceeding \(\Delta x = 1/(2 s_c)\) then the function is properly sampled, i.e.
$$
\frac{1}{\tau} \geq 2 s_c.
$$&lt;/p&gt;




&lt;h3 id="restoration-of-signal-kernels"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#restoration-of-signal-kernels"&gt;Restoration of signal kernels&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s talk about how we would actually reconstruct the continuous function from a set of samples. Let&amp;rsquo;s re-examine our plot of the Fourier domain&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/reconstruction.png"
 alt="Credit: Bracewell Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The &amp;ldquo;function&amp;rdquo; on the left is technically not the same (continuous) function that we started with, it is a discrete representation of it. We did just say, though, that if the function was band-limited, then these samples contained all of the same information as if we had access to the full function. So how do we go from these samples back to the full function?&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s look at the Fourier side of this plot and compare it to the original Fourier side. The main difference is that &lt;em&gt;this&lt;/em&gt; Fourier plot has repeating &amp;lsquo;islands&amp;rsquo; at progressively higher frequencies, essentially to infinity. How can we get rid of these higher frequency islands?&lt;/p&gt;
&lt;p&gt;The answer is to multiply by a boxcar function in Fourier domain, completely truncating these higher order terms. Then, we can do the inverse Fourier transform and recover the original, continuous function.&lt;/p&gt;
&lt;p&gt;What is the analogous operation for the time-domain? This is the same thing as we discussed with the transfer function. Since it was a multiplication in the Fourier domain, it is a convolution in the time domain. And the convolutional kernel is the Fourier transform of the boxcar, which is a sinc function.&lt;/p&gt;
&lt;p&gt;So, to &lt;em&gt;exactly&lt;/em&gt; reconstruct a band-limited function from a set of samples, we do sinc-interpolation.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.dsprelated.com/freebooks/pasp/Windowed_Sinc_Interpolation.html"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/sinc-interpolation.png"
 alt="Credit: DSP related."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: DSP related.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="undersampling-and-aliasing"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#undersampling-and-aliasing"&gt;Undersampling and aliasing&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If we didn&amp;rsquo;t sample the function at a sufficiently high rate, then we would have overlapping islands. Essentially, the higher frequency components of the Fourier islands are &amp;ldquo;folded-over&amp;rdquo; back into the range of frequencies &lt;em&gt;we thought&lt;/em&gt; was band-limited, resulting in a corrupted signal.&lt;/p&gt;
&lt;p&gt;In an alias, a higher frequency signal is masquerading as a lower-frequency signal.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/aliased.png"
 alt="Credit: Bracewell Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="compressed-sensing"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#compressed-sensing"&gt;Compressed sensing&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;You may have heard of &amp;ldquo;compressed sensing,&amp;rdquo; which is one of major signal processing results of the last few decades. The idea is that you can reconstruct a functional form using far fewer samples than required for the Nyquist rate, using some dictionary of functional forms, or knowledege that the signal may be sparse. You can, indeed, perfectly reconstruct the signal through optimization using the \(L_1\) norm. If you don&amp;rsquo;t want to make the assumption that your signal is sparse, though, it&amp;rsquo;s a good idea to sample at the Nyquist rate.&lt;/p&gt;




&lt;h2 id="extra-fourier-series"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#extra-fourier-series"&gt;Extra: Fourier series&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;You probably first encountered Fourier series as part of your calculus course and later on as part of a partial differential equations course.
Say we have some periodic function \(g(x)\), then the Fourier series associated with this is
$$
a_0 + \sum_1^\infty (a_n \cos 2 \pi n f x + b_n \sin 2 \pi n f x)
$$
where the Fourier coefficients are determined by
$$
a_0 = \frac{1}{T} \int_{-T/2}^{T/2} g(x) \,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
a_n = \frac{1}{T} \int_{-T/2}^{T/2} g(x) \cos 2 \pi n f x \,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
b_n = \frac{1}{T} \int_{-T/2}^{T/2} g(x) \sin 2 \pi n f x \,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;i.e., we&amp;rsquo;ve projected the function onto its basis set of sines and cosines.&lt;/p&gt;
&lt;p&gt;Already, I&amp;rsquo;m sure you are starting to see the close connection with what we&amp;rsquo;ve discussed of the Fourier transform. Traditionally, Fourier series are used as a jumping off point for the discussion of the Fourier transform.&lt;/p&gt;
&lt;p&gt;In the last lecture, however, we signaled our intention to take the opposite approach, whereby we skipped over Fourier series and &lt;em&gt;started&lt;/em&gt; with the idea that Fourier transforms exist because we observe physical systems which exhibit their behavior. Now, let&amp;rsquo;s unify the discussion and demonstrate the Fourier series as an extreme situation of the Fourier transform.&lt;/p&gt;




&lt;h3 id="fourier-transform-of-sine-and-cosine"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#fourier-transform-of-sine-and-cosine"&gt;Fourier transform of sine and cosine&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s put together a number of the theorems that we&amp;rsquo;ve discussed to build up our understanding of what a Fourier series looks like in the Fourier domain.&lt;/p&gt;
&lt;p&gt;In the previous lecture we introduced the Fourier transform theorem for a delta function located at the origin&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \delta(0) \exp (-i 2 \pi x s)\,\mathrm{d}x = 1
$$
which is a constant.&lt;/p&gt;
&lt;p&gt;We also introduced the shift theorem, which says
$$
f(x - a) \leftrightharpoons \exp(- 2 \pi i a s) F(s).
$$&lt;/p&gt;
&lt;p&gt;We can couple these together and write a relationship
$$
\delta(x - a) \leftrightharpoons \exp(-2 \pi i a s).
$$&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s see if we can use this Fourier pair to derive the Fourier pairs for cosine and sine. In your quantum physics or partial differential equations classes, you probably used the Euler identity to write cosine and sine like
$$
\cos 2 \pi a s = \frac{e^{i 2 \pi a s} + e^{-i 2 \pi a s}}{2}
$$
and
$$
\sin 2 \pi a s = \frac{e^{i 2 \pi a s} - e^{-i 2 \pi a s}}{2i}
$$&lt;/p&gt;
&lt;p&gt;So we can do a bit of rearranging and arrive at&lt;/p&gt;
&lt;p&gt;$$
\cos \pi x \leftrightharpoons \mathrm{even}(x) = \frac{1}{2}\delta \left (x + \frac{1}{2} \right) + \frac{1}{2}\delta \left (x - \frac{1}{2} \right)
$$
and
$$
\sin \pi x \leftrightharpoons i\mathrm{odd}(x) = i \frac{1}{2}\delta \left (x + \frac{1}{2}\right) - i \frac{1}{2}\delta \left (x - \frac{1}{2}\right).
$$&lt;/p&gt;
&lt;p&gt;where these symbols are the even and odd impulse pairs.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/even-odd-impulse.png"
 alt="The Fourier transform pairs of cosine and sine as the even and odd impulse pairs, respectively. Credit: Bracewell Fig 6.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;The Fourier transform pairs of cosine and sine as the even and odd impulse pairs, respectively. Credit: Bracewell Fig 6.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now we&amp;rsquo;re well on our way to determining the Fourier spectrum of a Fourier series. The Fourier series is just a sum of sines and cosines at different frequencies. The Fourier transform is a linear operator, so we can just add together the contributions from each component in the Fourier domain
$$
a_0 + \sum_1^\infty (a_n \cos 2 \pi n f x + b_n \sin 2 \pi n f x)
$$&lt;/p&gt;
&lt;p&gt;We arrive at the result that the spectrum of a Fourier series is a collection of delta functions whose locations and amplitudes correspond to the frequencies and values of the Fourier coefficients, respectively.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.tutorialspoint.com/what-is-fourier-spectrum-theory-and-example#:~:text=The%20graph%20plotted%20between%20the,spectrum%20of%20a%20periodic%20signal.&amp;amp;text=Amplitude%20Spectrum%20%E2%88%92%20The%20amplitude%20spectrum,of%20Fourier%20coefficients%20versus%20frequency"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/line-spectra.png"
 alt="The amplitude of the line spectra corresponding to a Fourier series. Credit: TutorialsPoint"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The amplitude of the line spectra corresponding to a Fourier series. Credit: TutorialsPoint&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now that we&amp;rsquo;ve derived the Fourier spectrum of a Fourier series, we can see at least two reasons why this represents an extreme situation of the Fourier transform:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The input waveform is strictly periodic&lt;/li&gt;
&lt;li&gt;The input waveform is infinite in duration&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;As we talked about in the last lecture, these conditions are violated in the real world. If we limit the duration of the sine wave to a finite duration (say, by multiplication of a truncated Gaussian (because technically the Gaussian is also non-zero over an infinite domain), which we call a &lt;em&gt;window function&lt;/em&gt;), then we see what happens in the Fourier domain: the delta functions are broadened by convolution with the Fourier transform of the window function.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/broadening-line.png"
 alt="Left: the dotted line represents a broad window function to eventually make the waveform finite in duration. Right: this has the effect of broadening the delta functions by convolution with the Fourier transform of the window function. Credit: Bracewell Fig 10.12"&gt;&lt;figcaption&gt;
 &lt;p&gt;Left: the dotted line represents a broad window function to eventually make the waveform finite in duration. Right: this has the effect of broadening the delta functions by convolution with the Fourier transform of the window function. Credit: Bracewell Fig 10.12&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="the-discrete-fourier-transform-dft"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#the-discrete-fourier-transform-dft"&gt;The Discrete Fourier transform (DFT)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now we&amp;rsquo;ll talk about how we deal with samples of data. We&amp;rsquo;ll stick with the same example that we&amp;rsquo;re dealing with a function of time. But rather than \(t\), having units of seconds, we&amp;rsquo;ll simply label each data point by an index \(m\) which takes on non-negative, integer values like \(m = 0, 1, \ldots, N\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/DFT-samples.png"
 alt="Credit: Bracewell Fig 11.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 11.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The forward discrete Fourier transform (DFT) is
$$
F_k = \sum_{m=0}^{N-1} f_m \exp \left ( - 2 \pi i \frac{m k}{N} \right)
$$
and we could compute \(F_k\) for \(k = 0, 1, \ldots, N-1\).&lt;/p&gt;
&lt;p&gt;Here, the discrete index variable \(k\) has replaced the continuous-frequency variable \(s\), just like \(m\) replaced the continuous-time variable \(t\).&lt;/p&gt;
&lt;p&gt;The inverse discrete Fourier transform is
$$
f_m = \frac{1}{N} \sum_{k=0}^{N -1} F_k \exp \left ( 2 \pi i \frac{m k}{N}\right).
$$
Like the continuous-Fourier transform, one of the differences from the forward is the \(+i\) in the exponential. The other is the inclusion of the normalization pre-factor.&lt;/p&gt;
&lt;p&gt;Note: depending on whom you talk to, you&amp;rsquo;ll see a wide variety of conventions as to where the normalization prefactor goes and where the \(2 \pi\) lives. The convention presented here is the same one used by the &lt;a href="https://numpy.org/doc/stable/reference/routines.fft.html#module-numpy.fft"&gt;Python/NumPy package&lt;/a&gt; and the &lt;a href="https://juliamath.github.io/AbstractFFTs.jl/stable/api/#Public-Interface"&gt;Julia/AbstractFFTs.jl&lt;/a&gt; package, so it &lt;em&gt;should&lt;/em&gt; be the one you encounter most frequently.&lt;/p&gt;
&lt;p&gt;Like the continuous Fourier transform, if you take the DFT of a set of samples and then take the iDFT of that, you will end up with the original set of samples.&lt;/p&gt;




&lt;h3 id="units-of-the-dft"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#units-of-the-dft"&gt;Units of the DFT&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The DFT only knows/assumes that it was fed a set of equally spaced samples
$$
f_m = f(x_m)
$$
where
$$
m = 0, 1, \ldots, N - 1.
$$&lt;/p&gt;
&lt;p&gt;So, at its most abstract, the DFT takes in a bunch of \(N\) samples spaced \(\Delta x\) apart and returns \(N\) samples corresponding to the Fourier components. The frequency of each component corresponds is given by \(k/N\) in units of &amp;ldquo;cycles per sampling interval.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;I.e., so if we had \(N = 8\) samples, then the \(k=3\) frequency component returned from the DFT would be equal to \(3/8\) cycles per &amp;ldquo;the interval between samples.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;On its own, the DFT doesn&amp;rsquo;t provide any information about what type of variable \(x\) is or what the spacing is. But there is hope. We can make this concrete, we just have to be careful. Using our example of a time series with \(N=8\) samples \(\{f_m\}\), say we know that \(\Delta x = 0.1\) seconds,
$$
\Delta x = x_{m+1} -x_m
$$&lt;/p&gt;
&lt;p&gt;The spacing in the frequency domain will be 1/8 cycles per 0.1 seconds, or 1.25 Hz.&lt;/p&gt;




&lt;h3 id="dft-as-a-matrix-operation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#dft-as-a-matrix-operation"&gt;DFT as a matrix operation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Thus far we have just been talking about a &amp;ldquo;set&amp;rdquo; of samples. We can also think of the collection of data as a vector
$$
\mathbf{f} =
\begin{bmatrix}
f_0 \\
f_1 \\
\vdots \\
f_{N-1} \\
\end{bmatrix}
$$&lt;/p&gt;
&lt;p&gt;and the frequency samples as a vector too&lt;/p&gt;
&lt;p&gt;$$
\mathbf{F} =
\begin{bmatrix}
F_0 \\
F_1 \\
\vdots \\
F_{N-1} \\
\end{bmatrix}.
$$&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ve now mentioned that the Fourier transform is a linear operator a few times. Another way of demonstrating this is that we could write the DFT as a matrix multiplication
$$
\mathbf{F} = \mathbf{W} \mathbf{f}.
$$&lt;/p&gt;
&lt;p&gt;If we look back at the definition of the DFT
$$
F_k = \sum_{m=0}^{N-1} f_m \exp \left ( - 2 \pi i \frac{m k}{N} \right)
$$
hopefully we can see how this might be cast in matrix form. If we define the quantity
$$
\omega = e^{- 2 \pi i / N}
$$
then we can write
$$
W = \begin{bmatrix}
1 &amp;amp; 1 &amp;amp; 1 &amp;amp; \ldots &amp;amp; 1 \\
1 &amp;amp; \omega &amp;amp; \omega^2 &amp;amp; \ldots &amp;amp; \omega^{N - 1} \\
1 &amp;amp; \omega^2 &amp;amp; \omega^4 &amp;amp; \ldots &amp;amp; \omega^{2(N-1)} \\
\vdots &amp;amp; \vdots &amp;amp; \vdots &amp;amp; \ddots &amp;amp; \vdots \\
1 &amp;amp; \omega^{N-1} &amp;amp; \omega^{2(N-1)} &amp;amp; \ldots &amp;amp; \omega^{(N-1)(N-1)}\\
\end{bmatrix}
$$&lt;/p&gt;
&lt;p&gt;This formulation can be really useful if you&amp;rsquo;re into forward modeling using linear models. Because we can write the DFT as a matrix multiplication, it can essentially be just another linear transformation to your linear model. Determining the &amp;ldquo;best-fit&amp;rdquo; parameters can still be done analytically. We&amp;rsquo;ll talk about this a bit more when we get to the lecture on Bayesian inference.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/DFT-graphical.png"
 alt="The elements of the DFT matrix represented as samples of complex exponentials. Credit: Wikipedia/Glogger"&gt;&lt;figcaption&gt;
 &lt;p&gt;The elements of the DFT matrix represented as samples of complex exponentials. Credit: Wikipedia/Glogger&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="fast-fourier-transform-fft"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/03-fourier-transform-ii/#fast-fourier-transform-fft"&gt;Fast Fourier Transform (FFT)&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=E8HeD-MUrjY&amp;amp;ab_channel=SteveBrunton"&gt;Youtube/SteveBrunton&lt;/a&gt; on the Fast Fourier Transform&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The Fast Fourier Transform or FFT is an algorithm (class of algorithms) for computing the discrete Fourier transform. Practically speaking, if you&amp;rsquo;re going to perform a Fourier transform on discrete data with a computer, you will almost certainly use an FFT algorithm, so FFT and DFT end up being synonymous.&lt;/p&gt;
&lt;p&gt;If we have a data array of length \(N\), the complexity of the DFT is \(\mathcal{O}(N^2)\), while the complexity of the FFT is only \(\mathcal{O}(N \log N)\). This can make a huge difference in computational time if N is large. Say if you have an array of \(N = 4096\) datapoints, then you could be looking at a factor of 1000 speedup.&lt;/p&gt;
&lt;p&gt;The development and implementation of FFT packages turned the DFT from something that was too slow for many practical applications into a formidable analysis tool. This really enabled the widespread usage of the Fourier transform in signal manipulation and data analysis. I don&amp;rsquo;t think anyone would argue with you that strongly if you said that the FFT is the most important algorithm of the last century &lt;a href="http://pi.math.cornell.edu/~ajt/presentations/TopTenAlgorithms.pdf"&gt;Cornell&amp;rsquo;s top ten algorithms of the 20th century&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;There are many different FFT algorithms out there. The most popular one is the &lt;a href="https://en.wikipedia.org/wiki/Cooley%E2%80%93Tukey_FFT_algorithm"&gt;Cooley-Tukey&lt;/a&gt; algorithm and relies on the factorization of a size \(N\) DFT matrix into \(N_1\) smaller DFTs of sizes \(N_2\) in a recursive manner. Therefore there is a (historical) preference towards array sizes that powers of \(2\). However, there are algorithms out that still give \(\mathcal{O}(N \log N)\) even for prime values of \(N\), so it&amp;rsquo;s not much of a constraint in practice.&lt;/p&gt;
&lt;p&gt;We don&amp;rsquo;t have time to go into much more detail about the FFT algorithm here, but hopefully you can see from the structure of the \(\mathbf{W}\) matrix that there are plenty of opportunities to make the calculation more efficient by factoring and caching values.&lt;/p&gt;
&lt;p&gt;Now, lets look at some code examples of how we might actually use the FFT in Python.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="FFT-lecture.slides.html"&gt;Jupyter Notebook Slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>The Fourier Transform I</title><link>https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/</link><pubDate>Wed, 13 Sep 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/</guid><description>&lt;p&gt;&lt;strong&gt;Reference Materials for this lecture&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/34517505"&gt;Fourier Analysis and Imaging&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Useful (and entertaining!) introductions to the topic&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Youtube: &lt;a href="https://www.youtube.com/watch?v=spUNpyF58BY&amp;amp;ab_channel=3Blue1Brown"&gt;3Blue1Brown&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="complex-numbers"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#complex-numbers"&gt;Complex Numbers&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://en.wikipedia.org/wiki/Complex_number#Polar_complex_plane"&gt;Wikipedia Reference&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;A complex number \(z\) is one in the form of \(z = a + b i\), where \(i\) is the imaginary unit. The imaginary unit satisfies the equation
$$
i^2 = -1.
$$&lt;/p&gt;
&lt;p&gt;So a complex number has both real (\(a\)) and imaginary (\(b\)) components to it. We can represent this as a plot on the Cartesian plane:&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/File:Complex_number_illustration.svg"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/complex-cartesian.png"
 alt="Representation of a complex number on the Cartesian plane. Credit: Wolfkeeper"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Representation of a complex number on the Cartesian plane. Credit: Wolfkeeper&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Alternatively, we can also represent a complex number on the polar plane, using an amplitude \(r\) and phase \(\varphi\):&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/File:Complex_number_illustration_modarg.svg"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/complex-polar.png"
 alt="Representation of a complex number on the polar plane. Credit: Kan8eDie, based on Wolfkeeper"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Representation of a complex number on the polar plane. Credit: Kan8eDie, based on Wolfkeeper&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;$$
z = r e^{i \varphi} = r (\cos \varphi + i \sin \varphi).
$$&lt;/p&gt;
&lt;p&gt;This is closely related to Euler&amp;rsquo;s formula
$$
e^{i x} = \cos x + i \sin x
$$
for a complex sinusoid. It&amp;rsquo;s useful to keep in mind Euler&amp;rsquo;s identity:
$$
e^{i \pi} + 1 = 0.
$$&lt;/p&gt;
&lt;p&gt;It&amp;rsquo;s possible to convert from the Cartesian form to polar form and vice-versa:
$$
r = |z| = \sqrt{a^2 + b^2}
$$
and
$$
\varphi = \arg(z) = \arg(a + bi)
$$
which is most easily carried out using the &lt;code&gt;arctan2&lt;/code&gt; function, to avoid quadrant ambiguities.
$$
\varphi = \mathrm{arctan2}(b, a).
$$&lt;/p&gt;
&lt;p&gt;You can go from polar back to Cartesian by writing
$$
z = r e^{i \varphi} = r (\cos \varphi + i \sin \varphi)
$$
and then doing
$$
a = \Re(z) = r \cos \varphi
$$
and
$$
b = \Im(z) = r \sin \varphi.
$$&lt;/p&gt;




&lt;h2 id="the-fourier-transform"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#the-fourier-transform"&gt;The Fourier Transform&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;References include a mix of&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ch 2 of &lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;Appendix 2.1 of &lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;First, we&amp;rsquo;ll introduce the equations and then explain what&amp;rsquo;s going on.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forward transform&lt;/strong&gt;:
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$
also called the &amp;ldquo;minus-\(i\)&amp;rdquo; transform.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inverse transform&lt;/strong&gt;:
$$
f(x) = \int_{-\infty}^{\infty} F(s) e^{i 2 \pi x s}\,\mathrm{d}s
$$
also called the &amp;ldquo;plus-\(i\)&amp;rdquo; transform.&lt;/p&gt;
&lt;p&gt;Note that there are alternate conventions for the Fourier transform pairs, which vary as to whether the \(2 \pi\) factor appears in the exponent or as a pre-factor. We prefer the notation we&amp;rsquo;ve provided here because we find it much easier to keep track of \(2\pi\) factors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Successive transforms&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;We can show that
$$
f(x) = \int_{-\infty}^{\infty} \left [ \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x \right ] e^{i 2 \pi x s}\,\mathrm{d}s,
$$
i.e., successive transformations yield back the original function. We don&amp;rsquo;t have time to walk through the proof, but it&amp;rsquo;s available in &lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;TMS Section A2.1&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Therefore, we have
$$
f(x) \leftrightharpoons F(s),
$$
where the \(\leftrightharpoons\) denotes a Fourier transform pair. Sometimes you might also see the notation \(\leftrightarrow\). Generally, if we have functions like \(f\) or \(g\), then we denote their Fourier transform pairs with captial letters (e.g., \(F\), \(G\))).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How to think about the domains&lt;/strong&gt;
The Fourier transform maps domains from \(x\) to \(s\), where \(s\) has units of &amp;ldquo;cycles per unit of \(x\)&amp;rdquo;. E.g., if \(x\) is in units of time (seconds), then \(s\) has units of cycles/second, commonly known as hertz. The FT can also be applied to spatial distance coordinates (e.g., \(x\) in meters) and spatial coordinates on the sky (e.g., \(x\) in arcseconds).&lt;/p&gt;
&lt;p&gt;So let&amp;rsquo;s say someone is playing a chord on a musical instrument. If \(f(x)\) represents the time-series of pressure values near your ear, then \(F(s)\) represents the notes that are being played (we would probably be most interested in something like \(|F(s)|^2\) for power as a function of frequency, like a spectrogram).&lt;/p&gt;
&lt;p&gt;&lt;a href="https://musiclab.chromeexperiments.com/spectrogram/"&gt;Musical Spectrogram link&lt;/a&gt;&lt;/p&gt;




&lt;h3 id="conditions-on-existence"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#conditions-on-existence"&gt;Conditions on existence&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Given the strong evidence via physical systems that the Fourier transform of particular time-series exists (e.g., spectrums for waveforms, antenna radiation patterns), there are actually some mathematical functions whose Fourier transforms do not exist.&lt;/p&gt;
&lt;p&gt;Physical possibility (in physical systems) is a sufficient condition for the existence of its transform.&lt;/p&gt;
&lt;p&gt;Sometimes, though, we consider waveforms like&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(\sin(t)\): harmonic wave&lt;/li&gt;
&lt;li&gt;\(H(t)\): Heaviside step&lt;/li&gt;
&lt;li&gt;\(\delta(t)\): impulse&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Strictly speaking, none of these functions has a Fourier transform. What do we mean that they do not have Fourier transforms?
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$
does not converge for all values of \(s\). Consider \(\sin(t)\) integrated from \((-\infty, \infty)\)&amp;hellip; it&amp;rsquo;ll just keep oscillating about 0.&lt;/p&gt;
&lt;p&gt;None of these waveforms are actually physically possible, though, because a waveform \(\sin(t)\) would have need to been switched on an infinite time ago, a step function would need to be maintained for an infinite time, and an impulse would need to be infinitely large for an infinitely short time.&lt;/p&gt;
&lt;p&gt;So, what are the conditions for the existence of Fourier transforms.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The integral of \(|f(x)|\) from \(-\infty\) to \(\infty\) exists.&lt;/li&gt;
&lt;li&gt;Any discontinuities in \(f(x)\) are finite.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;In a physical circumstance, these conditions would be violated when there is infinite energy.&lt;/p&gt;




&lt;h3 id="transforms-in-the-limit"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#transforms-in-the-limit"&gt;Transforms in the limit&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Though we just outlined functional forms whose Fourier transforms do not &lt;em&gt;strictly&lt;/em&gt; exist, we can find a way to think such that the transforms of these functions do exist in a practical sense. This is by considering them &lt;em&gt;in the limit&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Consider a periodic function whose transform would strictly not exist, such as \(P(x) = \sin(x) \), since
$$
\int_{-\infty}^\infty |P(x)|\,\mathrm{d}x
$$
would not converge.&lt;/p&gt;
&lt;p&gt;We could modify this function ever so slightly by multiplication with a &lt;em&gt;very&lt;/em&gt; broad Gaussian envelope \(\exp(-\alpha x^2)\), where \(\alpha\) is a small positive number, then this modified version may have a transform. As we let \(\alpha \rightarrow 0\), then we approach \(P\) in the limit.
$$
\int_{-\infty}^\infty |e^{-\alpha x^2} P(x)|\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;As \(\alpha \rightarrow 0\) the transform may still not exist for all \(s\), however, we can still be quite productive with the sequence of transforms that do exist as we approach this limit. Therefore, we can practically use the Fourier transform for all physical systems that we might consider. We&amp;rsquo;ll revisit this in more detail when we talk about Fourier series, line spectra, and the sampling theorem.&lt;/p&gt;




&lt;h3 id="oddness-evenness-and-complex-conjugates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#oddness-evenness-and-complex-conjugates"&gt;Oddness, Evenness, and Complex Conjugates&lt;/a&gt;
&lt;/h3&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/even-odd.png"
 alt="An even function \(E(x)\) and an odd function \(O(x)\), followed by their sum. Credit: The Fourier Transform and Its Applications, Bracewell, Figs 2.2 and 2.3."&gt;&lt;figcaption&gt;
 &lt;p&gt;An even function \(E(x)\) and an odd function \(O(x)\), followed by their sum. Credit: The Fourier Transform and Its Applications, Bracewell, Figs 2.2 and 2.3.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;&lt;strong&gt;Even functions&lt;/strong&gt; have
$$
E(-x) = E(x)
$$
and are symmetrical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Odd functions&lt;/strong&gt; have
$$
O(-x) = -O(x)
$$
and are antisymmetrical.&lt;/p&gt;
&lt;p&gt;The sum of even and odd functions is, in general, neither even nor odd.&lt;/p&gt;
&lt;p&gt;Any function \(f(x)\) can be split unambiguously into it&amp;rsquo;s odd and even parts, though, where
$$
E(x) = \frac{1}{2} \left [ f(x) + f(-x) \right]
$$
and
$$
O(x) = \frac{1}{2} \left [ f(x) - f(-x) \right]
$$
and so we have
$$
f(x) = E(x) + O(x),
$$
where both \(E\) and \(O\) are in general complex-valued.&lt;/p&gt;
&lt;p&gt;Evenness and oddness are very useful because we can these definitions to re-write the Fourier transform as
$$
F(s) = 2 \int_0^\infty E(x) \cos(2 \pi x s)\,\mathrm{d}x - 2 i \int_0^\infty O(x) \sin (2 \pi x s)\,\mathrm{d}x.
$$&lt;/p&gt;
&lt;p&gt;We see that&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;If a function is even, its transform is even&lt;/li&gt;
&lt;li&gt;If a function is odd, its transform is odd&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;and other results summarized by Bracewell in Chapter 2:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/ft-relationships.png"
 alt="If a function has the characteristics in the left column, then its Fourier transform has the characteristics in the right column. Credit: Bracewell Ch. 2"&gt;&lt;figcaption&gt;
 &lt;p&gt;If a function has the characteristics in the left column, then its Fourier transform has the characteristics in the right column. Credit: Bracewell Ch. 2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;These relationships are extremely useful for quickly ascertaining the basic nature of the Fourier transform of any function you may encounter, as well as planning how to numerically compute transforms using &lt;code&gt;fft&lt;/code&gt; or &lt;code&gt;rfft&lt;/code&gt; packages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Complex conjugates&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If we have the complex conjugate of function \(f(x)\), denoted by \(f^*(x)\), then we have&lt;/p&gt;
&lt;p&gt;$$
f^{*}(x) \leftrightharpoons F^{*}(-s)
$$&lt;/p&gt;




&lt;h2 id="transforms-of-some-simple-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#transforms-of-some-simple-functions"&gt;Transforms of some simple functions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s practice by taking the Fourier transforms of some functions.&lt;/p&gt;




&lt;h3 id="boxcar"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#boxcar"&gt;Boxcar&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The rectangle or &amp;ldquo;boxcar&amp;rdquo; function is
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/rectangle.png"
 alt="The rectangle, or boxcar function. Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;The rectangle, or boxcar function. Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s calculate the Fourier transform. The function itself is simple, so this is mainly an exercise in choosing the right limits
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-1/2}^{1/2} e^{-i 2 \pi x s}\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;We can use Euler&amp;rsquo;s formula to write
$$
F(s) = \int_{-1/2}^{1/2} \cos(2 \pi x s) + i \sin(2 \pi x s) \,\mathrm{d}x
$$
and visually seen that the \(\sin\) term would eventually cancel itself out, or we could have relied upon the fact that we know \(\Pi(x)\) is an even function (\(O(x) = 0\)) and used
$$
F(s) = 2 \int_0^\infty E(x) \cos(2 \pi x s)\,\mathrm{d}x - 2 i \int_0^\infty O(x) \sin (2 \pi x s)\,\mathrm{d}x.
$$
to yield
$$
F(s) = 2 \int_0^{1/2} \cos(2 \pi x s)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = 2 \Big |_0^{1/2} \frac{\sin 2 \pi x s}{2 \pi s} = \frac{\sin \pi s}{\pi s} = \mathrm{sinc}(s)
$$&lt;/p&gt;
&lt;p&gt;Note that we (and Bracewell) define
$$
\mathrm{sinc}(s) = \frac{\sin \pi s}{\pi s}
$$
this is called the normalized sinc function, and (IMO) is the most useful because of it&amp;rsquo;s nice Fourier pair relationships. There is also the &amp;ldquo;unnormalized&amp;rdquo; sinc function, which doesn&amp;rsquo;t have the factors of \(\pi\) in it, but we won&amp;rsquo;t use that in this course. The &lt;strong&gt;normalized sinc function&lt;/strong&gt; has the properties that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;its peak is equal to 1 \(\mathrm{sinc}(0) = 1\)&lt;/li&gt;
&lt;li&gt;its &amp;ldquo;nulls&amp;rdquo; are located at non-zero integer values of \(n\) for \(\mathrm{sinc}(n)\)&lt;/li&gt;
&lt;li&gt;its integral from \(-\infty,\infty\) is equal to 1&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So we have the Fourier pair:
$$
\Pi(x) \leftrightharpoons \mathrm{sinc}(s)
$$
The unit rectangle (or boxcar) function has the Fourier transform pair of a normalized sinc function.&lt;/p&gt;
&lt;p&gt;This is also the same relationship that we introduced in the first lecture: the far field electric field pattern (sinc) is the Fourier transform of the electric field illuminating the aperture of the telescope (boxcar).&lt;/p&gt;




&lt;h3 id="gaussian"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#gaussian"&gt;Gaussian&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;How about the Fourier transform of a Gaussian function?
$$
f(x) = \exp \left ( -\frac{x^2}{2 a ^2} \right )
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \exp \left ( -\frac{x^2}{2 a ^2} \right ) \exp (-i 2 \pi x s)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \exp \left ( -\frac{x^2}{2 a ^2} -i 2 \pi x s \right )\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;Usually, when I see something like this, my standard approach is to start browsing books of integrals like &lt;a href="https://en.wikipedia.org/wiki/Gradshteyn_and_Ryzhik"&gt;Gradshteyn and Ryzhik&lt;/a&gt; for ideas about how I might rearrange the integrand and successfully evaluate the integral. I&amp;rsquo;d usually just go for this. But, in this case, we can actually do something using a trick you probably learned in jr. high school, called &lt;a href="https://www.mathsisfun.com/algebra/completing-square.html"&gt;completing the square&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Ok, so we&amp;rsquo;ve got terms in the exponent with \(x^2\) and \(x\), and an equation that looks like
$$
a x^2 + bx + c = 0
$$
which we want to turn into something that looks like
$$
a(x + d)^2 + e = 0.
$$&lt;/p&gt;
&lt;p&gt;The answer is that
$$
d = \frac{b}{2a}
$$
and
$$
e = c - \frac{b^2}{4 a}
$$
such that our rearranged exponent becomes
$$
-\left [ \frac{x^2}{2 a ^2} + i 2 \pi x s \right ] = -[(x - 2i \pi a^2 s)^2 + 4 \pi^2 a^4 s^2]/2a^2
$$
Why was this useful? Well, the integral is over \(x\), so we can pull out all terms that do not depend on \(x\), giving us a rearranged integral of
$$
F(s) = \exp(-2 \pi^2 a^2 s^2 ) \int_{-\infty}^\infty \exp \left (-\frac{(x - 2 i \pi a^2 s)^2}{2 a^2} \right)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;Here it is helpful to remember your Gaussian integration formulas such that
$$
\int_{-\infty}^\infty e^{-a(x + b)^2}\,\mathrm{d}x = \sqrt{\frac{\pi}{a}}.
$$&lt;/p&gt;
&lt;p&gt;Thus, the integral contributes another factor of \(\sqrt{2 \pi} a\) and the final result is&lt;/p&gt;
&lt;p&gt;$$
F(s) = \sqrt{2 \pi} a \exp(-2 \pi^2 a^2 s^2 )
$$&lt;/p&gt;
&lt;p&gt;What functional form is this? This is another Gaussian, though the normalization and standard deviation are a bit different! So, we see that the Gaussian function is a Fourier transform pair with itself.&lt;/p&gt;
&lt;p&gt;My usual approach when dealing with Gaussians is to start with the Fourier duals (Bracewell)
$$
e^{- \pi x^2} \leftrightharpoons e^{- \pi s^2}
$$
and then use the similarity and shift theorems (discussed later) to morph it into the form needed, picking up any additional prefactors as necessary.&lt;/p&gt;
&lt;p&gt;Hopefully now you have a taste of how to compute Fourier transforms. At its most basic, it&amp;rsquo;s just a matter of setting up and evaluating the integral. For many function forms, you can use integration strategies to arrive at analytic solutions. As a practical matter, in the next lecture we&amp;rsquo;ll see how we can evaluate this integral numerically.&lt;/p&gt;




&lt;h2 id="convolution"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#convolution"&gt;Convolution&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The convolution of two functions \(f(x)\) and \(g(x)\) is defined as
$$
\int_{-\infty}^\infty f(u) g(x - u)\,\mathrm{d}u
$$
and is frequently written using the \(*\) symbol. The convolution produces a new function, so we have
$$
h(x) = f(x) * g(x).
$$&lt;/p&gt;
&lt;p&gt;Convolution as a process is very useful to think of graphically&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/convolution-1.png"
 alt="A graphical representation of the convolution of two functions \(f(x)\) and \(g(x)\). The \(g\) function is flipped, shifted to \(x\), and then multiplied against \(f\). The value of the convolution \(h(x)\) is given by the integral of the multiplied product. Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;A graphical representation of the convolution of two functions \(f(x)\) and \(g(x)\). The \(g\) function is flipped, shifted to \(x\), and then multiplied against \(f\). The value of the convolution \(h(x)\) is given by the integral of the multiplied product. Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/convolution-2.png"
 alt="In general, convolution by most functions (e.g., boxcars, Gaussians, etc&amp;hellip;) results in a smoothing out of high-frequency structure. Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;In general, convolution by most functions (e.g., boxcars, Gaussians, etc&amp;hellip;) results in a smoothing out of high-frequency structure. Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/convolution-3.png"
 alt="Convolution can also be thought of as a superposition of characteristic contributions. I.e., the final function \(h(x)\) has grabbed value from nearby regions of \(f(x)\), modulated by the envelope of \(g(x)\). This paradigm is very useful for understanding interpolation, smoothing, and kernel density estimation (KDE). Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Convolution can also be thought of as a superposition of characteristic contributions. I.e., the final function \(h(x)\) has grabbed value from nearby regions of \(f(x)\), modulated by the envelope of \(g(x)\). This paradigm is very useful for understanding interpolation, smoothing, and kernel density estimation (KDE). Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Convolution is commutative&lt;/p&gt;
&lt;p&gt;$$
f * g = g * f
$$&lt;/p&gt;
&lt;p&gt;and associative&lt;/p&gt;
&lt;p&gt;$$
f * (g * h) = (f * g) * h
$$&lt;/p&gt;
&lt;p&gt;and distributive over addition&lt;/p&gt;
&lt;p&gt;$$
f * (g + h) = f * g + f * h.
$$&lt;/p&gt;
&lt;p&gt;It is a &lt;a href="https://undergroundmathematics.org/glossary/linear-operator#:~:text=A%20function%20f%20is%20called,x%20and%20all%20constants%20c."&gt;linear operator&lt;/a&gt;, just like the Fourier transform.&lt;/p&gt;




&lt;h2 id="the-impulse-symbol"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#the-impulse-symbol"&gt;The impulse symbol&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We&amp;rsquo;ll develop a notation for an intense (unit-area) pulse so brief that the measuring equipment is unable to distinguish it from pulses yet briefer still. You may be quite familiar with this concept as a &amp;ldquo;delta-function,&amp;rdquo; especially in the context of quantum physics. Physically speaking, things like &amp;ldquo;point masses&amp;rdquo;, &amp;ldquo;point charges,&amp;rdquo; and (astrophysically speaking) &amp;ldquo;point sources&amp;rdquo; do not physically exist, but they are very useful concepts. The only important attribute of an impulse is &lt;em&gt;how it reacts under integration&lt;/em&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(\delta(x) = 0\) for \(x \ne 0\)&lt;/li&gt;
&lt;li&gt;\(\int_{-\infty}^\infty \delta(x)\,\mathrm{d}x = 1\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;And, there is a close relationship between the impulse symbol and the unit step function \(H(x)\) such that
$$
\int_{-\infty}^x \delta(x^\prime)\,\mathrm{d}x^\prime = H(x).
$$&lt;/p&gt;
&lt;p&gt;Another very important property of the impulse function is its &amp;ldquo;sifting property&amp;rdquo; (TMS A2.11), such that
$$
f(a) = \int_{-\infty}^{\infty} f(x) \delta(x - a)\,\mathrm{d}x^\prime.
$$
i.e., the integral of function \(f(x)\) times a delta-function located at \(a\) will give the value of the function evaluated at \(a\), \(f(a)\).&lt;/p&gt;
&lt;p&gt;The Fourier transform of a delta function (centered on 0) is&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \delta(0) \exp (-i 2 \pi x s)\,\mathrm{d}x = 1.
$$
This is yet another important Fourier pair
$$
\delta(x) \leftrightharpoons \mathrm{constant\;amplitude\,\forall\, s}.
$$&lt;/p&gt;




&lt;h2 id="the-sampling-or-replicating-symbol-shah-function"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#the-sampling-or-replicating-symbol-shah-function"&gt;The Sampling or Replicating Symbol &amp;ldquo;Shah function&amp;rdquo;&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;This is an infinite sequence of unit impulses, given by
$$
\mathrm{shah}(x) = \sum_{n=-\infty}^\infty \delta(x - n)
$$&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/shah.png"
 alt="The replicating function. Credit: Bracewell Fig 5.4"&gt;&lt;figcaption&gt;
 &lt;p&gt;The replicating function. Credit: Bracewell Fig 5.4&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Also sometimes called a &amp;ldquo;Dirac Comb.&amp;rdquo; There is a generalization of the sifting property, such that if you multiply a function by a shah, you are effectively sampling it at unit intervals.&lt;/p&gt;
&lt;p&gt;You can use it to sample \(f(x)\) (by multiplication)&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/sampling.png"
 alt="Sampling property of the shah function by multiplication. Credit: Bracewell Fig 5.5"&gt;&lt;figcaption&gt;
 &lt;p&gt;Sampling property of the shah function by multiplication. Credit: Bracewell Fig 5.5&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;And you can use it to replicate \(f(x)\) (by convolution)&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/replicating.png"
 alt="Replicating property of the shah function by convolution. Credit: Bracewell Fig 5.6"&gt;&lt;figcaption&gt;
 &lt;p&gt;Replicating property of the shah function by convolution. Credit: Bracewell Fig 5.6&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The unit shah function is also its own Fourier transform
$$
\mathrm{shah}(x) \leftrightharpoons \mathrm{shah}(s)
$$&lt;/p&gt;




&lt;h2 id="fourier-transform-theorems-properties-similarity-convolution-multiplication"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#fourier-transform-theorems-properties-similarity-convolution-multiplication"&gt;Fourier transform theorems properties: similarity, convolution, multiplication&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are several useful properties of the Fourier transform that you&amp;rsquo;ll want to familiarize yourself with. See Bracewell Ch. 6 or TMS A2.1.2.&lt;/p&gt;




&lt;h3 id="similarity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#similarity"&gt;Similarity&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f(ax) \leftrightharpoons \frac{1}{|a|} F \left (\frac{s}{a} \right).
$$&lt;/p&gt;
&lt;p&gt;I.e., applied to waveforms and spectra, a compression of the time scale corresponds to an expansion of the frequency scale.&lt;/p&gt;




&lt;h4 id="fourier-transforms-and-the-heisenberg-uncertainty-principle"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#fourier-transforms-and-the-heisenberg-uncertainty-principle"&gt;Fourier transforms and the Heisenberg uncertainty principle&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;In a signal-processing sense, this manifests as an inability to precisely specify a signal in &lt;em&gt;both&lt;/em&gt; the time and frequency domains at the same time. As you decrease the variance of a function (i.e., make it more concentrated and thus localized) in one domain, you increase the variance of it in the other domain (i.e., make it more extended and thus dispersed).&lt;/p&gt;
&lt;p&gt;In quantum mechanics, this same concept is at play in the Heisenberg Uncertainty principle, where probability distributions (i.e., wavefunctions) governing position and momentum are related by the Fourier transform. It&amp;rsquo;s impossible to know both position and momentum precisely.&lt;/p&gt;




&lt;h3 id="shift"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/02-fourier-transform-i/#shift"&gt;Shift&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f(x - a) \leftrightharpoons \exp(- 2 \pi i a s) F(s).
$$&lt;/p&gt;
&lt;p&gt;If you shift a function, then there are no changes in the &lt;em&gt;amplitude&lt;/em&gt; of the Fourier transform, but, there are changes to its phase, dependent on &lt;em&gt;s&lt;/em&gt;. The higher the frequency, the greater the change in phase angle. In radio astronomy, it&amp;rsquo;s common to hear of this as a translation in the R.A./Dec. plane results in a &lt;em&gt;phase shift&lt;/em&gt; in the visibility plane.&lt;/p&gt;</description></item><item><title>Introduction and Course Overview</title><link>https://iancze.github.io/courses/as5003/lectures/01-introduction/</link><pubDate>Fri, 25 Aug 2023 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/as5003/lectures/01-introduction/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="slides.pdf"&gt;Slides (.pdf)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="slides.key"&gt;Slides (.key)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="references-and-resources-for-this-lecture"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#references-and-resources-for-this-lecture"&gt;References and Resources for this lecture&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Full reference information can always be found in the &lt;a href="https://iancze.github.io/courses/as5003/syllabus/"&gt;syllabus&lt;/a&gt;, under &amp;ldquo;References Materials.&amp;rdquo;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Essential Radio Astronomy&lt;/em&gt; &lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch1.html#S3"&gt;Ch 1&lt;/a&gt;: emission mechanisms, relevant astrophysical objects&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Tools of Radio Astronomy&lt;/em&gt;: radio windows, units, flux densities&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Interferometry and Synthesis in Radio Astronomy&lt;/em&gt;: single-dish observations, units, flux densities&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="course-overview"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#course-overview"&gt;Course Overview&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Welcome to AS 5003: Contemporary Astrophysics, on radio astronomy and interferometric imaging!&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://iancze.github.io/courses/as5003/syllabus/"&gt;Syllabus&lt;/a&gt; overview regarding format and tutorial schedule&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="astrophysics-at-radio-wavelengths"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#astrophysics-at-radio-wavelengths"&gt;Astrophysics at radio wavelengths&lt;/a&gt;
&lt;/h1&gt;




&lt;h2 id="emission-mechanisms"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#emission-mechanisms"&gt;Emission mechanisms&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Radio synchrotron (continuum), non-thermal emission from relativistic electrons in magnetic fields&lt;/li&gt;
&lt;li&gt;Bremsstrahlung (a.k.a. free-free) emission (continuum), thermal emission from ionized gas (H II regions)&lt;/li&gt;
&lt;li&gt;Thermal emission from cold (&amp;lt; 100 K) media, like dust (continuum)&lt;/li&gt;
&lt;li&gt;Atomic hyperfine splitting (&amp;ldquo;21-cm&amp;rdquo; line corresponding to neutral hydrogen)&lt;/li&gt;
&lt;li&gt;Molecular emission lines, primarily from rotational transitions (e.g., CO \(J=2-1\))&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch1.html#S1"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/atmospheric_windows.jpg"
 alt="Atmospheric windows for astronomy. Credit: ESA/Hubble (F. Granato) and Essential Radio Astronomy."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Atmospheric windows for astronomy. Credit: ESA/Hubble (F. Granato) and Essential Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The earth&amp;rsquo;s atmosphere is very transparent in the radio region of the electromagnetic spectrum, especially compared to optical windows. Only towards the microwave region (wavelengths around 1 mm), does the atmospheric transmission start to decline.&lt;/p&gt;




&lt;h2 id="astrophysical-objects"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#astrophysical-objects"&gt;Astrophysical objects&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In truth, almost everything these days!&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;quasars&lt;/li&gt;
&lt;li&gt;gamma ray burst (GRB) afterglows&lt;/li&gt;
&lt;li&gt;fast radio bursts (FRBs)&lt;/li&gt;
&lt;li&gt;pulsars&lt;/li&gt;
&lt;li&gt;supernovae remnants&lt;/li&gt;
&lt;li&gt;cosmic microwave background (CMB)&lt;/li&gt;
&lt;li&gt;galaxies, including molecules at high redshift sources&lt;/li&gt;
&lt;li&gt;dust/interstellar medium&lt;/li&gt;
&lt;li&gt;the Sun&lt;/li&gt;
&lt;li&gt;planets (e.g., Jupiter, Uranus)&lt;/li&gt;
&lt;li&gt;protoplanetary disks&lt;/li&gt;
&lt;li&gt;molecular clouds (molecular emission)&lt;/li&gt;
&lt;li&gt;black hole accretion disks (EHT)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.nasa.gov/mission_pages/hubble/science/hercules-a.html"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/hercules.jpg"
 alt="Radio jets from the elliptical galaxy Hercules A (overlaid with an optical image from Hubble). Karl Jansky VLA. Credit: NASA, ESA, S. Baum and C. O&amp;rsquo;Dea (RIT), R. Perley and W. Cotton (NRAO/AUI/NSF), and the Hubble Heritage Team (STScI/AURA)."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Radio jets from the elliptical galaxy Hercules A (overlaid with an optical image from Hubble). Karl Jansky VLA. Credit: NASA, ESA, S. Baum and C. O&amp;rsquo;Dea (RIT), R. Perley and W. Cotton (NRAO/AUI/NSF), and the Hubble Heritage Team (STScI/AURA).&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://public.nrao.edu/gallery/hl-tau-birth-of-planets-revealed-in-astonishing-detail-2/"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/HLTau_ALMA.jpg"
 alt="The protoplanetary disk around HL Tau, imaged using the Atacama Large Millimeter Array. Credit: ALMA(ESO/NAOJ/NRAO); C. Brogan, B. Saxton (NRAO/AUI/NSF)"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The protoplanetary disk around HL Tau, imaged using the Atacama Large Millimeter Array. Credit: ALMA(ESO/NAOJ/NRAO); C. Brogan, B. Saxton (NRAO/AUI/NSF)&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://www.almaobservatory.org/en/press-releases/planetary-rings-of-uranus-glow-in-cold-light/"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/uranus-rings.png"
 alt="The rings of Uranus seen by ALMA (thermal emission from 77 K material). Credit: ALMA (ESO/NAOJ/NRAO); E. Molter and I. de Pater."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The rings of Uranus seen by ALMA (thermal emission from 77 K material). Credit: ALMA (ESO/NAOJ/NRAO); E. Molter and I. de Pater.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h1 id="single-dish-radio-telescopes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#single-dish-radio-telescopes"&gt;Single-dish radio telescopes&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;In this section, we&amp;rsquo;ll cover existing single-dish radio telescopes and refresh our memory of basic telescope performance.&lt;/p&gt;
&lt;p&gt;The same
$$
\theta \approx \frac{\lambda}{D}
$$
applies for radio antennas. Take a \(\lambda = 1\;\mathrm{cm}\) observation, for example. Compared to an optical \(\lambda = 500\;\mathrm{nm}\) telescope the same size, the resolution will be a factor of
$$
\frac{1\;\mathrm{cm}}{500\;\mathrm{nm}} = 20,000
$$
worse. Yikes!&lt;/p&gt;
&lt;p&gt;Radio astronomers are constantly trying to find ways to increase angular (spatial) resolution.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;One way is to build bigger telescopes, such as the Green Bank Telescope (100m diameter)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://public.nrao.edu/gallery/green-bank-telescope/"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/gbt.jpg"
 alt="The Green Bank Telescope (100m diameter) operates at radio wavelegths. Credit: NRAO/AUI/NSF"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Green Bank Telescope (100m diameter) operates at radio wavelegths. Credit: NRAO/AUI/NSF&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;Another way is to work at higher frequencies (shorter wavelengths), e.g. sub-mm radio astronomy (IRAM 30m telescope)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.iram-institute.org/EN/30-meter-telescope.php"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/iram.jpg"
 alt="The IRAM 30m diameter telescope, which operates at sub-mm wavelengths. Credit: Wikipedia/IRAM-gre"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The IRAM 30m diameter telescope, which operates at sub-mm wavelengths. Credit: Wikipedia/IRAM-gre&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;A final way is to use &lt;em&gt;interferometry&lt;/em&gt;, sometimes also at sub-mm wavelengths (ALMA), which will be the main focus of this course&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It&amp;rsquo;s easier to build larger telescopes at longer wavelengths because the tolerances required for the reflecting surface are less strict than at optical wavelengths. Though typically one must keep surface tolerances to within
$$
\sigma = \frac{\lambda_\mathrm{min}}{16},
$$
otherwise the efficiency of the antenna starts to decline substantially. For the 100 m diameter GBT operating at it&amp;rsquo;s highest frequency (100 GHz) or 3 mm, this translates to \(200\;\mu\mathrm{m}\), which is the thickness of two sheets of paper! That&amp;rsquo;s quite an engineering challenge, and is the reason why large, steerable dishes are difficult to build.&lt;/p&gt;
&lt;p&gt;Keeping telescopes fixed is one way to build a little bit bigger, such as &lt;a href="https://en.wikipedia.org/wiki/Five-hundred-meter_Aperture_Spherical_Telescope"&gt;FAST&lt;/a&gt;, which is a five hundred meter diameter fixed telescope in China. See also Arecibo, which unfortunately collapsed in December 2020. Eventually, though, the materials/engineering cost to building large single dish telescopes becomes prohibitive.&lt;/p&gt;




&lt;h2 id="single-dish-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#single-dish-observations"&gt;Single dish observations&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The &amp;ldquo;beam&amp;rdquo; of a receiving antenna, or power pattern as a function of direction, can be calculated using the reciprocity theorems for transmitting and receiving antennas. These state that the far field electric field pattern \(f(l)\) is the Fourier transform (much more in lectures 2 and 3!) of the electric field illuminating the aperture of the telescope \(g(u)\).&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S1"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/uniform-dish.png"
 alt="A schematic illustration of (top): Uniformly illuminated aperture (middle): The electric field pattern of the antenna, as a function of direction (bottom): The power pattern of the antenna, as a function of direction. Credit: Essential Radio Astronomy"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A schematic illustration of (top): Uniformly illuminated aperture (middle): The electric field pattern of the antenna, as a function of direction (bottom): The power pattern of the antenna, as a function of direction. Credit: Essential Radio Astronomy&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;For large apertures, the nulls at \( l = \pm 1, 2, \ldots\) appear at the angles \(\theta \approx \lambda/D, 2 \lambda/D, \ldots\). In two dimensions, for a circular aperture, this is an Airy pattern.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/beam-pattern.png"
 alt="A beam power pattern plotten in polar coordinates, demonstrating that the antenna can pick up power from sidelobes at range of angles. Credit: Tools of Radio Astronomy."&gt;&lt;figcaption&gt;
 &lt;p&gt;A beam power pattern plotten in polar coordinates, demonstrating that the antenna can pick up power from sidelobes at range of angles. Credit: Tools of Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;This is an idealized representation, but is still helpful. The beam can pick up power through sidelobes at a range of angles. Directional antennas help concentrate power in the main beam, but antennas with secondary stages (and thus supports, like most telescopes) create opportunities for ground radiation to reflect into the receiver.&lt;/p&gt;
&lt;p&gt;You can think of single-dish telescopes (unless they have a sophisticated, multi-pixel receiver) essentially as single-pixel devices. So to make a map of the sky, you would need to raster scan the telescope across the region of interest, reading out antenna temperature as a function of RA, Dec. To make a good (i.e., scientifically accurate) map, you should focus on Nyquist sampling the sky to a uniform sensitivity, usually through a hexagonal pattern of dithering. More advanced instruments may have an array of &amp;ldquo;feeds&amp;rdquo; in a focal plane (mirroring a set of &amp;ldquo;pixels&amp;rdquo;), but this is still a small number of pixels compared to a typical CCD (e.g., 25 or 36 compared to \(2046^2\)).&lt;/p&gt;




&lt;h2 id="image-units"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#image-units"&gt;Image Units&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="radiative-transfer-recap"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#radiative-transfer-recap"&gt;Radiative transfer recap&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;What are the units of the sky, and the images we make as representations of it?&lt;/p&gt;
&lt;p&gt;One of the most useful quantities from radiative transfer is \(I_\nu\).&lt;/p&gt;
&lt;p&gt;\(I_\nu \) is the &lt;em&gt;specific intensity&lt;/em&gt; of radiation, you can think of it as the energy carried along by an infinitesimal &amp;ldquo;bundle&amp;rdquo; of rays.&lt;/p&gt;
&lt;p&gt;It has dimensions of:
$$
\mathrm{energy}\; (\mathrm{time})^{-1} \;(\mathrm{area})^{-1} \;(\mathrm{solid\,angle})^{-1} \; (\mathrm{frequency})^{-1}
$$
in CGS units, we would write
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{ster}^{-1}\;\mathrm{Hz}^{-1}
$$
In astronomical settings, I&amp;rsquo;ve always seen \(I_\nu\) referred to as the &amp;ldquo;specific intensity.&amp;rdquo; In non-astronomy settings, I&amp;rsquo;ve seen &amp;ldquo;spectral intensity.&amp;rdquo; If \(I_\nu\) is integrated over all frequencies, it&amp;rsquo;s called the radiant intensity \(I\).&lt;/p&gt;
&lt;p&gt;\(I_\nu \) can be a little mind-bending to think about&amp;hellip; it can be a function of&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;3D space \(\vec{x}\)&lt;/li&gt;
&lt;li&gt;direction \(\vec{\Omega}\)&lt;/li&gt;
&lt;li&gt;frequency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Intensity itself is &lt;em&gt;not&lt;/em&gt; a vector quantity; rather it is a scalar field that &lt;em&gt;is&lt;/em&gt; a function of position and direction \(I_\nu(\vec{x}, \vec{\Omega})\). Rybicki and Lightman write the angular direction vector as \(\vec{\Omega}\) and the solid angle surrounding that vector as \(\mathrm{d}\Omega\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/rybicky_lightman_1.2.png"
 alt="The geometry surrounding the concept of specific intensity. The normal vector is \(\vec{\Omega}\), the position \(\vec{x}\) in 3D space corresponds to the location of the \(dA\) patch. The Credit: Radiative Processes, Figure 1.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;The geometry surrounding the concept of specific intensity. The normal vector is \(\vec{\Omega}\), the position \(\vec{x}\) in 3D space corresponds to the location of the \(dA\) patch. The Credit: Radiative Processes, Figure 1.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If we have a defined reference frame, we would probably write \(\vec{\Omega}\) as a vector in spherical coordinates and define the components along \(\hat{\phi}\) and \(\hat{\theta}\) and let \(\mathrm{d}\Omega = d\phi \sin \theta d\theta\).&lt;/p&gt;
&lt;p&gt;When we are making astrophysical &lt;em&gt;observations&lt;/em&gt; from the earth, though,
we are making or acquiring images of regions of the celestial sky. So we generally talk of \(I_\nu(\alpha, \delta)\), where \(\alpha\) and \(\delta\) are R.A. and declination offsets from some direction, respectively. We&amp;rsquo;re always looking from the same place (at least compared to the size of the universe), so we don&amp;rsquo;t worry about specifying position within 3D space. But if we went to the Andromeda Galaxy and started mapping the celestial sky, we would need to, then. In the end though, images are have the same units because they represent specific intensity. It&amp;rsquo;s very common to refer to \(I_\nu(\alpha, \delta)\) as the &lt;em&gt;surface brightness&lt;/em&gt;.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/solid-angle.png"
 alt="When we&amp;rsquo;re discussing images of astronomical sources, we&amp;rsquo;re usually using RA \(\alpha\) and Dec \(\delta\). A solid angle simply describes the area on a unit sphere (e.g., the sky), the area itself need not be circular. The Very Large Array, located in Socorro, NM. Credit: NRAO"&gt;&lt;figcaption&gt;
 &lt;p&gt;When we&amp;rsquo;re discussing images of astronomical sources, we&amp;rsquo;re usually using RA \(\alpha\) and Dec \(\delta\). A solid angle simply describes the area on a unit sphere (e.g., the sky), the area itself need not be circular. The Very Large Array, located in Socorro, NM. Credit: NRAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="flux"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#flux"&gt;Flux&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Once you&amp;rsquo;ve defined \(I_\nu\), then it&amp;rsquo;s relatively easy to calculate quantities like energy density, flux, momentum, etc, as integrals of the specific intensity field.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Flux&lt;/strong&gt; is where you integrate out the angular dependence:&lt;/p&gt;
&lt;p&gt;$$
F_\nu = \int I_\nu \cos \theta d \Omega
$$
(intensity passing through some differential area \(dA\), lowered by the effective angle).&lt;/p&gt;
&lt;p&gt;\(F_\nu\) has units of
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}
$$
(i.e., angular dependence has been integrated out).&lt;/p&gt;
&lt;p&gt;Most astrophysical sources produce significantly less energy in radio waves compared to higher frequency bands, and so the raw CGS unit can be quite cumbersome. To make this easier, astronomers use a unit called the &amp;ldquo;Jansky,&amp;rdquo; which is defined as&lt;/p&gt;
&lt;p&gt;$$
1\,\mathrm{Jy} = 10^{-23}~\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}
$$&lt;/p&gt;
&lt;p&gt;The Jansky is a unit of flux.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do we report specific intensities/surface brightnesses for radio sources, then?&lt;/strong&gt; We can, reintroduce the &amp;ldquo;per solid angle&amp;rdquo; to the Jansky, for example
$$
\mathrm{Jy}\;\mathrm{arcsec}^{-2}.
$$&lt;/p&gt;
&lt;p&gt;Later on the course, we&amp;rsquo;ll talk about \(\mathrm{Jy}\;\mathrm{beam}^{-1}\), which is another unit of surface brightness/specific intensity.&lt;/p&gt;
&lt;p&gt;Other units of surface brightness that you might encounter at other wavelengths include \(\mathrm{mag}\;\mathrm{arcsec}^{-2}\) (optical) and \(\mathrm{MJy}\;\mathrm{sr}^{-1}\) (infrared).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Questions for review&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;What is the name of \(I_\nu\), and what are its units?&lt;/li&gt;
&lt;li&gt;What is the name of \(F_\nu\), and what are its units?&lt;/li&gt;
&lt;li&gt;If we made an astronomical observation of a &amp;ldquo;point source,&amp;rdquo; would we report \(I_\nu\) or \(F_\nu\)?&lt;/li&gt;
&lt;li&gt;What about for a spatially resolved source?&lt;/li&gt;
&lt;li&gt;Is a Jansky a unit for \(I_\nu\) or \(F_\nu\)?&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/point-source-resolved-source.png"
 alt="Using \(F_\nu\) and \(I_\nu\) to represent point sources and spatially resolved sources, respectively. Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;Using \(F_\nu\) and \(I_\nu\) to represent point sources and spatially resolved sources, respectively. Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="the-many-temperatures-of-radio-astronomy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#the-many-temperatures-of-radio-astronomy"&gt;The many temperatures of radio astronomy&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="from-cosmic-sources-only"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#from-cosmic-sources-only"&gt;From Cosmic Sources only&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;brightness temperature&lt;/strong&gt;: \(T_B\), the temperature such that a blackbody would emit with the observed specific intensity. &lt;strong&gt;N.B.&lt;/strong&gt;: &lt;em&gt;One needs to be very careful of the context when using brightness temperature.&lt;/em&gt; Classic radio astronomers will &lt;em&gt;define&lt;/em&gt; brightness temperature using the Rayleigh-Jeans definition (&lt;em&gt;Essential Radio Astronomy&lt;/em&gt; by Condon and Ransom, &lt;em&gt;Tools of Radio Astronomy&lt;/em&gt; by Wilson et al.)
$$
T_B \equiv \frac{c^2}{2 k} \frac{1}{\nu^2} I_\nu
$$
&lt;em&gt;even when&lt;/em&gt; the Rayleigh-Jeans version of the blackbody formula is not valid for the given temperature and observing frequency. This may not be physically accurate, but has the advantage that intensity and brightness temperature are always linearly related (i.e., a 20% increase in \(T_B\) corresponds to a 20% increase in \(I_\nu\)).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Other, more physics-minded resources will define brightness temperature using the full form of the Planck formula (&lt;em&gt;Physics of the Interstellar and Intergalactic Medium&lt;/em&gt;, Draine).
$$
I_\nu = \frac{2 h \nu^3}{c^2} \frac{1}{\exp{(h \nu / k T_B)} - 1}
$$
such that
$$
T_B \equiv \frac{h \nu / k}{\ln[1 + 2 h \nu^3 / c^2 I_\nu].}
$$
If the source is in thermal equilibrium, this form has the advantage that \(T_B\) does correspond to a physical temperature. Draine uses the term &lt;strong&gt;antenna temperature&lt;/strong&gt; to refer to brightness temperature arrived at using the Rayleigh-Jeans definition.&lt;/p&gt;
&lt;p&gt;Unfortunately, this ambiguity makes communicating using brightness temperature (a great concept!) confusing and error-prone. If there is any ambiguity about whether the Rayleigh-Jeans approximation is valid \(h \nu \ll k T_B \) for all observed regions in your field of view, then you should state which form of the blackbody formula you are using in your figure/article.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;antenna temperature&lt;/strong&gt;: if you are in a radio astronomy context, you will also come across this terminology, which is functionally the same as the classic radio astronomer&amp;rsquo;s definition of brightness temperature. Key is that specific intensity is &lt;em&gt;linearly related&lt;/em&gt; to antenna temperature and makes it easy to substitute one for the other using the Rayleigh-Jeans approximation
$$
T_A(\nu) = \frac{c^2}{2 k \nu^2} I_\nu
$$
Typically, antenna temperature is used to describe a &lt;em&gt;measured&lt;/em&gt; temperature from the instrument in the broader context of other noise sources.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In the field of radio astronomy, be aware that one frequently combines temperatures in other interesting ways. One can express random noise power in terms of an effective temperature
$$
P = k T \Delta \nu
$$
where \(\Delta \nu\) is the bandwidth of the observation. Here the power is equal to the noise power delivered to a &lt;strong&gt;matched load&lt;/strong&gt; by a resistor at physical temperature \(T\). By matched load, we mean we connect a resistor to the input terminals of a linear amplifier. The fact that this resistor has some temperature (i.e., we haven&amp;rsquo;t cooled it to absolute zero&amp;hellip;) means that the thermal motion of the electrons will produce a random, variable current \(i(t)\) input to the amplifier. The mean value of this current is zero, but the root mean squared value is non-zero, and this represents a non-zero power. I.e., you can draw (some) power from a resistor at room temperature, purely from thermal motions. The situation is not dissimilar to the random walk of a particle in Brownian motion. For more details, see &lt;em&gt;Tools of Radio Astronomy&lt;/em&gt;, Chapter 1.8.&lt;/p&gt;




&lt;h3 id="including-noise-sources"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#including-noise-sources"&gt;Including noise sources&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Antenna temperature&lt;/strong&gt; \(T_A\) the component of the power received by the antenna from &lt;em&gt;cosmic sources&lt;/em&gt;. It has the same interpretation as before (though we&amp;rsquo;ll talk about beam dilution in a second).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Receiver temperature&lt;/strong&gt; \(T_R\) the component of the power from internal noise of the receiver components themselves, ground radiation, atmospheric emission, etc&amp;hellip;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;System temperature&lt;/strong&gt; \(T_S = T_A + T_R\) is the sum of receiver temperature and antenna temperature. It&amp;rsquo;s the one power number coming out of the backend of your telescope. It&amp;rsquo;s up to you to calibrate \(T_R\) accurately enough to measure \(T_A\).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In any observation, you will have your cosmic signal of interest and several contributions of noise (see &lt;em&gt;Essential Radio Astronomy&lt;/em&gt;, Ch. 3.6.1)
$$
T_S=T_\mathrm{cmb}+T_\mathrm{rsb}+T_A + [1−\exp(−\tau_A)] T_\mathrm{atm}+T_\mathrm{spill}+T_r+ \ldots
$$
such as the CMB, other galactic background sources, the atmosphere, spillover radiation from the ground, the temperature of the radiometer itself (hopefully cryogenically cooled), etc.&lt;/p&gt;
&lt;p&gt;In the limit that \(T_A \ll T_S\) (most astronomy situations, unfortunately!), we have
$$
S/N \approx C \frac{T_A}{T_S} \sqrt{\Delta \nu \Delta t}
$$
where \(C\) is a constant of proportionality greater than or equal to 1, and \(\Delta t\) is the integration time. If we let \(\Delta \nu \approx 1\;\mathrm{GHz}\) and \(\Delta t \approx 1\;\mathrm{h}\), then we can get \(\sqrt{\Delta \nu \Delta t} \approx 10^6\), allowing us to detect a signal which is less than \(10^{-6}\) the system noise. A great illustration of this capability is the COBE satellite that studied CMB anisotropies with brightness temperatures \(&amp;lt; 10^{-7}\) that of the system temperature. To achieve these contrasts, however, it&amp;rsquo;s important to keep systematics under control, otherwise the S/N scaling won&amp;rsquo;t hold!&lt;/p&gt;




&lt;h2 id="beam-dilution"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#beam-dilution"&gt;Beam dilution&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In previous lectures, we&amp;rsquo;ve been talking about specific intensity \(I_\nu(\Omega)\) as a &amp;ldquo;known&amp;rdquo; quantity of direction (e.g., R.A., Dec.) and used antenna temperature \(T_A\) as a linear proxy for its specification. For the following discussion, we&amp;rsquo;re going to move into the realm of observations, and discuss the ways \(T_A\) can be an unfaithful proxy for the &amp;ldquo;true&amp;rdquo; specific intensity distribution or brightness temperature. we&amp;rsquo;ll use the symbol \(T_b(\Omega)\) to denote the &amp;ldquo;true&amp;rdquo; brightness/antenna temperature, assuming we&amp;rsquo;re in the Rayleigh-Jeans limit, and redefine \(T_A\) to mean the response of the telescope to the cosmic radiation.&lt;/p&gt;
&lt;p&gt;When we&amp;rsquo;re doing observations, we don&amp;rsquo;t always have access to the highest resolution version of \(T_b(\Omega)\), but rather we have access to a quantity which is the true \(T_b(\Omega)\) convolved with the beam of the telescope, which is the implication of \(T_A\) for this discussion.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/beam-dilution.png"
 alt="When astrophysical sources are insufficiently resolved, our measurements of \(I_\nu\) do not trace the true sky distribution, specifically features are smeared out over a spatial extent and peak intensities are reduced. This means that using these insufficiently resolved measurements of \(T_A\) will not accurately trace the true underlying temperatures of the astrophysical source (even if the emission from the source is actually thermal in origin). Of course, we never have infinite spatial resolution, so there will always be structure on scales beyond that of our observations. Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;When astrophysical sources are insufficiently resolved, our measurements of \(I_\nu\) do not trace the true sky distribution, specifically features are smeared out over a spatial extent and peak intensities are reduced. This means that using these insufficiently resolved measurements of \(T_A\) will not accurately trace the true underlying temperatures of the astrophysical source (even if the emission from the source is actually thermal in origin). Of course, we never have infinite spatial resolution, so there will always be structure on scales beyond that of our observations. Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h4 id="large-fully-resolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#large-fully-resolved-source"&gt;large (fully resolved) source&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;In the limit that we are observing a source that subtends a solid angle much larger than the beam of the antenna,
$$
\Omega_S &amp;gt; \Omega_A
$$
the convolution of the beam doesn&amp;rsquo;t matter, we&amp;rsquo;re still sensing approximately the same \(T_b(\Omega)\) such that
$$
T_A(\Omega) \approx T_b(\Omega).
$$
If the source is in LTE, then we also have that \(T_A \approx T\).&lt;/p&gt;




&lt;h3 id="small-unresolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#small-unresolved-source"&gt;small (unresolved) source&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If the source is more compact than the antenna beam,
$$
\Omega_S &amp;lt; \Omega_A,
$$
then the measured antenna temperature is basically the &amp;ldquo;true&amp;rdquo; intensity averaged over the area of the main beam. This lowers the measured antenna temperature by a factor
$$
\frac{T_A}{T_b} = \frac{\Omega_S}{\Omega_A}
$$
where the ratio \(\frac{\Omega_S}{\Omega_A}\) is called the &lt;strong&gt;beam filling factor&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;For example, you could have a compact source with \(T_b = 10^4\) K, but if it only fills 1% of the beam solid angle then you would measure an antenna temperature of 100 K. If you took your observations at face-value (and assumed LTE), then you would incorrectly conclude that the source is 100x cooler than it actually is.&lt;/p&gt;
&lt;p&gt;Beam dilution also applies to observations of sources that have &lt;em&gt;structure&lt;/em&gt; on spatial scales below the observable limit, which, to be honest, is going to be most astrophysical sources of interest. For example, consider a gas filament in a star-forming region.&lt;/p&gt;
&lt;p&gt;Radio-bright, spatially concentrated regions will be &amp;ldquo;smeared out&amp;rdquo; by the beam. If you wanted to use antenna temperature (and assume LTE) to estimate the physical conditions of the gas filament, you do so at the peril of measuring incorrect temperatures. The unfortunate reality here is that, without higher resolution images to guide you (which sometimes exist at optical or infrared wavelengths), it&amp;rsquo;s quite difficult to estimate how badly your measurements are affected by beam dilution.&lt;/p&gt;
&lt;p&gt;For more useful single-dish guidance, see &lt;a href="https://www.atnf.csiro.au/research/radio-school/2011/talks/Parkes-school-Fundamental-II.pdf"&gt;these notes&lt;/a&gt; by James Jackson, or &lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S1.SS6"&gt;Ch 3.1.6&lt;/a&gt; of &lt;em&gt;Essential Radio Astronomy&lt;/em&gt;.&lt;/p&gt;




&lt;h1 id="introduction-to-interferometric-arrays-alma-vla-sma-vlba"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/as5003/lectures/01-introduction/#introduction-to-interferometric-arrays-alma-vla-sma-vlba"&gt;Introduction to interferometric arrays, ALMA, VLA, SMA, VLBA&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Now that we&amp;rsquo;ve covered some of the fundamentals around radio telescopes and single-dish antennas, we&amp;rsquo;ll move on to discussing how we combine the signals from multiple antennas to do interferometry. Here are some of he interferometers operating today:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/alma.jpg"
 alt="The Atacama Large (Sub)millimeter Array, an interferometric array of 66 antennas operating at sub-millimeter wavelengths. The largest antennas in the array are only 12m in diameter, yet through interferometry, the array is able to obtain far higher spatial resolution than the largest single-dish antennas. Credit: NRAO/ESO/NAOJ/JAO"&gt;&lt;figcaption&gt;
 &lt;p&gt;The Atacama Large (Sub)millimeter Array, an interferometric array of 66 antennas operating at sub-millimeter wavelengths. The largest antennas in the array are only 12m in diameter, yet through interferometry, the array is able to obtain far higher spatial resolution than the largest single-dish antennas. Credit: NRAO/ESO/NAOJ/JAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://public.nrao.edu/telescopes/vla/"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/VLA.jpg"
 alt="The Very Large Array, located in Socorro, NM. Credit: NRAO"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Very Large Array, located in Socorro, NM. Credit: NRAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/Very_Long_Baseline_Array"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/VLBA_St_Croix.jpg"
 alt="One antenna at the Eastern end of the Very Long Baseline Array (VLBA), St. Croix, U.S. Virgin Islands. Credit: Cumulus Clouds"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;One antenna at the Eastern end of the Very Long Baseline Array (VLBA), St. Croix, U.S. Virgin Islands. Credit: Cumulus Clouds&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://www.iram-institute.org/EN/noema-project.php?ContentID=9&amp;amp;rub=9&amp;amp;srub=0&amp;amp;ssrub=0&amp;amp;sssrub=0"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/NOEMA.jpg"
 alt="The NOEMA, located in the French Alps on the Plateau du Bure. Credit: IRAM/Rebus"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The NOEMA, located in the French Alps on the Plateau du Bure. Credit: IRAM/Rebus&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://lweb.cfa.harvard.edu/sma/"&gt;&lt;img src="https://iancze.github.io/courses/as5003/lectures/01-introduction/SMA.jpg"
 alt="The Submillimeter Array (SMA), located on Mauna Kea, Hawaii. Credit: I. Czekala"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Submillimeter Array (SMA), located on Mauna Kea, Hawaii. Credit: I. Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
</description></item><item><title>Correlation and Covariance in ALMA Data</title><link>https://iancze.github.io/courses/astro589/lectures/11-covariance/</link><pubDate>Tue, 06 Dec 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/11-covariance/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589+Lecture+11A+Covariance+and+Correlations/1_1jr8jqht"&gt;Zoom Slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Regularized Maximum Likelihood (RML) II</title><link>https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/</link><pubDate>Fri, 21 Oct 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+10+RML+Part+II+Part+I/1_syjygobk"&gt;Whiteboard recording&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+10+RML+Part+II+Part+II/1_b483lcx9"&gt;Zoom Slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/#references"&gt;References&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19499523"&gt;Machine Learning: A Probabilistic Perspective&lt;/a&gt; by Murphy, Chapter 10&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/3405468"&gt;Pattern Recognition and Machine Learning&lt;/a&gt; by Bishop, Chapter 8&lt;/li&gt;
&lt;li&gt;The fourth paper in the 2019 &lt;a href="https://ui.adsabs.harvard.edu/abs/2019ApJ...875L...4E/abstract"&gt;Event Horizon Telescope Collaboration series&lt;/a&gt; describing the imaging principles&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/1986ARA%26A..24..127N/abstract"&gt;Maximum entropy image restoration in astronomy&lt;/a&gt; AR&amp;amp;A by Narayan and Nityananda 1986&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2018A%26C....22...16C/abstract"&gt;Multi-GPU maximum entropy image synthesis for radio astronomy&lt;/a&gt; by Cárcamo et al. 2018&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2022arXiv220911813Z/abstract"&gt;Regularized Maximum Likelihood Techniques for ALMA Observations&lt;/a&gt; by Zawadzki, Czekala, et al.&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2021PASP..133i3001H/abstract"&gt;Fitting Very Flexible Models: Linear Regression With Large Numbers of Parameters&lt;/a&gt; by Hogg and Villar&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/#last-time"&gt;Last time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Recap of (parametric) forward modeling in a Bayesian context&lt;/li&gt;
&lt;li&gt;Recap of the CLEAN procedural image deconvolution algorithm&lt;/li&gt;
&lt;li&gt;Introduction of RML process as a non-parametric model&lt;/li&gt;
&lt;li&gt;Discussion of regularization, in the context of priors&lt;/li&gt;
&lt;li&gt;Discussion of loss function space (defined by probability distribution) vs. the optimization engineering that helps you navigate it&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/#today"&gt;Today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Overarching question&amp;mdash;how do you assess whether something is good? Forays into Machine Learning&lt;/li&gt;
&lt;li&gt;Deeper dive into future RML applications and opportunities&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="model-comparison"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/#model-comparison"&gt;Model comparison&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Last time we talked about the difference between parametric and non-parametric models, i.e., the difference between fitting a line with slope and intercept vs. fitting a spline or Gaussian process. And we made the simple distinction that a parametric model has a fixed number of parameters, whereas a non-parametric model generally has parameters that grow with your number of data points. The truth is that in several contexts these exist as part of a continuum.&lt;/p&gt;
&lt;p&gt;Today we&amp;rsquo;re going to take a journey along this continuum and examine some of the failure modes that can come about. The discussion will first be general and applicable to &lt;em&gt;many&lt;/em&gt; problems, but then we&amp;rsquo;ll zero in on the case of interferometric imaging (both CLEAN and RML) specifically.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s think of a polynomial basis. If you recall back to one of our first lectures, where I asked you to draw a function through a set of discrete samples. Let&amp;rsquo;s narrow in on the specific case of a polynomial basis set of degree \(N\), where \(N\) is the number of terms. I&amp;rsquo;m going to write it like this&lt;/p&gt;
&lt;p&gt;$$
y = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \ldots.
$$
but similar arguments apply to Legendre polynomials or Chebyshev polynomials, etc, and in practice it&amp;rsquo;s better to use on of those for your actual fitting problem.&lt;/p&gt;
&lt;p&gt;But lets say we have 10 data points. We can fit a 0th order polynomial, 1st order, etc.., using say a \(\chi^2\) likelihood function. And then we can get to a polynomial of degree ten. Who has heard the common critique &amp;ldquo;your model has more parameters than data, it&amp;rsquo;s so flexible&amp;rdquo;. This situation is where the common advice against using a model with more parameters than your data comes from, and when people utter this critique, I think this is the situation that they are referring to.&lt;/p&gt;
&lt;p&gt;What are the criticisms of this model? On one hand, it has fit the data &lt;em&gt;perfectly&lt;/em&gt;. It&amp;rsquo;s done what we&amp;rsquo;ve asked. On the other hand, it doesn&amp;rsquo;t seem to do what we want. If we were to get new data, our model probably wouldn&amp;rsquo;t be that useful.&lt;/p&gt;
&lt;p&gt;What can we do? Well, the common wisdom would have us stick to models with fewer parameters. But, this is being a little shortsighted. Instead, we can add a &lt;em&gt;regularization&lt;/em&gt; that discourages the fit from taking on large amplitudes in many of the terms. One type of regularization is &amp;ldquo;ridge regression,&amp;rdquo; also called Tikhonov or L2 regularization. It adds an extra term to the fit metric that says&lt;/p&gt;
&lt;p&gt;$$
\lambda \sum_{i=0}^{N-1}|a_i|^2
$$&lt;/p&gt;
&lt;p&gt;then we find that the amplitudes of those higher order terms (which might not be necessary) will be diminished. If you recall from very early in the semester, where we talked about the concept of band-limited signals, this regularization is related. In the limit we let \(N\rightarrow \infty\), we arrive at a Gaussian process, and the autocovariance of the function (the Gaussian process kernel) is related to the power spectrum of the signal. If we say the signal is band-limited, then that puts a &lt;em&gt;cap&lt;/em&gt; on the number of higher order terms that we can actually use.&lt;/p&gt;
&lt;p&gt;This math is fully equivalent to the RML imaging problem we introduced last week, and it also raises the same problem: how do we set the regularizer strength? What is the best choice?&lt;/p&gt;




&lt;h2 id="cross-validation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/10-rml-part-ii/#cross-validation"&gt;Cross validation&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Useful thoughts from &lt;a href="https://biometry.github.io/APES/LectureNotes/2017-Resampling/CrossValidationLecture.html"&gt;https://biometry.github.io/APES/LectureNotes/2017-Resampling/CrossValidationLecture.html&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The idea is to test the &lt;em&gt;predictive power&lt;/em&gt; of your model. In this case, the model would be your setup of your. In the RML case, the model would be the settings of your image pixelization,&lt;/p&gt;
&lt;p&gt;If we have &lt;em&gt;the right model&lt;/em&gt;, we will generalize perfectly to new data. The problem is that our training data are always limited and will usually always have some noise.&lt;/p&gt;
&lt;p&gt;The problem of non-independence of your random hold-outs. When the data are small, it is possible to overfit your cross validation. This is a hard place to be in, especially when getting new data is expensive.&lt;/p&gt;</description></item><item><title>Regularized Maximum Likelihood (RML) I</title><link>https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/</link><pubDate>Mon, 17 Oct 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+9A+RML+Part+I/1_3ehfryrc"&gt;Recording Part I: whiteboard&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+9+part+IIA+RML/1_5k3v0oqn"&gt;Zoom Part II: slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#references"&gt;References&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19499523"&gt;Machine Learning: A Probabilistic Perspective&lt;/a&gt; by Murphy, Chapter 10&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/3405468"&gt;Pattern Recognition and Machine Learning&lt;/a&gt; by Bishop, Chapter 8&lt;/li&gt;
&lt;li&gt;The fourth paper in the 2019 &lt;a href="https://ui.adsabs.harvard.edu/abs/2019ApJ...875L...4E/abstract"&gt;Event Horizon Telescope Collaboration series&lt;/a&gt; describing the imaging principles&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/1986ARA%26A..24..127N/abstract"&gt;Maximum entropy image restoration in astronomy&lt;/a&gt; AR&amp;amp;A by Narayan and Nityananda 1986&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2018A%26C....22...16C/abstract"&gt;Multi-GPU maximum entropy image synthesis for radio astronomy&lt;/a&gt; by Cárcamo et al. 2018&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2022arXiv220911813Z/abstract"&gt;Regularized Maximum Likelihood Techniques for ALMA Observations&lt;/a&gt; by Zawadzki, Czekala, et al.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#last-time"&gt;Last time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Discussed \(u,v\) coverage and sampling (weights)&lt;/li&gt;
&lt;li&gt;Introduced the &amp;ldquo;dirty image&amp;rdquo; as the inverse Fourier transform of the visibility samples&lt;/li&gt;
&lt;li&gt;Introduced the CLEAN image deconvolution procedure&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#this-time"&gt;This time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Review parametric vs. non-parametric models&lt;/li&gt;
&lt;li&gt;Introduce Regularized Maximum Likelihood (RML) imaging&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="parametric-vs-non-parametric-models"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#parametric-vs-non-parametric-models"&gt;Parametric vs. Non-Parametric Models&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Recall our discussion on Bayesian inference and what it means to forward-model a dataset and how to calculate a likelihood function.&lt;/p&gt;
&lt;p&gt;Say we have a model that we are fitting to some data. In &amp;ldquo;Machine Learning: A Probabilistic Perspective,&amp;rdquo; Murphy defines a parametric model as one that has a fixed number of parameters, and a non-parametric one as one where the number of parameters grows with the size of the data.&lt;/p&gt;
&lt;p&gt;Something like the line we discussed in our Bayesian Modeling lecture is a parametric model. It has two parameters, a slope and an intercept. If we were observing a source and we wanted to fit a 2D Gaussian to the visibility function, that visibility model would also be a parametric model. Its parameters would be the width of the Gaussian, the position of the source, and the amplitude of the source.&lt;/p&gt;
&lt;p&gt;TODO: draw example of Gaussian function and label parameters&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;ve ever fit a spline to a bunch of data, then you&amp;rsquo;ve used a non-parametric model. A Gaussian process would also a non-parametric model. In these models there in fact are parameters (like the number or type of splines/GP kernels), but these are usually nuisances to the problem. You wouldn&amp;rsquo;t necessarily fit a spline model to determine the exact number of spline position parameters, but you &lt;em&gt;are&lt;/em&gt; interested in the approximation to \(f(x)\) that the spline has enabled you.&lt;/p&gt;
&lt;p&gt;TODO: draw points and a spline or GP drawn through them&lt;/p&gt;
&lt;p&gt;I would consider a CLEAN model to be a type of non-parametric model too. Through the CLEANing process, you create a model of the source emission from a collection of \(\delta\) functions. Each of these \(\delta\) functions technically has parameters, but those are mostly nuisance parameters in pursuit of their aggregate representation of the model. In general, non-parametric models have the ability to be more expressive than parametric models, but sometimes at the expense of interpretability.&lt;/p&gt;




&lt;h2 id="rml-images-as-non-parametric-models"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#rml-images-as-non-parametric-models"&gt;RML images as non-parametric models&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now, let me introduce a set of techniques that have been grouped under the banner &amp;ldquo;Regularized Maximum Likelihood Imaging&amp;rdquo; or RML Imaging for short. With RML imaging, we&amp;rsquo;re trying to come up with a model that will fit the dataset. But rather than using a parametric model like a protoplanetary disk structure model or a series of Gaussian rings, we&amp;rsquo;re using a non-parametric model of &lt;em&gt;the image itself&lt;/em&gt;. This could be as simple as parameterizing the image using the intensity values of the pixels themselves, i.e.,&lt;/p&gt;
&lt;p&gt;$$
\boldsymbol{\theta} = {I_1, I_2, \ldots, I_{N^2} }
$$&lt;/p&gt;
&lt;p&gt;assuming we have an \(N \times N\) image.&lt;/p&gt;
&lt;p&gt;A flexible image model for RML imaging is mostly analogous to using a spline or Gaussian process to fit a series of \(\boldsymbol{X} = {x_1, x_2, \ldots, x_N}\) and \(\boldsymbol{Y} = {y_1, y_2, \ldots, y_N}\) points&amp;mdash;the model will nearly always have enough flexibility to capture the structure that exists in the dataset. The most straightforward formulation of a non-parametric image model is the pixel basis set, but we could also use more sophisticated basis sets like a set of wavelet coefficients, or even more exotic basis sets constructed from trained neural networks. These may have some serious advantages when it comes to the &amp;ldquo;regularizing&amp;rdquo; part of &amp;ldquo;regularized maximum likelihood&amp;rdquo; imaging. But first, let&amp;rsquo;s talk about the &amp;ldquo;maximum likelihood&amp;rdquo; part.&lt;/p&gt;
&lt;p&gt;Given some image parameterization (e.g., a pixel basis set of \(N \times N\) pixels, with each pixel &lt;code&gt;cell_size&lt;/code&gt; in width), we would like to find the maximum likelihood image \(\boldsymbol{\theta}_\mathrm{MLE}\). Fortunately, because the Fourier transform is a linear operation, we can analytically calculate the maximum solution (the same way we might find the best-fit slope and intercept for the line example). This maximum likelihood solution is called (in the radio astronomy world) the dirty image, and its associated point spread function is called the dirty beam.&lt;/p&gt;
&lt;p&gt;In the construction of the dirty image, all unsampled spatial frequencies are set to zero power. This means that the dirty image will only contain spatial frequencies about which we have at least some data. This assumption, however, rarely translates into good image fidelity, especially if there are many unsampled spatial frequencies which carry significant power. It&amp;rsquo;s also important to recognize that dirty image is only &lt;em&gt;one&lt;/em&gt; out of a set of &lt;em&gt;many&lt;/em&gt; images that could maximize the likelihood function. From the perspective of the likelihood calculation, we could modify the unsampled spatial frequencies of the dirty image to whatever power we might like, and, because they are &lt;em&gt;unsampled&lt;/em&gt;, the value of the likelihood calculation won&amp;rsquo;t change, i.e., it will still remain maximal.&lt;/p&gt;
&lt;p&gt;When synthesis imaging is described as an &amp;ldquo;ill-posed inverse problem,&amp;rdquo; this is what is meant. There is a (potentially infinite) range of images that could &lt;em&gt;exactly&lt;/em&gt; fit the dataset, and without additional information we have no way of discriminating which is best. As you might suspect, this is now where the &amp;ldquo;regularization&amp;rdquo; part of &amp;ldquo;regularized maximum likelihood&amp;rdquo; imaging comes in.&lt;/p&gt;




&lt;h2 id="regularization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#regularization"&gt;Regularization&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are a number of different ways to talk about regularization. If one wants to be Bayesian about it, one would talk about specifying &lt;em&gt;priors&lt;/em&gt;, i.e., we introduce terms like \(p(\boldsymbol{\theta})\) such that we might calculate the maximum a posteriori (MAP) image \(\boldsymbol{\theta}_\mathrm{MAP}\) using the posterior probability distribution&lt;/p&gt;
&lt;p&gt;$$
p(\boldsymbol{\theta} |\, \boldsymbol{V}) \propto \mathcal{L}(\boldsymbol{V} |\, \boldsymbol{\theta}) \, p(\boldsymbol{\theta}).
$$&lt;/p&gt;
&lt;p&gt;For computational reasons related to numerical over/underflow, we would most likely use the logarithm of the posterior probability distribution&lt;/p&gt;
&lt;p&gt;$$
\ln p(\boldsymbol{\theta} |\, \boldsymbol{V}) \propto \ln \mathcal{L}(\boldsymbol{V} |\, \boldsymbol{\theta}) + \ln p(\boldsymbol{\theta}).
$$&lt;/p&gt;
&lt;p&gt;One could accomplish the same goal without necessarily invoking the Bayesian language by simply talking about which parameters \(\boldsymbol{\theta}\) optimize some objective function.&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ll adopt the perspective that we have some objective &amp;ldquo;cost&amp;rdquo; function that we&amp;rsquo;d like to &lt;em&gt;minimize&lt;/em&gt; to obtain the optimal parameters \(\hat{\boldsymbol{\theta}}\). The machine learning community calls this a &amp;ldquo;loss&amp;rdquo; function \(L(\boldsymbol{\theta})\), and so we&amp;rsquo;ll borrow that terminology here. For an unregularized fit, an acceptable loss function is just the negative log likelihood (&amp;ldquo;nll&amp;rdquo;) term,&lt;/p&gt;
&lt;p&gt;$$
L(\boldsymbol{\theta}) = L_\mathrm{nll}(\boldsymbol{\theta}) = - \ln \mathcal{L}(\boldsymbol{V}|\,\boldsymbol{\theta}) = \frac{1}{2} \chi^2(\boldsymbol{V}|\,\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;If we&amp;rsquo;re only interested in \(\hat{\boldsymbol{\theta}}\), it doesn&amp;rsquo;t matter whether we include the 1/2 prefactor in front of \(\chi^2\), the loss function will still have the same optimum. However, when it comes time to add additional terms to the loss function, these prefactors matter in controlling the relative strength of each term.&lt;/p&gt;
&lt;p&gt;When phrased in the terminology of function optimization, additional terms can be described as regularization penalties. To be specific, let&amp;rsquo;s add a term that regularizes the sparsity of an image.&lt;/p&gt;
&lt;p&gt;$$
L_\mathrm{sparsity}(\boldsymbol{\theta}) = \sum_i |I_i|
$$&lt;/p&gt;
&lt;p&gt;In short, the L1 norm promotes sparse solutions (solutions where many pixel values are zero). The combination of these two terms leads to a new loss function&lt;/p&gt;
&lt;p&gt;$$
L(\boldsymbol{\theta}) = L_\mathrm{nll}(\boldsymbol{\theta}) + \lambda_\mathrm{sparsity} L_\mathrm{sparsity}(\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;Where we control the relative &amp;ldquo;strength&amp;rdquo; of the regularization via the scalar prefactor \(\lambda_\mathrm{sparsity}\). If \(\lambda_\mathrm{sparsity} = 0\), no sparsity regularization is applied. Non-zero values of \(\lambda_\mathrm{sparsity}\) will add in regularization that penalizes non-sparse \(\boldsymbol{\theta}\) values. How strong this penalization is depends on the strength relative to the other terms in the loss calculation.&lt;/p&gt;
&lt;p&gt;We can equivalently specify this using Bayesian terminology, such that&lt;/p&gt;
&lt;p&gt;$$
p(\boldsymbol{\theta} |\,\boldsymbol{V}) = \mathcal{L}(\boldsymbol{V}|,\boldsymbol{\theta}) \, p(\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;where&lt;/p&gt;
&lt;p&gt;$$
p(\boldsymbol{\theta}) = C \exp \left (-\lambda_\mathrm{sparsity} \sum_i | I_i| \right)
$$&lt;/p&gt;
&lt;p&gt;and \(C\) is a normalization factor. When working with the logarithm of the posterior, this constant term is irrelevant.&lt;/p&gt;




&lt;h2 id="the-mpol-package-for-regularized-maximum-likelihood-imaging"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/09-rml-part-i/#the-mpol-package-for-regularized-maximum-likelihood-imaging"&gt;The MPoL package for Regularized Maximum Likelihood imaging&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;Million Points of Light&lt;/em&gt; or &amp;ldquo;MPoL&amp;rdquo; is a Python package that is used to perform regularized maximum likelihood imaging. By that we mean that the package provides the building blocks to create flexible image models and optimize them to fit interferometric datasets. The package is developed completely in the open on &lt;a href="https://github.com/MPoL-dev/MPoL"&gt;Github&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We strive to&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;create an open, welcoming, and supportive community for new users and contributors (see our &lt;code&gt;code of conduct &amp;lt;https://github.com/MPoL-dev/MPoL/blob/main/CODE_OF_CONDUCT.md&amp;gt;&lt;/code&gt;&lt;strong&gt;and &lt;code&gt;developer documentation &amp;lt;developer-documentation.html&amp;gt;&lt;/code&gt;&lt;/strong&gt;)&lt;/li&gt;
&lt;li&gt;support well-tested (|Tests badge|) and stable releases (i.e., &lt;code&gt;pip install mpol&lt;/code&gt;) that run on all currently-supported Python versions, on Linux, MacOS, and Windows&lt;/li&gt;
&lt;li&gt;maintain up-to-date &lt;code&gt;API documentation &amp;lt;api.html&amp;gt;&lt;/code&gt;__&lt;/li&gt;
&lt;li&gt;cultivate tutorials covering real-world applications&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We also recommend checking out several other excellent packages for RML imaging:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://github.com/astrosmili/smili"&gt;SMILI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/achael/eht-imaging"&gt;eht-imaging&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/miguelcarcamov/gpuvmem"&gt;GPUVMEM&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;There are a few things about MPoL that we believe make it an appealing platform for RML modeling.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Built on PyTorch&lt;/strong&gt;: Many of MPoL&amp;rsquo;s exciting features stem from the fact that it is built on top of a rich computational library that supports autodifferentiation and construction of complex neural networks. Autodifferentiation libraries like &lt;a href="https://github.com/aesara-devs/aesara"&gt;Theano/Aesara&lt;/a&gt;, &lt;a href="https://www.tensorflow.org/"&gt;Tensorflow&lt;/a&gt;, &lt;a href="https://pytorch.org/"&gt;PyTorch&lt;/a&gt;, and &lt;a href="https://jax.readthedocs.io/"&gt;JAX&lt;/a&gt; have revolutionized the way we compute and optimize functions. For now, PyTorch is the library that best satisfies our needs, but we&amp;rsquo;re keeping a close eye on the Python autodifferentiation ecosystem should a more suitable framework arrive. If you are familiar with scientific computing with Python but haven&amp;rsquo;t yet tried any of these frameworks, don&amp;rsquo;t worry, the syntax is easy to pick up and quite similar to working with numpy arrays.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Autodifferentiation&lt;/strong&gt;: PyTorch gives MPoL the capacity to autodifferentiate through a model. The &lt;em&gt;gradient&lt;/em&gt; of the objective function is exceptionally useful for finding the &amp;ldquo;downhill&amp;rdquo; direction in a large parameter space (such as the set of image pixels). Traditionally, these gradients would have needed to been calculated analytically (by hand) or via finite-difference methods which can be noisy in high dimensions. By leveraging the autodifferentiation capabilities, this allows us to rapidly formulate and implement complex prior distributions which would otherwise be difficult to differentiate by hand.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimization&lt;/strong&gt;: PyTorch provides a full-featured suite of research-grade &lt;a href="https://pytorch.org/docs/stable/optim.html"&gt;optimizers&lt;/a&gt; designed to train deep neural networks. These same optimizers can be employed to quickly find the optimum RML image.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;GPU acceleration&lt;/strong&gt;: PyTorch wraps CUDA libraries, making it seamless to take advantage of (multi-)GPU acceleration to optimize images. No need to use a single line of CUDA.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Model composability&lt;/strong&gt;: Rather than being a monolithic program for single-click RML imaging, MPoL strives to be a flexible, composable, RML imaging &lt;em&gt;library&lt;/em&gt; that provides primitives that can be used to easily solve your particular imaging challenge. One way we do this is by mimicking the PyTorch ecosystem and writing the RML imaging workflow using &lt;a href="https://pytorch.org/tutorials/beginner/nn_tutorial.html"&gt;PyTorch modules&lt;/a&gt;. This makes it easy to mix and match modules to construct arbitrarily complex imaging workflows. We&amp;rsquo;re working on tutorials that describe these ideas in depth, but one example would be the ability to use a single latent space image model to simultaneously fit single dish and interferometric data.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A bridge to the machine learning/neural network community&lt;/strong&gt;: MPoL will happily calculate RML images for you using &amp;ldquo;traditional&amp;rdquo; image priors, lest you are the kind of person that turns your nose up at the words &amp;ldquo;machine learning&amp;rdquo; or &amp;ldquo;neural network.&amp;rdquo; However, if you are the kind of person that sees opportunity in these tools, because MPoL is built on PyTorch, it is straightforward to take advantage of them for RML imaging. For example, if one were to train a variational autoencoder on protoplanetary disk emission morphologies, the latent space + decoder architecture could be easily plugged in to MPoL and serve as an imaging basis set.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Image Plane Deconvolution (CLEAN)</title><link>https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/</link><pubDate>Sat, 08 Oct 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+8+CLEAN/1_99zi3w9d"&gt;Whiteboard recording&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+8+CLEAN+part+II/1_nw7x5iky"&gt;Zoom slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="references"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#references"&gt;References&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://leo.phys.unm.edu/~gbtaylor/astr423/s98book.pdf"&gt;Synthesis Imaging in Radio Astronomy II&lt;/a&gt;: Lecture 7: Imaging by Briggs, Schwab, and Sramek and Lecture 8: Deconvolution by Cornwell, Braun, and Briggs&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2021ApJS..257....2C/abstract"&gt;Molecules with ALMA at Planet-forming Scales (MAPS). II. CLEAN Strategies for Synthesizing Images of Molecular Line Emission in Protoplanetary Disks&lt;/a&gt; by Czekala et al. 2021&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="outline"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#outline"&gt;Outline&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Recap of visibility datasets and the sampling function&lt;/li&gt;
&lt;li&gt;Image plane implications of sampling&amp;ndash;the dirty image&lt;/li&gt;
&lt;li&gt;Noise and &amp;ldquo;weighting&amp;rdquo;&lt;/li&gt;
&lt;li&gt;The CLEAN image deconvolution algorithm&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="visibility-datasets"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#visibility-datasets"&gt;Visibility datasets&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Recall from last time that the visibility function is the Fourier transform of the sky brightness distribution&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u,v) \leftrightharpoons I(l,m)
$$&lt;/p&gt;
&lt;p&gt;and that each baseline (pair of antennas) of an interferometric array corresponds to a sample of the visibility function at a specific \(u,v\) point. The \(u,v\) point corresponds to the length of the &lt;em&gt;projected&lt;/em&gt; baseline in multiples of the observing wavelength and is the &lt;em&gt;spatial frequency&lt;/em&gt; of the image plane that is being sampled.&lt;/p&gt;
&lt;p&gt;For a large array with &amp;gt; 50 antennas, like ALMA, you get nearly 1000 unique &lt;em&gt;instantaneous&lt;/em&gt; baselines, from each pairwise combination of antennas in the array. As the earth rotates, you can quickly acquire new projected baseline samples.&lt;/p&gt;




&lt;h2 id="sampling-function"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#sampling-function"&gt;Sampling function&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If you do radio interferometry, you will very often see the baseline distributions plotted as a series of \(\delta\) functions on the \(u,v\) plane.&lt;/p&gt;
&lt;p&gt;TODO: draw approximate plot with points&lt;/p&gt;
&lt;p&gt;This is called the sampling function&lt;/p&gt;
&lt;p&gt;$$
S(u,v) = \sum_{k=1}^M \delta(u - u_k, v - v_k).
$$&lt;/p&gt;
&lt;p&gt;And, we can write down the sampling of the visibility function as&lt;/p&gt;
&lt;p&gt;$$
S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;If you recall from our Fourier transform distribution, this sampling function is also called the &lt;em&gt;transfer function&lt;/em&gt;. It allows certain spatial frequencies through the interferometric system. Both terminologies are used in the radio astronomy community.&lt;/p&gt;




&lt;h3 id="image-plane-implications"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#image-plane-implications"&gt;Image plane implications&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now, let&amp;rsquo;s discuss the image-plane ramifications of the sampling operation. We started with \(I(l,m)\), the &amp;ldquo;true&amp;rdquo; sky brightness, i.e., the one you would observe if you had a perfectly sensitive telescope with infinite resolving power.&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u,v) \leftrightharpoons I(l,m)
$$&lt;/p&gt;
&lt;p&gt;but we&amp;rsquo;ve &lt;em&gt;multiplied&lt;/em&gt; it by the sampling distribution&lt;/p&gt;
&lt;p&gt;$$
S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;Remember how we talked about interferometers as &lt;em&gt;spatial filters&lt;/em&gt;? We just showed how this conceptually works in the Fourier domain, the \(u,v\) coverage provided by the antenna spacings &lt;em&gt;is&lt;/em&gt; the transfer function.&lt;/p&gt;
&lt;p&gt;We can also show that interferometers are spatial filters by considering the image plane implications. This same operation implies that the true sky brightness is &lt;em&gt;convolved&lt;/em&gt; by something in the image plane
$$
I(l, m) * B_D(l, m) \leftrightharpoons S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;The quantity \(B_D(l,m,)\) is called the dirty beam. We&amp;rsquo;ll soon talk about the CLEAN algorithm, so the dirty/clean terminology will soon make sense. But first, let&amp;rsquo;s just think about this for a second. This dirty beam is the same thing we showed in previous lectures, and can also be thought of as the sum of the fringe functions.&lt;/p&gt;
&lt;p&gt;If you take an image, convolve it with the dirty beam, and then take its Fourier transform, you&amp;rsquo;ll see that you will have visibility samples only at the spatial frequencies corresponding to your baselines.&lt;/p&gt;
&lt;p&gt;Another way to think of the dirty beam is as the &lt;em&gt;impulse response&lt;/em&gt; of the interferometric system. Let&amp;rsquo;s assume we are observing a point source \(I(l,m) = \delta(l,m)\). The visibility function corresponding to a point source is a constant, so we have&lt;/p&gt;
&lt;p&gt;$$
\delta(l,m) * B_D(l,m) = S(u,v) \times \mathrm{constant}
$$&lt;/p&gt;
&lt;p&gt;$$
B_D(l,m) \leftrightharpoons S(u,v).
$$&lt;/p&gt;
&lt;p&gt;So, another way we can say the same thing is that the dirty beam is the &lt;em&gt;point spread function&lt;/em&gt; (PSF) of the interferometer, and it is given by the Fourier transform of the sampling function, which is set by the configuration of the baselines within the array.&lt;/p&gt;




&lt;h2 id="the-dirty-image"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#the-dirty-image"&gt;The Dirty Image&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In last week&amp;rsquo;s lecture, we talked a bit about the actual visibility data products, a discrete set of (noisy) visibility samples&lt;/p&gt;
&lt;p&gt;$$
\mathbf{V} = \{V_1, V_2, \ldots, V_M\}^M_{k=1}.
$$&lt;/p&gt;
&lt;p&gt;The idea is that each was sampled with some (complex) noise draw&lt;/p&gt;
&lt;p&gt;$$
V_i = \mathcal{V}(u_i, v_i) + \epsilon
$$&lt;/p&gt;
&lt;p&gt;such that
$$
\epsilon_\Re \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;$$
\epsilon_\Im \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;$$
\epsilon = \epsilon_\Re + i \epsilon_\Im.
$$&lt;/p&gt;
&lt;p&gt;And then we said that radio interferometers commonly represent the uncertainty on each visibility measurement by a &amp;ldquo;weight&amp;rdquo; \(w_i\), where&lt;/p&gt;
&lt;p&gt;$$
w_i = \frac{1}{\sigma_i^2}.
$$&lt;/p&gt;




&lt;h3 id="more-details-about-the-sampling-function"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#more-details-about-the-sampling-function"&gt;More details about the sampling function&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The fact that each visibility measurement is made in the presence of noise means that we should be taking this into account in our sampling function. So a more sophisticated sampling function looks like&lt;/p&gt;
&lt;p&gt;$$
S(u, v) = \sum_{k=1}^M T_k D_k w_k \delta(u - u_k, v - v_k).
$$&lt;/p&gt;
&lt;p&gt;In addition to weighting each sample by its inverse variance \(w_k\) (something you might do to take a statistical average), there are other factors you might fiddle with, like a &amp;ldquo;taper&amp;rdquo; \(T_k\) and density weight \(D_k\). For now, you can just think of them as equal to 1.0.&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s think about how we would take our sampled visibilities \(\mathbf{V} = \{V_1, V_2, \ldots, V_M\}^M_{k=1}\) and make an image from them. Well, the way to do this looks like an inverse Fourier transform&lt;/p&gt;
&lt;p&gt;$$
I_D(l, m) = C \sum_{k=1}^{N^\dagger} T_k D_k w_k V_k \exp \{2 \pi i (u_k l + v_k m) \}.
$$
with normalization constant
$$
C = 1 / \sum_{k=1}^{N^\dagger} T_k D_k w_k.
$$&lt;/p&gt;
&lt;p&gt;where \(N^\dagger = 2 N\) is the set of visibilities that includes their complex conjugates such that the sampling is Hermitian. This image that results is called the &lt;em&gt;dirty image&lt;/em&gt;, and we denote it with a &amp;ldquo;D&amp;rdquo; subscript.&lt;/p&gt;
&lt;p&gt;Continuing with the dirty beam concept, we also have&lt;/p&gt;
&lt;p&gt;$$
I_D(l,m) = I(l, m) * B_D(l, m) \leftrightharpoons S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;




&lt;h3 id="a-quick-note-about-the-ill-defined-imaging-problem-and-weighting-choices"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#a-quick-note-about-the-ill-defined-imaging-problem-and-weighting-choices"&gt;A quick note about the &amp;ldquo;ill-defined&amp;rdquo; imaging problem and weighting choices&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Making images from Fourier samples is generally an ill-defined inverse process, which is only complicated in the presence of noise. What do we mean by ill-defined?&lt;/p&gt;
&lt;p&gt;In the forward process, we collect samples of the visibility function at specific \(u,v\) values \(S(u, v) \times \mathcal{V}(u, v)\). To make a dirty image, we take the inverse Fourier transform of those values to produce &lt;em&gt;an&lt;/em&gt; image. This is the inverse process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The ill-defined nature of imaging&lt;/strong&gt; (even in the presence of noise): Think about what&amp;rsquo;s happening in the forward process. If the visibility function had non-zero amplitudes at some \(u,v\) values that we didn&amp;rsquo;t sample, then these never enter our dataset (obviously). The inverse process itself doesn&amp;rsquo;t include these values (obviously), and by their omission, assumes that their amplitudes are equal to zero.&lt;/p&gt;
&lt;p&gt;So, as far as our interferometer is concerned, it can&amp;rsquo;t distinguish between a set of degenerate image brightness distributions on the sky so long as they have the exact same \(\mathcal{V}(u,v)\) values at the sampled \(u,v\) points. The unsampled \(\mathcal{V}(u,v)\) locations can take on arbitrary values and still result in the exact same dataset.&lt;/p&gt;
&lt;p&gt;Another way of saying the same thing is to think of an interferometer as a &lt;em&gt;spatial filter&lt;/em&gt;, i.e., its transfer function (the array configuration) only allows measurements of certain spatial frequencies to enter the dataset. But most images contain power at many spatial frequencies, including those that have been filtered out. So, if you just try to make an image with the spatial frequencies in your dataset, your image will most likely be missing some spatial frequencies that would be there in actuality.&lt;/p&gt;




&lt;h4 id="weighting-choices"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#weighting-choices"&gt;Weighting choices&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;The whole discussion about the ill-defined inverse problem applies &lt;em&gt;even if&lt;/em&gt; we sampled the visibility function perfectly with no noise, so long as there are still unsampled \(u,v\) points that contain significant visibility &amp;ldquo;power.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;The problem gets even more complicated when we consider measurement noise and the fact that array configurations usually sample some parts of \(u,v\) space better than others. In my opinion, this is really why various weighting schemes are as popular as they are. The common way to do this is by tuning the \(D_k\) and \(T_k\) terms. As we&amp;rsquo;ll show later in the slides, tuning the \(D_k\) terms provide a tradeoff between:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;natural weighting&lt;/strong&gt; maximizing point source sensitivity at the cost of spatial resolution (broader beam)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;uniform weighting&lt;/strong&gt; maximizing spatial resolution (narrow beam) at the cost of point source sensitivity (higher RMS noise floor in the image)&lt;/li&gt;
&lt;li&gt;&amp;ldquo;Briggs&amp;rdquo; &lt;strong&gt;robust weighting&lt;/strong&gt; a tradeoff between these two regimes, ranging from (-2 to 2). The tradeoff is non-linear, so coming from natural weighting, for example, good spatial resolution can be gained with only modest sacrifices in point source sensitivity. Or, vice versa, coming from uniform weighting, good sensitivity to point sources can be gained with only modest losses in resolution.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Typically, a reasonable starting point with ALMA observations is to use some in-between value of Briggs weighting like -0.5, 0.0, or 0.5. Different weighting choices can change your sensitivities on different spatial scales.&lt;/p&gt;
&lt;p&gt;The \(T_k\) terms are for applying a &lt;em&gt;taper&lt;/em&gt;, whereby one downweights longer baseline observations.&lt;/p&gt;




&lt;h2 id="clean"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#clean"&gt;CLEAN&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We&amp;rsquo;ve talked about how&lt;/p&gt;
&lt;p&gt;$$
I_D(l,m) = I(l, m) * B_D(l, m) \leftrightharpoons S(u, v) \times \mathcal{V}(u, v).
$$&lt;/p&gt;
&lt;p&gt;Specifically, the &lt;em&gt;dirty image&lt;/em&gt; is the &lt;em&gt;convolution&lt;/em&gt; of the true sky brightness \(I\) with the dirty beam \(B_D\). We know what the dirty beam is to high precision.&lt;/p&gt;
&lt;p&gt;First, it&amp;rsquo;s important to note that convolution is a &lt;em&gt;lossy&lt;/em&gt; procedure, you (irrevocably) lose information. For example, consider applying a Gaussian blur to an image. The high resolution information in that image has been lost.&lt;/p&gt;
&lt;p&gt;CLEAN is an &lt;em&gt;image deconvolution&lt;/em&gt; algorithm. We just said that convolution is a lossy procedure, so, how does the algorithm get that information back? What follows are my own opinions about the CLEAN algorithm, its use cases I&amp;rsquo;m most familiar with in the protoplanetary disk community, and its limitations.&lt;/p&gt;
&lt;p&gt;TODO: draw a 1D cut of the dirty beam&lt;/p&gt;
&lt;p&gt;The short answer is that CLEAN can help restore an image &lt;em&gt;up to a point&lt;/em&gt;. The thing that CLEAN is best at is removing the effect of those nasty sidelobes from a dirty beam, and replacing them with a more Gaussian beam response that is usually easier to work with. CLEAN will not give you &amp;ldquo;super-resolution&amp;rdquo; access to lost spatial frequencies that you have lost, but it can help you make better looking images, and ones that have better dynamic range.&lt;/p&gt;




&lt;h3 id="iterative-processes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#iterative-processes"&gt;Iterative processes&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;CLEAN is a procedure that iteratively builds up a model image. To carry this out on the whiteboard, I&amp;rsquo;m going to do things in 1D. In a moment we&amp;rsquo;ll show an example with 2D images in the slides.&lt;/p&gt;
&lt;p&gt;TODO: draw in 1D a dirty image of a few point sources, a representation of the dirty beam, and a blank model image&lt;/p&gt;
&lt;p&gt;Before you start, we&amp;rsquo;ll define a quantity called the CLEAN beam. It&amp;rsquo;s usually chosen to be a Gaussian fit to the main lobe of the beam.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;ol&gt;
&lt;li&gt;First, we identify the &lt;em&gt;peak&lt;/em&gt; location in the dirty image.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;2a. Then, we subtract some fraction of the flux times the dirty beam from this location. This dirty image becomes a &amp;ldquo;residual image&amp;rdquo; now.&lt;/li&gt;
&lt;li&gt;2b. At the same time, we add a \(\delta\) function at corresponding location in the model image with the same amplitude as the flux we subtracted. So, if we subtracted 0.1 Jy of flux in the dirty beam, then we would add a \(\delta\) function with amplitude of 0.1 Jy in the model image.&lt;/li&gt;
&lt;li&gt;2c. You can think of these steps as equivalent, because a \(\delta\) function times the dirty beam gives you back the dirty beam. These steps are &lt;em&gt;also&lt;/em&gt; equivalently carried out in the visibility domain.&lt;/li&gt;
&lt;li&gt;
&lt;ol start="3"&gt;
&lt;li&gt;Go back to step 1, and repeat with the next-highest &lt;em&gt;peak&lt;/em&gt; location. Continue this loop until the peak flux in the image drops below some threshold&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start="4"&gt;
&lt;li&gt;Once this threshold is reached, the CLEANing is done. The final step is to put everything back together. The model image is convolved with the CLEAN beam to form the restored image. This &amp;ldquo;smooths out&amp;rdquo; the model image to some resolution limit, and hides imperfections on smaller scales.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start="5"&gt;
&lt;li&gt;The remainder of the residual image is added back to the restored image to give a sense of the &amp;ldquo;noise&amp;rdquo; in the image.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="limitations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/08-image-plane-deconvolution/#limitations"&gt;Limitations&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;CLEAN is &lt;em&gt;procedural&lt;/em&gt;. What this means is that you set parameters that guide the above process and then carry on until some termination criterion is reached. This could also be part of an interactive process. There is no guarantee that the CLEANed image is unique, either.&lt;/p&gt;
&lt;p&gt;In my opinion, CLEAN is best at removing the sidelobe effects of the dirty beam, improving the dynamic range of your image, and possibly detecting fainter (point-like) sources that would have been hidden by the sidelobes of other brighter point sources.&lt;/p&gt;
&lt;p&gt;In the above example, we said that we would use a \(\delta\) function to build up a model image. You may have already identified this choice of basis set or &amp;ldquo;CLEAN component&amp;rdquo; as a potential limitation. This works great for fields of point sources, but what about extended sources? It turns out that it actually works &lt;em&gt;OK&lt;/em&gt; for extended sources, so long as you have many of them. This adds to the computational time, though, and is why I think we&amp;rsquo;re now currently in an interesting place, approaching the limitations of CLEAN.&lt;/p&gt;
&lt;p&gt;There are other extensions to CLEAN (called multi-scale), which use CLEAN components of varying sizes, like little Gaussian blobs. This can help substantially over using just \(\delta\) functions. For very resolved structures, though, you still run into the problem that your components aren&amp;rsquo;t sufficiently like the morphology of the source you&amp;rsquo;re trying to deconvolve.&lt;/p&gt;</description></item><item><title>Bayesian Inference, Complex Measurement, and Model Fitting</title><link>https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/</link><pubDate>Tue, 04 Oct 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+7/1_hnzpjmiw"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19551280"&gt;Data Analysis: a Bayesian Tutorial&lt;/a&gt; by Sivia and Skilling&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2010arXiv1008.4686H/abstract"&gt;Data Analysis Recipes: Fitting a Model to Data&lt;/a&gt;, by Hogg et al.&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2012arXiv1205.4446H/abstract"&gt;Data analysis recipes: Probability calculus for inference&lt;/a&gt; by Hogg&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/19499523"&gt;Machine Learning: A Probabilistic Perspective&lt;/a&gt; by Murphy, Chapter 10&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/3405468"&gt;Pattern Recognition and Machine Learning&lt;/a&gt; by Bishop, Chapter 8&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/32525864"&gt;Probabilistic Graphical Models&lt;/a&gt; by Sucar, especially Chapters 7, 8&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#last-time"&gt;Last time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;general interferometer with north-south and east-west baselines&lt;/li&gt;
&lt;li&gt;arrays with multiple antennas and Earth aperture synthesis&lt;/li&gt;
&lt;li&gt;point spread functions (PSFs) and their relationship to array configuration&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="todays-lecture"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#todays-lecture"&gt;Today&amp;rsquo;s lecture&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now that we&amp;rsquo;ve covered how interferometers work to observe a source and the Fourier transform theory behind that, we&amp;rsquo;re going to focus on the data products (called &amp;ldquo;visibilities&amp;rdquo;) and Bayesian inference techniques for analyzing the data in its natural space. In subsequent lectures we will talk about how we might use the visibilities and Fourier inversion techniques to synthesize images of the source, but this lecture occupies an important intermediate (and foundational) step, where we are treating the visibilities as the &amp;ldquo;raw&amp;rdquo; data product and thinking about how we bring our analysis techniques into that space.&lt;/p&gt;
&lt;p&gt;The topics we will cover include:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Bayesian inference&lt;/li&gt;
&lt;li&gt;Forward-modeling with a &lt;em&gt;generative&lt;/em&gt; model&lt;/li&gt;
&lt;li&gt;Complex-valued noise and measurement (weights)&lt;/li&gt;
&lt;li&gt;Statistical weight and relationship to point source uncertainty&lt;/li&gt;
&lt;li&gt;Forward-modeling visibility data&lt;/li&gt;
&lt;li&gt;Missing spatial frequencies (model constraints)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="probability-calculus-and-bayesian-inference"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#probability-calculus-and-bayesian-inference"&gt;Probability calculus and Bayesian Inference&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;A good reference for this section is &lt;a href="https://ui.adsabs.harvard.edu/abs/2012arXiv1205.4446H/abstract"&gt;Data analysis recipes: Probability calculus for inference&lt;/a&gt; by Hogg&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Generally, we write probability distributions like \(p(a)\). The probability distribution is a function describing the probability of the variable \(a\) having some value.&lt;/p&gt;
&lt;p&gt;Probability functions are normalized.
$$
\int_{-\infty}^\infty p(a)\,\mathrm{d}a = 1
$$&lt;/p&gt;
&lt;p&gt;Say that \(a\) represents the height of an individual from the US population, measured in meters.&lt;/p&gt;
&lt;p&gt;TODO: draw a bell curve&lt;/p&gt;
&lt;p&gt;Probability functions &lt;strong&gt;have units&lt;/strong&gt;. In this case, \(p(a)\) has units of \(a^{-1}\), or \(\mathrm{m}^{-1}\).&lt;/p&gt;
&lt;p&gt;If selected an individual from the US population and we wanted to know the probability that their height was between 1.7 and 1.9 meters, we could do an integral over this range&lt;/p&gt;
&lt;p&gt;$$
\int_{1.7\,\mathrm{m}}^{1.9\,\mathrm{m}} p(a)\,\mathrm{d}a
$$&lt;/p&gt;




&lt;h3 id="conditional-probabilities"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#conditional-probabilities"&gt;Conditional probabilities&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;It&amp;rsquo;s very common that we will talk about multiple parameters at the same time. For example, we can continue our example and let \(b\) be the age of an individual drawn from the US population. We can talk about the probability of an individual drawn from the US population having a certain height &lt;em&gt;given&lt;/em&gt; that we know they are 20 years old&lt;/p&gt;
&lt;p&gt;$$
p(a | b = 20\,\mathrm{yr}).
$$&lt;/p&gt;
&lt;p&gt;What are the units of this probability distribution? It&amp;rsquo;s actually the same as before, it&amp;rsquo;s \(a^{-1}\), or \(\mathrm{m}^{-1}\), because this probability distribution must obey the same normalization&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty p(a | b = 20\,\mathrm{yr})) \,\mathrm{d}a = 1
$$&lt;/p&gt;
&lt;p&gt;You can&amp;rsquo;t do the integral
$$
\int p(a | b ) \,\mathrm{d}b
$$
because this integrand now has units of \(a^{-1}b\), which is nonsensical.&lt;/p&gt;




&lt;h3 id="factorizing-probabilities"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#factorizing-probabilities"&gt;Factorizing probabilities&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s consider the probability distribution
$$
p(a,b)
$$
this is a two-dimensional probability distribution. It has units of \((ab)^{-1}\), or in our previous example \(\mathrm{m}^{-1}\, \mathrm{yr}^{-1}\). You can read this as the probability of an individual having \(a\) value of height &lt;em&gt;and&lt;/em&gt; \(b\) value of age, i.e., this is a &lt;em&gt;joint&lt;/em&gt; probability distribution.&lt;/p&gt;
&lt;p&gt;The same normalization rules apply, only now these need to be done over two dimensions.&lt;/p&gt;
&lt;p&gt;You can take &lt;em&gt;any&lt;/em&gt; joint probability distribution and factor it into conditional distributions. So, we could write \(p(a,b)\) in two different ways
$$
p(a,b) = p(a) p(b | a)
$$
or
$$
p(a,b) = p(a|b) p(b).
$$
So, in words, we can say that the probability of \(a\) &lt;em&gt;and&lt;/em&gt; \(b\) (the left hand side) is equal to the probability of \(a\) &lt;em&gt;times&lt;/em&gt; the probability of \(b\) given \(a\).&lt;/p&gt;
&lt;p&gt;As I said, this factorization can apply to any joint probability distribution.&lt;/p&gt;
&lt;p&gt;Side note that &lt;em&gt;if&lt;/em&gt;
$$
p(b | a) = p(b)
$$
we would say that \(a\) and \(b\) are independent variables, and so we would write
$$
p(a, b) = p(a) p(b),
$$
but this isn&amp;rsquo;t true in the general case, only if the variables are independent.&lt;/p&gt;
&lt;p&gt;We can put the two factorization equations together to arrive at another relationship
$$
p(a | b) = \frac{p(b|a) p(a)}{p(b)}
$$
which is called Bayes&amp;rsquo;s theorem.&lt;/p&gt;




&lt;h3 id="marginalization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#marginalization"&gt;Marginalization&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;One amazing thing you can do with probability distributions is &lt;em&gt;marginalization&lt;/em&gt;. Say that I told you the joint distribution \(p(a, b)\). As we talked about, we said this distribution had units of \((ab)^{-1}\) or \(\mathrm{m}^{-1}\, \mathrm{yr}^{-1}\). But let&amp;rsquo;s say you only cared about \(p(a)\), the distribution of heights. We can marginalize away the variable we don&amp;rsquo;t want by integration
$$
p(a) = \int p(a, b)\,\mathrm{d}a.
$$&lt;/p&gt;
&lt;p&gt;As we&amp;rsquo;ll see in a moment, this has huge implications when it comes time to do inference.&lt;/p&gt;




&lt;h2 id="likelihood-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#likelihood-functions"&gt;Likelihood functions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now, let&amp;rsquo;s revisit Bayes&amp;rsquo;s rule and rewrite it like
$$
p(\mathrm{hypothesis} | \mathrm{data}) \propto p(\mathrm{data} | \mathrm{hypothesis}) \times p(\mathrm{hypothesis}).
$$
We&amp;rsquo;re omitting a constant of proportionality, commonly called the Bayesian evidence.&lt;/p&gt;
&lt;p&gt;The term on the left hand side is called a posterior distribution and is really a wonderful thing to report at the end of your analysis. Say you collected many years of measurements on the positions (orbits) of Jupiter, Saturn, and their moons, and then used those measurements to infer the mass of Saturn (as Laplace famously did). The posterior you would be most interested in would be a 1D distribution of the probability of the mass of Saturn and would represent your degree of belief that Saturn truly had that particular mass. This would be conditional on all of the measurements you made.&lt;/p&gt;
&lt;p&gt;The term on the very right hand side is a prior probability distribution and expresses your belief about the mass of Saturn in the absence of data. For example, we might rightly say that the mass needed to be greater than zero, and less than the mass of the Sun. A simple prior would then ascribe equal probabilities to all values in between (or perhaps equal probabilities to the &lt;em&gt;logarithm&lt;/em&gt; of the mass of Saturn).&lt;/p&gt;
&lt;p&gt;The remaining term, \(p(\mathrm{data} | \mathrm{hypothesis}) \) is called a likelihood function, and it is really where the rubber meets the road in most statistical analyses. Simply put, the likelihood function is how probable the observed data is for a given setting of the hypothesis. So, what is the probability of obtaining the observed positional measurements of Jupiter, Saturn, and their moons &lt;em&gt;if&lt;/em&gt; the mass of Saturn were \(5\times 10^{26}\,\mathrm{kg}\), for example.&lt;/p&gt;
&lt;p&gt;A quick note that likelihood functions show up in frequentist analysis all the time, too. However, the interpretations of probability are different.&lt;/p&gt;




&lt;h3 id="fitting-a-line"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#fitting-a-line"&gt;Fitting a line&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s dive into a quick example with some real data to make these concepts clearer.&lt;/p&gt;
&lt;p&gt;Typically, when astronomers fit a model to some dataset, such as a line \(y = m x + b\) to a collection of \(\boldsymbol{X} = {x_1, x_2, \ldots, x_N}\) and \(\boldsymbol{Y} = {y_1, y_2, \ldots, y_N}\) points, we require a likelihood function. Simply put, the likelihood function specifies the probability of the data, given a model, and encapsulates our assumptions about the data and noise generating processes.&lt;/p&gt;
&lt;p&gt;TODO: draw a bunch of points for putting a line through.&lt;/p&gt;
&lt;p&gt;For most real-world datasets, we don&amp;rsquo;t measure the &amp;ldquo;true&amp;rdquo; \(y\) value of the line (i.e., \(mx + b\)), but rather make a measurement which has been partially corrupted by some &amp;ldquo;noise.&amp;rdquo; In that case, we say that each \(y_i\) data point is actually generated by&lt;/p&gt;
&lt;p&gt;$$
y_i = m x_i + b + \epsilon
$$&lt;/p&gt;
&lt;p&gt;where \(\epsilon\) is a noise realization from a standard &lt;a href="https://en.wikipedia.org/wiki/Normal_distribution"&gt;normal distribution&lt;/a&gt; with standard deviation \(\sigma\), i.e.,&lt;/p&gt;
&lt;p&gt;$$
\epsilon \sim \mathcal{N}(0, \sigma).
$$&lt;/p&gt;
&lt;p&gt;This information about the data and noise generating process means that we can write down a likelihood function to calculate the probability that we observe the data that we do, given a set of model parameters. The likelihood function is \(p(\boldsymbol{Y} |\boldsymbol{\theta})\). Sometimes it is written as \(\mathcal{L}(\boldsymbol{Y} |\boldsymbol{\theta})\), and frequently, when employed in computation, we&amp;rsquo;ll use the logarithm of the likelihood function, or &amp;ldquo;log-likelihood,&amp;rdquo; \(\ln \mathcal{L}\) to avoid numerical under/overflow issues.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s call \(\boldsymbol{\theta} = {m, b}\) and \(M(x_i |, \boldsymbol{\theta}) = m x_i + b\). This is a very simple example here, but we would still call \(M\) a forward or &lt;em&gt;generative&lt;/em&gt; model. By that, we mean our model is sophisticated enough that we can use it (and some noise model) to fully replicate the dataset, or alternative sets of data indistinguishable from the measured data.&lt;/p&gt;
&lt;p&gt;The probability of observing each datum is a Gaussian (normal distribution) centered on the model value, evaluated at the \(y_i\) value. So, the full likelihood function for this line problem is just the multiplication of all of these probability distributions&lt;/p&gt;
&lt;p&gt;$$
\mathcal{L}(\boldsymbol{Y} |\boldsymbol{\theta}) = \prod_i^N \frac{1}{\sqrt{2 \pi} \sigma} \exp \left [ - \frac{(y_i - M(x_i |\boldsymbol{\theta}))^2}{2 \sigma^2}\right ].
$$&lt;/p&gt;
&lt;p&gt;The logarithm of the likelihood function is&lt;/p&gt;
&lt;p&gt;$$
\ln \mathcal{L}(\boldsymbol{Y} |,\boldsymbol{\theta}) = -N \ln(\sqrt{2 \pi} \sigma) - \frac{1}{2} \sum_i^N \frac{(y_i - M(x_i |\boldsymbol{\theta}))^2}{\sigma^2}.
$$&lt;/p&gt;
&lt;p&gt;You may recognize the right hand term looks similar to the \(\chi^2\) metric,&lt;/p&gt;
&lt;p&gt;$$
\chi^2(\boldsymbol{Y} |\boldsymbol{\theta}) = \sum_i^N \frac{(y_i - M(x_i |\boldsymbol{\theta}))^2}{\sigma^2}
$$&lt;/p&gt;
&lt;p&gt;Assuming that the uncertainty (\(\sigma\)) on each data point is known (and remains constant), the first term in the log likelihood expression remains constant, and we have&lt;/p&gt;
&lt;p&gt;$$
\ln \mathcal{L}(\boldsymbol{Y} |\boldsymbol{\theta}) = - \frac{1}{2} \chi^2 (\boldsymbol{Y} |\boldsymbol{\theta}) + C
$$&lt;/p&gt;
&lt;p&gt;where \(C\) is a constant with respect to the model parameters. It is common to use shorthand to say that &amp;ldquo;the likelihood function is \(\chi^2\)&amp;rdquo; to indicate situations where the data uncertainties are Gaussian. Very often, we (or others) are interested in the parameter values \(\boldsymbol{\theta}_\mathrm{MLE}\) which maximize the likelihood function. Unsurprisingly, these parameters are called the &lt;em&gt;maximum likelihood estimate&lt;/em&gt; (or MLE), and usually they represent something like a &amp;ldquo;best-fit&amp;rdquo; model.&lt;/p&gt;
&lt;p&gt;When it comes time to do parameter inference, however, it&amp;rsquo;s important to keep in mind&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;the simplifying assumptions we made about the noise uncertainties being constant with respect to the model parameters. If we were to &amp;ldquo;fit for the noise&amp;rdquo; in a hierarchical model, for example, we would need to use the full form of the log-likelihood function, including the \(-N \ln \left (\sqrt{2 \pi} \sigma \right)\) term.&lt;/li&gt;
&lt;li&gt;that in order to maximize the likelihood function we want to &lt;em&gt;minimize&lt;/em&gt; the \(\chi^2\) function.&lt;/li&gt;
&lt;li&gt;that constants of proportionality (e.g., the 1/2 in front of the \(\chi^2\)) can matter when combining likelihood functions with prior distributions for Bayesian parameter inference.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;To be specific, \(\chi^2\) is not the end of the story when we&amp;rsquo;d like to perform Bayesian parameter inference. To do so, we need the posterior probability distribution of the model parameters given the dataset, \(p(\boldsymbol{\theta}|,\boldsymbol{Y})\).&lt;/p&gt;




&lt;h2 id="visibility-data"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#visibility-data"&gt;Visibility Data&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now that we have reviewed likelihood functions, let&amp;rsquo;s turn back to radio astronomy and go into further detail &lt;em&gt;how&lt;/em&gt; the visibility function is sampled. Recall that the visibility domain is the Fourier transform of the image sky brightness \(\mathcal{V} \leftrightharpoons I\).&lt;/p&gt;
&lt;p&gt;The visibility function is complex-valued, and each measurement of it (denoted by \(V_i\)) is made in the presence of noise&lt;/p&gt;
&lt;p&gt;$$
V_i = \mathcal{V}(u_i, v_i) + \epsilon.
$$&lt;/p&gt;
&lt;p&gt;Here \(\epsilon\) represents a noise realization from a &lt;a href="https://en.wikipedia.org/wiki/Complex_normal_distribution"&gt;complex normal&lt;/a&gt; (Gaussian) distribution. Thankfully, most interferometric datasets &lt;em&gt;do not&lt;/em&gt; exhibit significant covariance between the real and imaginary noise components &lt;em&gt;and&lt;/em&gt; the distributions of the values are similar, so we could equivalently say that the real and imaginary components of the noise are separately generated by draws from normal distributions characterized by standard deviation \(\sigma\)&lt;/p&gt;
&lt;p&gt;$$
\epsilon_\Re \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;$$
\epsilon_\Im \sim \mathcal{N}(0, \sigma)
$$&lt;/p&gt;
&lt;p&gt;where \(\sigma\) is a real-valued quantity. If the units of the visibility function are Janskys, then the units of \(\sigma\) are also Janskys.&lt;/p&gt;
&lt;p&gt;The full complex noise-draw is given by
$$
\epsilon = \epsilon_\Re + i \epsilon_\Im.
$$&lt;/p&gt;
&lt;p&gt;Radio interferometers will commonly represent the uncertainty on each visibility measurement by a &amp;ldquo;weight&amp;rdquo; \(w_i\), where&lt;/p&gt;
&lt;p&gt;$$
w_i = \frac{1}{\sigma_i^2}.
$$&lt;/p&gt;
&lt;p&gt;Like \(\sigma\), the weight itself is a real quantity, in this case having units of \(1/\mathrm{Jy}^2\).&lt;/p&gt;
&lt;p&gt;A full interferometric dataset is a collection of visibility measurements, which we represent by&lt;/p&gt;
&lt;p&gt;$$
\boldsymbol{V} = \{V_1, V_2, \ldots V_N\}_{i=1}^N
$$&lt;/p&gt;
&lt;p&gt;each one having a corresponding \(u_i, v_i\) coordinate. For reference, a typical ALMA dataset might contain a half-million individual visibility samples, acquired over a range of spatial frequencies.&lt;/p&gt;




&lt;h3 id="likelihood-functions-for-fourier-data"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#likelihood-functions-for-fourier-data"&gt;Likelihood functions for Fourier data&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we&amp;rsquo;ve introduced likelihood functions in general and the specifics of Fourier data, let&amp;rsquo;s talk about likelihood functions for inference with Fourier data. As before, our statement about the data generating process&lt;/p&gt;
&lt;p&gt;$$
V_i = \mathcal{V}(u_i, v_i) + \epsilon
$$&lt;/p&gt;
&lt;p&gt;leads us to the formulation of the likelihood function.&lt;/p&gt;
&lt;p&gt;First, let&amp;rsquo;s assume we have some model that we&amp;rsquo;d like to fit to our dataset. To be a forward model, it should be able to predict the value of the visibility function for any spatial frequency, i.e., we need to be able to calculate&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u, v) = M_\mathcal{V}(u, v |, \boldsymbol{\theta}).
$$&lt;/p&gt;
&lt;p&gt;Following the discussion about how the complex noise realization \(\epsilon\) is generated, this leads to a log likelihood function&lt;/p&gt;
&lt;p&gt;$$
\ln \mathcal{L}(\boldsymbol{V}|,\boldsymbol{\theta}) = - \frac{1}{2} \chi^2(\boldsymbol{V}|,\boldsymbol{\theta}) + C
$$&lt;/p&gt;
&lt;p&gt;Because the data and model are complex-valued, \(\chi^2\) is evaluated as&lt;/p&gt;
&lt;p&gt;$$
\chi^2(\boldsymbol{V}|,\boldsymbol{\theta}) = \sum_i^N \frac{|V_i - M_\mathcal{V}(u_i, v_i |,\boldsymbol{\theta})|^2}{\sigma_i^2}
$$&lt;/p&gt;
&lt;p&gt;where \(| |\) denotes the modulus squared. Equivalently, the calculation can be broken up into sums over the real and imaginary components of the visibility data and model&lt;/p&gt;
&lt;p&gt;$$
\chi^2(\boldsymbol{V}|,\boldsymbol{\theta}) = \sum_i^N \frac{(V_{\Re,i} - M_\mathcal{V,\Re}(u_i, v_i |,\boldsymbol{\theta}))^2}{\sigma_i^2} + \sum_i^N \frac{(V_{\Im,i} - M_\mathcal{V,\Im}(u_i, v_i |,\boldsymbol{\theta}))^2}{\sigma_i^2}.
$$&lt;/p&gt;
&lt;p&gt;Because images of the sky are real, therefore the real part of the visibility function must always be even and the imaginary part odd. The visibility function is Hermitian. This means that&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u, v) = \mathcal{V}^{*}(-u, -v).
$$&lt;/p&gt;
&lt;p&gt;So, if you make a measurement of \(\mathcal{V}(u, v)\), this means you have also made the same measurement of \(\mathcal{V}^{*}(-u, -v)\). If you are doing forward-modeling of the visibilities as we just described, you only need to use one of the Hermitian pairs, otherwise you will double count your measurements (this only turns out to be a scale factor in the likelihood for most analysis, so it&amp;rsquo;s technically OK). If you are gridding the visibilities to then image them, however, you will certainly want to include the Hermitian pairs. Otherwise, your image will not turn out to be real!&lt;/p&gt;




&lt;h3 id="point-source-sensitivity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#point-source-sensitivity"&gt;Point source sensitivity&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We&amp;rsquo;ll use our forward modeling formalism to fit for the flux of a point source.&lt;/p&gt;
&lt;p&gt;Se also &lt;a href="https://ui.adsabs.harvard.edu/abs/2022MNRAS.513.5790C/abstract"&gt;Casussus and Carcamo&lt;/a&gt; 2022, appendix A4.&lt;/p&gt;




&lt;h2 id="more-complex-visibility-models"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#more-complex-visibility-models"&gt;More complex visibility models&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;It&amp;rsquo;s difficult to reason about all but the simplest models directly in the Fourier plane, so usually models are constructed in the image plane \(M_I(l,m |,\boldsymbol{\theta})\) and then Fourier transformed (either analytically, or via the FFT) to construct visibility models&lt;/p&gt;
&lt;p&gt;$$
M_\mathcal{V}(u, v |, \boldsymbol{\theta}) \leftrightharpoons M_I(l,m |,\boldsymbol{\theta})
$$&lt;/p&gt;
&lt;p&gt;For marginally resolved sources, it&amp;rsquo;s common to fit simple models like a 2D Gaussian. We can write down an image plane model and then calculate its Fourier transform analytically.&lt;/p&gt;
&lt;p&gt;But, these could be more complicated models. For example, these models could be channel maps of carbon monoxide emission from a rotating protoplanetary disk (as in &lt;a href="https://ui.adsabs.harvard.edu/abs/2015ApJ...806..154C/abstract"&gt;Czekala et al. 2015&lt;/a&gt;, where \(\boldsymbol{\theta}\) contains parameters setting the structure of the disk), or rings of continuum emission from a protoplanetary disk (as in &lt;a href="https://ui.adsabs.harvard.edu/abs/2018ApJ...869L..48G/abstract"&gt;Guzmán et al. 2018&lt;/a&gt;, where \(\boldsymbol{\theta}\) contains parameters setting the sizes and locations of the rings).&lt;/p&gt;
&lt;p&gt;With the likelihood function specified, we can add prior probability distributions \(p(\boldsymbol{\theta})\), and calculate and explore the posterior probability distribution of the model parameters using algorithms like Markov Chain Monte Carlo. In this type of Bayesian inference, we&amp;rsquo;re usually using forward models constructed with a small to medium number of parameters (e.g., 10 - 30), like in the protoplanetary disk examples of &lt;a href="https://ui.adsabs.harvard.edu/abs/2015ApJ...806..154C/abstract"&gt;Czekala et al. 2015&lt;/a&gt; or &lt;a href="https://ui.adsabs.harvard.edu/abs/2018ApJ...869L..48G/abstract"&gt;Guzmán et al. 2018&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;All of these type of models would be called &lt;em&gt;parametric&lt;/em&gt; models, because we can represent the model using a finite set of parameters. E.g., for the Gaussian model, we have width in the major and minor axes, rotation angle, and 2D position. So these parameters fully represent the model. One thing you need to be concerned with inference using parametric models is whether you have the right model! If your source is actually a ring instead of a Gaussian, the posterior distribution of your parameters can be rendered meaningless.&lt;/p&gt;
&lt;p&gt;Forward modeling with (simple) parametric models can be very useful for understanding&lt;/p&gt;




&lt;h2 id="discussion-about-model-mis-specification-and-unsampled-visibilities-model-constraints"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/07-bayesian-inference/#discussion-about-model-mis-specification-and-unsampled-visibilities-model-constraints"&gt;Discussion about model mis-specification and unsampled visibilities (model constraints)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Could use u / v undersampling to constrain width or size, say.&lt;/p&gt;</description></item><item><title>2D Interferometry, PSFs, Gridding, and Dirty Images</title><link>https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/</link><pubDate>Thu, 22 Sep 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+6+part+1/1_tf6rbhb7"&gt;Recording of whiteboard component&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Slides [&lt;a href="beams-and-arrays.pdf"&gt;PDF&lt;/a&gt;] [&lt;a href="beams-and-arrays.key"&gt;Keynote&lt;/a&gt; (better for movies)]&lt;/li&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+6+part+2/1_5rwp9jwx"&gt;Audio for slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="review-of-last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#review-of-last-time"&gt;Review &lt;em&gt;of last time&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;We talked about R.A., Dec., and direction cosines as coordinates on the sky&lt;/li&gt;
&lt;li&gt;We introduced a two-element interferometer with multiply and add correlator backend&lt;/li&gt;
&lt;li&gt;We introduced the fringe pattern \(F(l)\) of a two-antenna interferometer, which is a cosine wave on the sky&lt;/li&gt;
&lt;li&gt;We discussed how the frequency of this fringe pattern (i.e., the &lt;em&gt;spatial frequency&lt;/em&gt;) changes as a function of baseline length between the two antennas&lt;/li&gt;
&lt;li&gt;We introduced the visibility function \(\mathcal{V}(u)\) as the Fourier transform of the sky-brightness \(I(l)\)&lt;/li&gt;
&lt;li&gt;And discussed how the output from the interferometer changes in response to a point source and an extended source&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#this-time"&gt;This time&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Extend our discussion to a general interferometer with north-south and east-west baselines&lt;/li&gt;
&lt;li&gt;Discuss arrays with multiple antennas&lt;/li&gt;
&lt;li&gt;Earth aperture synthesis&lt;/li&gt;
&lt;li&gt;\(u,v\) sampling distributions for various arrays&lt;/li&gt;
&lt;li&gt;point spread functions (PSFs) and their relationship to the sampling distribution&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="group-projects-and-presentations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#group-projects-and-presentations"&gt;Group Projects and Presentations&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Dates are set for presentations&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Review the syllabus for &lt;a href="https://iancze.github.io/courses/astro589/syllabus//#group-project-and-presentation"&gt;group projects&lt;/a&gt;&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Suggest that your group sets up a meeting with me to discuss topics&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="wrapping-up-1d"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#wrapping-up-1d"&gt;Wrapping up 1D&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Last time, we had said the response \(R(l)\) of the interferometer to some sky distribution \(I(l)\) is to act as a convolution
$$
R(l) = \cos(2 \pi u l) * I(l).
$$
In this case, you can think of the fringe function as a &lt;em&gt;point spread function&lt;/em&gt; (a terrible one), that is convolving out the true sky brightness distribution.&lt;/p&gt;
&lt;p&gt;Another way to look at this, though, is through our old friend the multiplication/convolution algorithm. Let \(u_0\) be the (fixed, instantaneous) baseline distance between two antennas, measured in multiples of the observing wavelength
$$
u = \frac{D \cos \theta}{\lambda}
$$
where \(\theta\) is the angle from zenith.&lt;/p&gt;
&lt;p&gt;Then, the Fourier pair of the fringe function is
$$
\cos(2 \pi u_0 l) \leftrightharpoons \frac{1}{2} [\delta(u + u_0) + \delta(u - u_0)].
$$
These are two delta functions situated at \(\pm u_0\).&lt;/p&gt;
&lt;p&gt;The Fourier pair of the sky brightness distribution is called the &lt;em&gt;visibility function&lt;/em&gt;
$$
I(l) \leftrightharpoons \mathcal{V}(u).
$$&lt;/p&gt;
&lt;p&gt;In the Fourier plane, the response of the interferometer is a multiplication
$$
\cos(2 \pi u l) * I(l) \leftrightharpoons \frac{1}{2} [\delta(u + u_0) + \delta(u - u_0)] \times \mathcal{V}(u)
$$
i.e., the interferometer has &lt;em&gt;sampled&lt;/em&gt; the visibility function at locations \(\pm u_0\) corresponding to the baseline distance of the two antennas.&lt;/p&gt;
&lt;p&gt;Another way to think about this is that the interferometer acts as a &lt;em&gt;spatial filter&lt;/em&gt; that only responds to the two spatial frequencies \(\pm u_0\). \(\mathcal{V}(u)\) represents the amplitude and phase of the sinusoidal component of the intensity distribution with spatial frequency \(u\) cycles per radian. The negative spatial frequency doesn&amp;rsquo;t have a physical meaning but is a mathematical convenience. Because the intensity distribution on the sky is a real quantity, the visibility function itself is symmetric about the origin in a Hermitian sense, meaning it has real even parts and odd imaginary parts.&lt;/p&gt;




&lt;h2 id="moving-on-to-the-general-case"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#moving-on-to-the-general-case"&gt;Moving on to the general case&lt;/a&gt;
&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;We&amp;rsquo;re going to take our two-element interferometer and re-derive the same relationships using a general vector formalism&lt;/li&gt;
&lt;li&gt;Then, we&amp;rsquo;re going to introduce a general 3D cartesian coordinate set to the problem, which is used by most interferometers&lt;/li&gt;
&lt;/ol&gt;




&lt;h3 id="coordinates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#coordinates"&gt;Coordinates&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s consider a generic situation of a two-element interferometer observing (tracking) a source on the sky with phase center \(\mathbf{s}_0\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/tms-general-coord.png"
 alt="TMS Fig 3.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;TMS Fig 3.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;&lt;strong&gt;Antenna power pattern&lt;/strong&gt;: An element of the source with solid angle \(d\Omega\) at some position \(\mathbf{s} = \mathbf{s}_0 + \mathbf{\sigma}\) will contribute an element of power \( \frac{1}{2} A(\sigma) I(\sigma) \Delta \nu d\Omega\), where \(A\) is some (normalized) power pattern of a single antenna. For now, you can consider it to be a directionally smooth function that is effectively constant over the field of view of interest.&lt;/p&gt;
&lt;p&gt;From what we just talked about for the 1D example, the component of the correlator output will be equal to the received power and to the fringe term \(\cos(2 \pi \nu \tau_g)\).&lt;/p&gt;
&lt;p&gt;Let \(\mathbf{D}_\lambda\) be a baseline vector which points from the central antenna to the other one, and specifies the baseline length in multiples of the observing wavelength. Then&lt;/p&gt;
&lt;p&gt;$$
\nu \tau_g = \mathbf{D}_\lambda \cdot \mathbf{s} = \mathbf{D} \cdot (\mathbf{s}_0 + \mathbf{\sigma}).
$$&lt;/p&gt;
&lt;p&gt;To calculate the output from the correlator, we need to integrate over the spatial distribution of the source&lt;/p&gt;
&lt;p&gt;$$
r(D_\lambda, s_0) = \Delta \nu \int_{4 \pi} A(\mathbf{\sigma}) I(\sigma) \cos [2 \pi D_\lambda \cdot (s_0 + \mathbf{\sigma})]\,\mathrm{d}\Omega
$$&lt;/p&gt;
&lt;p&gt;Here we see an opportunity to use our sine/cosine difference angle formulae again to split this up into sine and cosine components and then use Euler&amp;rsquo;s formula to put it back together.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s define the complex visibility as
$$
\mathcal{V} = \int_{4 \pi} A_N(\sigma) I(\sigma) e^{-i 2 \pi D_\lambda \cdot \sigma}\,\mathrm{d}\Omega
$$
which I hope you agree looks suspiciously like a Fourier transform.&lt;/p&gt;
&lt;p&gt;When an interferometer observes a source, it is sampling the visibility function at these points (corresponding to a spatial frequency of \(\pm \mathbf{D}_\lambda \cdot \mathbf{s}_0 \)). You can think of this measurement as just recording the real and imaginary values of the visibility function, or alternatively, some &lt;em&gt;visibility amplitude&lt;/em&gt; and some &lt;em&gt;visibility phase&lt;/em&gt;
$$
\mathcal{V} = |\mathcal{V}|e^{i \phi}.
$$&lt;/p&gt;




&lt;h3 id="3d-coordinates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#3d-coordinates"&gt;3D coordinates&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we&amp;rsquo;ve introduced how the visibility function comes about in a general vector formalism, let&amp;rsquo;s get concrete with respect to coordinates on Earth and in the sky.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/tms-uv-curved.png"
 alt="Credit: TMS Fig 3.2. Note that the \(l\) and \(m\) coordinates technically index the flat image plane tangent to \(\mathbf{s_0}\), not curved as they are shown here."&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: TMS Fig 3.2. Note that the \(l\) and \(m\) coordinates technically index the &lt;em&gt;flat&lt;/em&gt; image plane tangent to \(\mathbf{s_0}\), not curved as they are shown here.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Let&amp;rsquo;s focus in on the \(u,v, w\) coordinate system in the bottom of the figure. The plane is centered at the location of one of the antennas \(u=0,v=0, w=0\). We can also draw our baseline vector \(\mathbf{D}_\lambda\) in this coordinate system, pointing to the other antenna. The \(w\) unit vector is pointing towards phase center \(\mathbf{s}_0\) (i.e., the direction of the source).&lt;/p&gt;
&lt;p&gt;Another way of saying this is
$$
\mathbf{D}_\lambda \cdot \mathbf{s}_0 = w.
$$
So, you can think of the \(u,v\) plane \(w = 0\) as oriented orthogonal to the vector pointing towards phase center. But, N.B. that the baseline vector &lt;em&gt;itself&lt;/em&gt; does not necessarily live in the \(w = 0\) plane, it can have a non-zero \(w\) component.&lt;/p&gt;
&lt;p&gt;Using this coordinate system, let&amp;rsquo;s focus on rewriting the \(\mathbf{D}_\lambda \cdot \mathbf{s}\) term. Recall that the dot product between two vectors is&lt;/p&gt;
&lt;p&gt;$$
\mathbf{a} \cdot \mathbf{b} = ||a|| \; ||b|| \cos \theta.
$$
Therefore we have
$$
\mathbf{D}_\lambda \cdot \mathbf{s} = \left ( ul + vm + wn \right).
$$
Wow, that was simple. Hopefully now you see why it was convenient to use \(l, m\) as direction cosines!&lt;/p&gt;
&lt;p&gt;\(n\) is the third direction cosine and is w.r.t. the \(w\) axis. It is not independent of \(l, m\) and can be written in terms of them as
$$
n = \sqrt{1 - l^2 - m^2}.
$$&lt;/p&gt;
&lt;p&gt;So we would normally write
$$
\mathbf{D}_\lambda \cdot \mathbf{s} = \left ( ul + vm + w\sqrt{1 - l^2 - m^2} \right).
$$&lt;/p&gt;
&lt;p&gt;This factor also appears in the solid angle differential. As we move from phase center, the solid angle is changed by a factor
$$
d \Omega = \frac{\mathrm{d}l\; \mathrm{d} m}{\sqrt{1 - l^2 - m^2}}.
$$
This is adjusting for the fact that the solid angle is something on the celestial sphere, but we are measuring it using the direction cosines on the tangent plane.&lt;/p&gt;
&lt;p&gt;With these relationships in hand, we can rewrite the visibility function as
$$
\mathcal{V}(u, v, w) = \int_{-\infty}^\infty \int_{-\infty}^\infty A_N(l,m) I(l,m) \exp \left \{ -i 2 \pi \left [ ul + vm + w \left ( \sqrt{1 - l^2 - m^2} - 1 \right )\right ] \right \} \;\frac{\mathrm{d}l\;\mathrm{d}m}{\sqrt{1 - l^2 - m^2}}.
$$&lt;/p&gt;
&lt;p&gt;The factor in the exponential comes about from the measurement of angular position with respect to phase center (\(\mathbf{D}_\lambda \cdot \mathbf{s})\), as we saw in the &amp;ldquo;general coordinates&amp;rdquo; example.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;If&lt;/em&gt; all of the measurements could be made with the antennas in a plane normal to the \(w\) direction such that \(w=0\), then we would turn this equation into an exact 2D transform. But this isn&amp;rsquo;t usually the case and we need to make approximations.&lt;/p&gt;
&lt;p&gt;So long as we are in the small-field regime and \(l\) and \(m\) are small enough such that the term
$$
\left ( \sqrt{1 - l^2 - m^2} - 1 \right ) w
$$
can be neglected (\(\simeq - \frac{1}{2}(l^2 + m^2)w\) in this regime), then we have
$$
\mathcal{V}(u, v, w) \simeq \mathcal{V}(u, v, 0) = \int_{-\infty}^\infty \int_{-\infty}^\infty \frac{A_N(l,m) I(l,m)}{\sqrt{1 - l^2 - m^2}} \exp \left \{ -i 2 \pi \left [ ul + vm \right ] \right \} \;\mathrm{d}l\;\mathrm{d}m.
$$&lt;/p&gt;
&lt;p&gt;OK! So we&amp;rsquo;ve arrived at the result that I told you about at the beginning of last week&amp;rsquo;s class, that the visibility function is the Fourier transform of the sky brightness (modified by the primary beam of each antenna, which we can mostly ignore for this discussion as a constant). The approximation we made for the \(w\) term places a limit on the maximum size of the field that we can image (at once). There are approaches designed to overcome this scenario, but at least in the context of this course we will restrict our discussion to those that don&amp;rsquo;t require it. This is generally the case for all images made with VLA or ALMA for a single pointing (i.e., imaging the full primary beam). If you use multiple pointings of the array (generally called &amp;ldquo;mosaicing&amp;rdquo;) to make an even larger image, you&amp;rsquo;ll need to take into account the effects of the \(w\) term.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revisiting &amp;ldquo;slightly extended source&lt;/strong&gt;: A &amp;ldquo;slightly extended source&amp;rdquo; is something that is larger than the dirty beam but smaller than the primary beam of each telescope. Instantaneous field of view of an interferometer is the same as the primary beam of each telescope, treated as a single dish (see previous section). Each single dish antenna is still seeing the same thing as before, it&amp;rsquo;s just that we have a correlator backend that&amp;rsquo;s doing things with the signals, allowing us to create a &lt;em&gt;synthesized beam&lt;/em&gt; that is considerably smaller than the size of the primary beam. For example, at 220 GHz (band 6), ALMA has a primary beam of about 20 arcseconds in diameter. However, it&amp;rsquo;s common to make synthesized beams on the size of 0.1 arcseconds or smaller.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s also briefly discuss what the \(u,v\) coordinate plane looks like, now that we are in 2D:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/2D-uv-plane.png"
 alt="Credit: TMS Fig 2.7"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: TMS Fig 2.7&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="units-of-mathcalv"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/06-psfs-and-gridding/#units-of-mathcalv"&gt;Units of \(\mathcal{V}\)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;What are the units of \(\mathcal{V}\) itself? We can get at this by looking at the units of \(I_\nu(l,m)\) and how we carried out the Fourier transform integral (in its simplified form).&lt;/p&gt;
&lt;p&gt;$$
\mathcal{V}(u, v) = \int_{-\infty}^\infty \int_{-\infty}^\infty I(l,m) \exp \left \{ -i 2 \pi \left [ ul + vm \right ] \right \} \;\mathrm{d}l\;\mathrm{d}m.
$$&lt;/p&gt;
&lt;p&gt;If we parameterized our image using \(\mathrm{Jy} / \mathrm{arcsec}^2\) and we integrated over \( \mathrm{d}l\, \mathrm{d}m\) (both assuming they had units of arcsec), then \(\mathcal{V}\) must have units of Jy. I.e., you can think of it sort of like the flux being observed at that angular scale. The visibility function is complex-valued, so if you want to discuss the &amp;ldquo;power&amp;rdquo; of an image at some angular scale then you should consider \(|\mathcal{V}|^2\).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Slides&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Interferometry in Practice</title><link>https://iancze.github.io/courses/astro589/lectures/05-interferometry/</link><pubDate>Sun, 18 Sep 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/05-interferometry/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+5/1_2zi2doy2"&gt;Video Recording&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="review-of-last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#review-of-last-time"&gt;Review &lt;em&gt;of last time&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Interpolation kernels&lt;/li&gt;
&lt;li&gt;Discrete Fourier Transform (DFT)&lt;/li&gt;
&lt;li&gt;Fast Fourier Transform (FFT)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#this-time"&gt;This time&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The punchline of today is that the complex-valued visibility function \(\mathcal{V}\) is the 2D Fourier transform of an image on the celestial sphere (with a small field of view)&lt;/p&gt;
&lt;p&gt;$$
I(l, m) \leftrightharpoons \mathcal{V}(u, v)
$$&lt;/p&gt;
&lt;p&gt;and it is the visibility function that interferometers measure directly. The values of \(u, v\) for which interferometers are able to measure the visibility function depend on how the array of antennas is laid out and the spacing between them.&lt;/p&gt;
&lt;p&gt;Most of today&amp;rsquo;s lecture will follow Chapters 2 and 3 of &lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson. First, we will introduce a two-element interferometer and it&amp;rsquo;s response to a point source. Then, we&amp;rsquo;ll complexify this a bit to talk about an extended source (but still in 1D). Then, we&amp;rsquo;ll move on to discuss intensity distributions and the visibility function in the general case and then derive the relationship between \(I(l, m) \leftrightharpoons \mathcal{V}(u, v)\).&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ll first develop the geometry and math of this relationship for small fields of view, so that you understand the result, at least in an abstract manner. Then we&amp;rsquo;ll spend the latter part of the course working through how a radio interferometer like the VLA or ALMA works to actually sample the visibility function.&lt;/p&gt;




&lt;h2 id="ra-and-dec"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#ra-and-dec"&gt;R.A. and Dec&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s review our coordinates for images on the celestial sphere.&lt;/p&gt;
&lt;p&gt;Declination&amp;rsquo;s the easier one, in my opinion. No matter where you are, if you move in declination, you move along a great circle (i.e., a circle that actually traces the circumference of the celestial sphere). We measure this in terms of 0 (celestial equator) to +90 degrees at the north celestial pole and -90 degrees at the southern celestial pole. You can split a degree into 60 arcminutes and an arcminute into 60 arcseconds.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://skyandtelescope.org/wp-content/uploads/RA-Dec-wiki-Tom-RuenCC-BY-SA-3.0.jpg"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/05-interferometry/celestial-sphere.jpg"
 alt="Credit: Sky and Telescope"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Sky and Telescope&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Right Ascension is the one that sometimes trips me up. Because the sky rotates, we have this system of marking it using sidereal &lt;em&gt;time&lt;/em&gt;: 24 &lt;em&gt;hours&lt;/em&gt; of right ascension, sometimes broken up into 60 &lt;em&gt;minutes&lt;/em&gt;, and 60 &lt;em&gt;seconds&lt;/em&gt;. Although they are still in multiples of 60, these &lt;em&gt;minutes&lt;/em&gt; and &lt;em&gt;seconds&lt;/em&gt; we use for right ascension do not have the same angular size as arcminutes and arcseconds (even if we are on the celestial equator). For example, let&amp;rsquo;s say we have two points on the sky.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;p1 = '00h42m00s', '+41d12m'
p2 = '00h42m01s', '+41d12m'
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Same declination, but their R.A. values differ by one second. Can anyone guess how many &lt;em&gt;arcseconds&lt;/em&gt; separate these points? The answer is about 11.3 arcseconds.&lt;/p&gt;
&lt;p&gt;Usually, when we are talking about observing a smaller source (like a protoplanetary disk, or galaxy), we point in some direction towards that object and then define a small little postage stamp, commonly in units of \(\Delta \delta \) and \(\Delta \alpha \cos \delta\), in which case, the units describing the image are arcseconds. We&amp;rsquo;re still talking about spherical astronomy though, so this isn&amp;rsquo;t necessarily limited to small fields of view. We could have \(\Delta \delta\) be several degrees, for example.&lt;/p&gt;
&lt;p&gt;TODO: Figure: point relative to center, and then Delta directions coming off of it.&lt;/p&gt;




&lt;h3 id="direction-cosines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#direction-cosines"&gt;Direction cosines&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;In a moment, we&amp;rsquo;re going talk about the mechanics of interferometers observing the celestial sphere and how these relate to Fourier transforms. Before we talk about that, though, there&amp;rsquo;s a concept I want to introduce while we&amp;rsquo;re still talking about units for images.&lt;/p&gt;
&lt;p&gt;Practically speaking, most images we might make with ALMA or the VLA will have a small field of view (&amp;lt; 1 arcminute). In this regime, it simplifies a lot to talk about &amp;ldquo;flat&amp;rdquo; images, i.e., image planes that are tangent to the field center. In 1D, it would look like this&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/05-interferometry/direction-cosine.png"
 alt="The concept of the direction cosine. Credit: Fig 3.3 TMS"&gt;&lt;figcaption&gt;
 &lt;p&gt;The concept of the direction cosine. Credit: Fig 3.3 TMS&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The direction cosines are
$$
l = \sin(\Delta \alpha \cos \delta)
$$
and
$$
m = \sin(\Delta \delta)
$$
relative to the phase center. You can see from the figure that it is the direction cosine that is actually tracking the position on the tangent plane, where we are defining our image. I know I just wrote down \(\sin\) but I called these direction cosines. The term comes about from the way you&amp;rsquo;d set up the problem in two dimensions, where you might use cosine of the complementary angle instead, but it&amp;rsquo;s just a matter of convention.&lt;/p&gt;
&lt;p&gt;Because they are outputs from trigonometric functions, they are technically unitless. Though l and m are technically unitless and measures of &lt;em&gt;linear&lt;/em&gt; distance, for small angular extent, they could also be considered to have units of radians. So it will be common that we refer to
$$
l = \sin(\Delta \alpha \cos \delta) \approx \Delta \alpha \cos \delta
$$
and
$$
m = \sin(\Delta \delta) \approx \Delta \delta.
$$&lt;/p&gt;
&lt;p&gt;This probably sounds pointless, since we just arrived back at the same units we started with. Hopefully the reasons why we might wish to use these units will become apparent after we cover more about the interferometer. And remember that you can always &lt;em&gt;exactly&lt;/em&gt; convert from direction cosine back to angular usits (e.g. \(\Delta \alpha \cos \delta\)) by doing \(\sin^{-1}\), it&amp;rsquo;s just that for most small angles we&amp;rsquo;ll be dealing with, this operation is essentially an identity function. All of this goes out the window when we consider wide-field imaging (which we won&amp;rsquo;t have time to talk about in this course, though you might consider as a topic for a course project).&lt;/p&gt;
&lt;hr&gt;




&lt;h2 id="introduction-to-a-2-element-interferometer"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#introduction-to-a-2-element-interferometer"&gt;Introduction to a 2-element interferometer&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider this geometric situation&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/05-interferometry/elementary-interferometer.png"
 alt="Credit: TMS Fig 2.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: TMS Fig 2.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The antennas are spaced directly east-west, and they are observing a source in the &lt;em&gt;far-field&lt;/em&gt;, i.e., the radiation from a distant cosmic source appears as a plane wave. First, we will consider the case of a &lt;em&gt;point-source&lt;/em&gt;; later we will extend this formalism to spatially extended sources. We&amp;rsquo;ll assume the primary beam of each antenna is large, such that they can observe radiation from a source located a wide range of \(\theta\) angles. We&amp;rsquo;ll assume that we&amp;rsquo;re observing in a narrow slice of frequency around \(\nu\), essentially monochromatic.&lt;/p&gt;
&lt;p&gt;The wavefront from the source arrives at the right antenna some time
$$
\tau_g = \frac{D}{c} \sin \theta
$$
before it reaches the left one. This is called the &lt;em&gt;geometric time delay&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Each antenna has its own signal voltage stream:
$$
V_1 = \sin 2 \pi \nu t
$$
and
$$
V_2 = \sin 2 \pi \nu (t - \tau_g).
$$&lt;/p&gt;
&lt;p&gt;These streams are multiplied together in a &lt;em&gt;correlator&lt;/em&gt; and then time-averaged over some interval. The output of the correlator is proportional to
$$
F(t, \tau_g) = \sin (2 \pi \nu t) \sin 2 \pi \nu (t - \tau_g),
$$
which we can expand using our trig sum identities for sine to
$$
F(t, \tau_g) = \sin^2(2 \pi \nu t) \cos(2 \pi \nu \tau_g) - \sin(2 \pi \nu t)\cos(2 \pi \nu t) \sin(2 \pi \nu \tau_g).
$$&lt;/p&gt;
&lt;p&gt;We can simplify this equation based on our knowledge that the correlator multiplies and then adds (integrates, typically for a few seconds).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The central frequency \(\nu\) is on the order of 10s of MHz to nearly a THz&lt;/li&gt;
&lt;li&gt;\(\theta\) (baked into \(\tau_g\)) is rotating at the Earth&amp;rsquo;s rotational velocity, which is \(10^{-4}\;\mathrm{rad\,s}^{-1}\).&lt;/li&gt;
&lt;li&gt;\(D\) must be smaller than \(10^7\;\)m for terrestrial baselines&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This means that the rate of variation of \(\nu \tau_g \ll \nu t\) by several orders of magnitude.&lt;/p&gt;
&lt;p&gt;So long as our averaging interval is \(T \gg 1/\nu\) (which is satisfied by a typical multi-second integration),
$$
\langle \sin^2 (2 \pi \nu t) \rangle = 1/2
$$
and
$$
\langle \sin(2 \pi \nu t)\cos(2 \pi \nu t) \rangle = 0
$$
so we&amp;rsquo;re left with
$$
F \propto \cos (2 \pi \nu \tau_g).
$$
We can also define \(l = \sin \theta\) and then we can write
$$
F \propto \cos (2 \pi \nu \tau_g) = \cos \left (\frac{2 \pi D l}{\lambda} \right ).
$$&lt;/p&gt;
&lt;p&gt;This is called the &lt;em&gt;fringe function&lt;/em&gt; of a two-element interferometer&lt;/p&gt;
&lt;p&gt;TODO: draw as a linear relationship vs. \(l\), i.e., an oscillating sine wave
TODO: then include the fringe plot itself&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/05-interferometry/fringe.png"
 alt="The fringe function (plotted here as \(|F|\)) can be thought of as the directional power pattern of the interferometer in the case the antennas are isotropic. Credit: TMS Fig 2.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;The fringe function (plotted here as \(|F|\)) can be thought of as the directional power pattern of the interferometer in the case the antennas are isotropic. Credit: TMS Fig 2.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;So, you see that the 2-element interferometer has a sine/cosine sensitivity to the sky along the east-west axis. It has no sensitivity along the north-south direction.&lt;/p&gt;




&lt;h2 id="2-element-interferometer-for-a-spatially-resolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#2-element-interferometer-for-a-spatially-resolved-source"&gt;2-element interferometer for a spatially resolved source&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now we&amp;rsquo;ll consider a slightly more complex interferometer.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The antennas are (somewhat) directional&lt;/li&gt;
&lt;li&gt;They track the source as it moves across the sky, from the rotation of the Earth. This introduces an &lt;em&gt;instrumental&lt;/em&gt; time delay&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;$$
\tau_i = \frac{D}{c} \sin \theta_0.
$$
I.e., this keeps the waveforms in sync so long as we&amp;rsquo;re looking directly at \(\theta_0\).&lt;/p&gt;
&lt;p&gt;The direction the antennas are pointed is called the &lt;em&gt;phase reference position&lt;/em&gt;, which we&amp;rsquo;ll denote as \(\theta_0\) and tracks the source as it rotates across the sky.&lt;/p&gt;
&lt;p&gt;TODO: make a figure showing \(\Delta \theta\) offset.&lt;/p&gt;
&lt;p&gt;And let us consider radiation from a direction \(\theta_0 - \Delta \theta\), where \(\Delta \theta\) is a small angle. As before, the fringe response term is
$$
\cos (2 \pi \nu \tau) = \cos \left \{ 2 \pi \nu \left [ \frac{D}{c} \sin (\theta_0 - \Delta \theta) - \tau_i \right ] \right \}.
$$
Using the sine formulas for difference and simplifying with \(\cos \Delta \simeq 1\) for small angles, we have
$$
\cos (2 \pi \nu \tau) \simeq \cos \left [ 2 \pi \nu \frac{D}{c} \sin \Delta \theta \cos \theta_0 \right].
$$&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s stare at this equation a bit more. Assuming we&amp;rsquo;re holding observing frequency fixed, the angular resolution of the fringes is determined by the projected length of the baseline orthogonal to the direction of the source, which is \(D \cos \theta_0\). This is a pretty ordinary physical measurement of a distance, i.e., we would measure it to be something like 50 &lt;em&gt;meters&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Of course, observing frequency also makes a difference. If we&amp;rsquo;re observing at higher frequencies (shorter \(\lambda\)), the fringe resolution is going to better (this is just another form of the \(\lambda/D\) resolution relationship for telescopes showing up).&lt;/p&gt;
&lt;p&gt;So, we can define a new variable for this projected baseline length
$$
u = \frac{D \cos \theta_0}{\lambda} = \frac{\nu_0 D \cos \theta_0}{c}.
$$
\(u\) is &lt;em&gt;the number of wavelengths&lt;/em&gt; (at that observing wavelength) that are needed to span the projected baseline length. It is measured in multiples of &amp;ldquo;\(\lambda\),&amp;rdquo; i.e., you might see a baseline length described as \(u = \) 300 kilolambda. \(u\) is called a &lt;em&gt;spatial frequency&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Now we will redefine our sky coordinate variable \(l = \sin \Delta \theta\), and we find that we can write the fringe response as
$$
F(l) \propto \cos (2 \pi \nu_0 \tau) \propto \cos (2 \pi u l).
$$&lt;/p&gt;
&lt;p&gt;If \(u\) gets larger (either by moving the antennas further apart and increasing the projected baseline, or by observing at a higher frequency), then the spatial resolution (spacing of the fringes) will get better.&lt;/p&gt;
&lt;p&gt;We see that the quantity \((ul)\) appears inside of a trigonometric function, so this means the quantity must be dimensionless.&lt;/p&gt;
&lt;p&gt;This motivates the many different ways we can think about these variables in the image plane and the visibility domain.&lt;/p&gt;




&lt;h3 id="unitless"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#unitless"&gt;Unitless&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The first way is to recognize that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(l\) is technically unitless, since it is \(\sin(\Delta \theta)\)&lt;/li&gt;
&lt;li&gt;\(u\) itself is also technically unitless, it&amp;rsquo;s just the baseline length measured in a number of wavelengths (e.g., kilolambda)&lt;/li&gt;
&lt;li&gt;Inside this \(\cos\) term, though, \(u\) plays the role of a &lt;em&gt;frequency&lt;/em&gt;, i.e., &amp;ldquo;cycles per unit \(l\)&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Both of these variables do correspond to actual distances, the interferometer does have some baseline, which corresponds to its ability to resolve a source of some actual size on the sky. It&amp;rsquo;s just with this way of thinking about it, we measured both of those sizes using dimensionless units (and that&amp;rsquo;s OK)!&lt;/p&gt;




&lt;h3 id="unitful"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#unitful"&gt;Unitful&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We can put a little bit more sense to this by bringing back the small angle approximation, and saying that because \(\Delta \theta\) is small, then \(l = \sin \Delta \theta \approx \Delta \theta\), and so it is as if we measured \(l\) in some angular unit, like radians or arcseconds.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;If \(l\) is small, then we would say that it can be measured in radians (which can then be converted to arcseconds)&lt;/li&gt;
&lt;li&gt;Then \(u\) is measured in cycles/radian or cycles/arcsec&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Because of this small angle approximation for the interferometer geometry, it allows us to equate &amp;ldquo;multiples of lambda&amp;rdquo; to &amp;ldquo;cycles per angle&amp;rdquo; as a spatial frequency.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s walk through an example, and say that we&amp;rsquo;re observing a&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;do small angle approximation to say l in units of radians&lt;/li&gt;
&lt;li&gt;convert to u&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;TODO: redraw waveform. This makes a lot of sense if you just draw the waveform up on the sky.&lt;/p&gt;
&lt;p&gt;Compared to last time, we now assumed some directional sensitivity for each antenna, such as this power pattern&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/05-interferometry/primary-beam.png"
 alt="Credit: Tools of Radio Astronomy, Fig 7.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Tools of Radio Astronomy, Fig 7.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="http://www.aoc.nrao.edu/events/synthesis/2022/slides/Fundamentals-2022.pdf"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/05-interferometry/fringe-beam.png"
 alt="The fringe pattern modified by the antenna power pattern. Credit: Rick Perley&amp;rsquo;s slides, NRAO summer synthesis imaging school 2022."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The fringe pattern modified by the antenna power pattern. Credit: Rick Perley&amp;rsquo;s slides, NRAO summer synthesis imaging school 2022.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Recap: so now we&amp;rsquo;ve redefined the fringe function to talk about the response to a spatially resolved source, as a function of projected baseline length. And, we&amp;rsquo;ve introduced the concept of spatial frequency.&lt;/p&gt;




&lt;h3 id="fourier-transform-relationship"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/05-interferometry/#fourier-transform-relationship"&gt;Fourier transform relationship&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We just derived and drew the fringe function as the response of the interferometer &lt;em&gt;on the sky&lt;/em&gt; and examined how it changes as we change the position of the antennas on the ground. The essential response \(R(l)\) of the interferometer to some sky distribution \(I(l)\) is to act as a convolution
$$
R(l) = \cos(2 \pi u l) * I(l)
$$&lt;/p&gt;
&lt;p&gt;Another way to look at this is to consider the Fourier pair of the fringe function when the antennas are at a fixed (instantaneous) baseline distance \(u_0\),
$$
\cos(2 \pi u_0 l) \leftrightharpoons \frac{1}{2} [\delta(u + u_0) + \delta(u - u_0)].
$$&lt;/p&gt;
&lt;p&gt;Let us define the &lt;em&gt;visibility function&lt;/em&gt; \(\mathcal{V}(u)\) as the Fourier transform of the sky brightness distribution (the true one)&lt;/p&gt;
&lt;p&gt;$$
I(l) \leftrightharpoons \mathcal{V}(u).
$$
\(\mathcal{V}(u)\) represents the amplitude and phase of the sinusoidal component of the intensity distribution with spatial frequency \(u\) cycles per radian.&lt;/p&gt;
&lt;p&gt;Then, we can bring out our old friend the multiplication/convolution algorithm.
$$
\cos(2 \pi u l) * I(l) \leftrightharpoons \frac{\mathcal{V}(u)}{2} [\delta(u + u_0) + \delta(u - u_0)].
$$&lt;/p&gt;</description></item><item><title>The DFT and FFT</title><link>https://iancze.github.io/courses/astro589/lectures/04-fft/</link><pubDate>Tue, 13 Sep 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/04-fft/</guid><description>



&lt;h2 id="zoom-links"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#zoom-links"&gt;Zoom links&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+4+part+1/1_ohzd132x"&gt;Part I (whiteboard)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+4+part+2/1_p10hq79o"&gt;Part II (slides)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/34517505"&gt;Fourier Analysis and Imaging&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampling_theorem"&gt;Wikipedia on Nyqist-Shannon sampling theorem&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="review-of-last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#review-of-last-time"&gt;Review &lt;em&gt;of last time&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Convolution/multiplication theorem&lt;/li&gt;
&lt;li&gt;Band-limited functions&lt;/li&gt;
&lt;li&gt;Nyquist-Shannon sampling theorem&lt;/li&gt;
&lt;li&gt;Aliasing&lt;/li&gt;
&lt;li&gt;&amp;ldquo;Band-limited&amp;rdquo; interpolation with sinc kernel&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="this-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#this-time"&gt;This time:&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Convolution/Interpolation with different kernels&lt;/li&gt;
&lt;li&gt;Discrete Fourier Transform (DFT)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="convolutional-kernels"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#convolutional-kernels"&gt;Convolutional kernels&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we&amp;rsquo;ve developed our understanding of band-limited functions, sampling, and restoration, let&amp;rsquo;s make one more point about interpolation and convolutional kernels.&lt;/p&gt;
&lt;p&gt;We just said that in order to restore a continuous function from a set of samples, we needed to do sinc-interpolation. What happens if we do one of the more commonly used forms of interpolation, like nearest neighbor or linear?&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://octave.sourceforge.io/octave/function/interp1.html"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/04-fft/nearest-neighbor-vs-linear.png"
 alt="Credit: Octave"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Octave&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;We can cast each of these as convolution with a different kind of kernel.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.researchgate.net/figure/Interpolation-kernels-in-one-dimension-The-width-of-the-support-is-shown-below_fig3_10624120"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/04-fft/interpolation-kernels.png"
 alt="Credit Jeffrey Tsao."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit Jeffrey Tsao.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The nearest neighbor kernel will have a corresponding Fourier transform of a sinc and the linear kernel will have a \(\mathrm{sinc}^2\). These functions all have non-zero values above the cutoff frequency. Of these, &lt;em&gt;only&lt;/em&gt; the sinc function has zero amplitude beyond the cutoff frequency.&lt;/p&gt;
&lt;p&gt;So reconstruction of a function using linear interpolation, for example, will introduce higher-frequency features, like these sharp transitions around each data point. Depending on what you&amp;rsquo;re doing the interpolation for, sometimes this matters, sometimes it doesn&amp;rsquo;t. Later on, when we talk about &amp;ldquo;gridding&amp;rdquo; of visibility data from radio interferometers, the type of interpolation used will have a big impact on the dynamic range of the resulting images.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q&lt;/strong&gt;: Why is sinc interpolation generally &lt;em&gt;not&lt;/em&gt; used in practice? Because the kernel size is large/infinite, you&amp;rsquo;re actually using all of the data points in each and every interpolation and this can be computationally prohibitive for many applications. Compare that to linear interpolation, which only uses the two nearest points.&lt;/p&gt;




&lt;h2 id="the-discrete-fourier-transform-dft"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#the-discrete-fourier-transform-dft"&gt;The Discrete Fourier transform (DFT)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now we&amp;rsquo;ll talk about how we deal with samples of data. We&amp;rsquo;ll stick with the same example that we&amp;rsquo;re dealing with a function of time. But rather than \(t\), having units of seconds, we&amp;rsquo;ll simply label each data point by an index \(m\) which takes on non-negative, integer values like \(m = 0, 1, \ldots, N\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/04-fft/DFT-samples.png"
 alt="Credit: Bracewell Fig 11.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 11.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The forward discrete Fourier transform (DFT) is
$$
F_k = \sum_{m=0}^{N-1} f_m \exp \left ( - 2 \pi i \frac{m k}{N} \right)
$$
and we could compute \(F_k\) for \(k = 0, 1, \ldots, N-1\).&lt;/p&gt;
&lt;p&gt;Here, the discrete index variable \(k\) has replaced the continuous-frequency variable \(s\), just like \(m\) replaced the continuous-time variable \(t\).&lt;/p&gt;
&lt;p&gt;The inverse discrete Fourier transform is
$$
f_m = \frac{1}{N} \sum_{k=0}^{N -1} F_k \exp \left ( 2 \pi i \frac{m k}{N}\right).
$$
Like the continuous-Fourier transform, one of the differences from the forward is the \(+i\) in the exponential. The other is the inclusion of the normalization pre-factor.&lt;/p&gt;
&lt;p&gt;Note: depending on whom you talk to, you&amp;rsquo;ll see a wide variety of conventions as to where the normalization prefactor goes and where the \(2 \pi\) lives. The convention presented here is the same one used by the &lt;a href="https://numpy.org/doc/stable/reference/routines.fft.html#module-numpy.fft"&gt;Python/NumPy package&lt;/a&gt; and the &lt;a href="https://juliamath.github.io/AbstractFFTs.jl/stable/api/#Public-Interface"&gt;Julia/AbstractFFTs.jl&lt;/a&gt; package, so it &lt;em&gt;should&lt;/em&gt; be the one you encounter most frequently.&lt;/p&gt;
&lt;p&gt;Like the continuous Fourier transform, if you take the DFT of a set of samples and then take the iDFT of that, you will end up with the original set of samples.&lt;/p&gt;




&lt;h3 id="units-of-the-dft"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#units-of-the-dft"&gt;Units of the DFT&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The DFT only knows/assumes that it was fed a set of equally spaced samples
$$
f_m = f(x_m)
$$
where
$$
m = 0, 1, \ldots, N - 1.
$$&lt;/p&gt;
&lt;p&gt;So, at its most abstract, the DFT takes in a bunch of \(N\) samples spaced \(\Delta x\) apart and returns \(N\) samples corresponding to the Fourier components. The frequency of each component corresponds is given by \(k/N\) in units of &amp;ldquo;cycles per sampling interval.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;I.e., so if we had \(N = 8\) samples, then the \(k=3\) frequency component returned from the DFT would be equal to \(3/8\) cycles per &amp;ldquo;the interval between samples.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;On its own, the DFT doesn&amp;rsquo;t provide any information about what type of variable \(x\) is or what the spacing is. But there is hope. We can make this concrete, we just have to be careful. Using our example of a time series with \(N=8\) samples \(\{f_m\}\), say we know that \(\Delta x = 0.1\) seconds,
$$
\Delta x = x_{m+1} -x_m
$$&lt;/p&gt;
&lt;p&gt;The spacing in the frequency domain will be 1/8 cycles per 0.1 seconds, or 1.25 Hz.&lt;/p&gt;




&lt;h3 id="dft-as-a-matrix-operation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#dft-as-a-matrix-operation"&gt;DFT as a matrix operation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Thus far we have just been talking about a &amp;ldquo;set&amp;rdquo; of samples. We can also think of the collection of data as a vector
$$
\mathbf{f} =
\begin{bmatrix}
f_0 \\
f_1 \\
\vdots \\
f_{N-1} \\
\end{bmatrix}
$$&lt;/p&gt;
&lt;p&gt;and the frequency samples as a vector too&lt;/p&gt;
&lt;p&gt;$$
\mathbf{F} =
\begin{bmatrix}
F_0 \\
F_1 \\
\vdots \\
F_{N-1} \\
\end{bmatrix}.
$$&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ve now mentioned that the Fourier transform is a linear operator a few times. Another way of demonstrating this is that we could write the DFT as a matrix multiplication
$$
\mathbf{F} = \mathbf{W} \mathbf{f}.
$$&lt;/p&gt;
&lt;p&gt;If we look back at the definition of the DFT
$$
F_k = \sum_{m=0}^{N-1} f_m \exp \left ( - 2 \pi i \frac{m k}{N} \right)
$$
hopefully we can see how this might be cast in matrix form. If we define the quantity
$$
\omega = e^{- 2 \pi i / N}
$$
then we can write
$$
W = \begin{bmatrix}
1 &amp;amp; 1 &amp;amp; 1 &amp;amp; \ldots &amp;amp; 1 \\
1 &amp;amp; \omega &amp;amp; \omega^2 &amp;amp; \ldots &amp;amp; \omega^{N - 1} \\
1 &amp;amp; \omega^2 &amp;amp; \omega^4 &amp;amp; \ldots &amp;amp; \omega^{2(N-1)} \\
\vdots &amp;amp; \vdots &amp;amp; \vdots &amp;amp; \ddots &amp;amp; \vdots \\
1 &amp;amp; \omega^{N-1} &amp;amp; \omega^{2(N-1)} &amp;amp; \ldots &amp;amp; \omega^{(N-1)(N-1)}\\
\end{bmatrix}
$$&lt;/p&gt;
&lt;p&gt;This formulation can be really useful if you&amp;rsquo;re into forward modeling using linear models. Because we can write the DFT as a matrix multiplication, it can essentially be just another linear transformation to your linear model. Determining the &amp;ldquo;best-fit&amp;rdquo; parameters can still be done analytically. We&amp;rsquo;ll talk about this a bit more when we get to the lecture on Bayesian inference.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/04-fft/DFT-graphical.png"
 alt="The elements of the DFT matrix represented as samples of complex exponentials. Credit: Wikipedia/Glogger"&gt;&lt;figcaption&gt;
 &lt;p&gt;The elements of the DFT matrix represented as samples of complex exponentials. Credit: Wikipedia/Glogger&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="fast-fourier-transform-fft"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/04-fft/#fast-fourier-transform-fft"&gt;Fast Fourier Transform (FFT)&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=E8HeD-MUrjY&amp;amp;ab_channel=SteveBrunton"&gt;Youtube/SteveBrunton&lt;/a&gt; on the Fast Fourier Transform&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The Fast Fourier Transform or FFT is an algorithm (class of algorithms) for computing the discrete Fourier transform. Practically speaking, if you&amp;rsquo;re going to perform a Fourier transform on discrete data with a computer, you will almost certainly use an FFT algorithm, so FFT and DFT end up being synonymous.&lt;/p&gt;
&lt;p&gt;If we have a data array of length \(N\), the complexity of the DFT is \(\mathcal{O}(N^2)\), while the complexity of the FFT is only \(\mathcal{O}(N \log N)\). This can make a huge difference in computational time if N is large. Say if you have an array of \(N = 4096\) datapoints, then you could be looking at a factor of 1000 speedup.&lt;/p&gt;
&lt;p&gt;The development and implementation of FFT packages turned the DFT from something that was too slow for many practical applications into a formidable analysis tool. This really enabled the widespread usage of the Fourier transform in signal manipulation and data analysis. I don&amp;rsquo;t think anyone would argue with you that strongly if you said that the FFT is the most important algorithm of the last century &lt;a href="http://pi.math.cornell.edu/~ajt/presentations/TopTenAlgorithms.pdf"&gt;Cornell&amp;rsquo;s top ten algorithms of the 20th century&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;There are many different FFT algorithms out there. The most popular one is the &lt;a href="https://en.wikipedia.org/wiki/Cooley%E2%80%93Tukey_FFT_algorithm"&gt;Cooley-Tukey&lt;/a&gt; algorithm and relies on the factorization of a size \(N\) DFT matrix into \(N_1\) smaller DFTs of sizes \(N_2\) in a recursive manner. Therefore there is a (historical) preference towards array sizes that powers of \(2\). However, there are algorithms out that still give \(\mathcal{O}(N \log N)\) even for prime values of \(N\), so it&amp;rsquo;s not much of a constraint in practice.&lt;/p&gt;
&lt;p&gt;We don&amp;rsquo;t have time to go into much more detail about the FFT algorithm here, but hopefully you can see from the structure of the \(\mathbf{W}\) matrix that there are plenty of opportunities to make the calculation more efficient by factoring and caching values.&lt;/p&gt;
&lt;p&gt;Now, lets look at some code examples of how we might actually use the FFT in Python.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="FFT-lecture.slides.html"&gt;Jupyter Notebook Slides&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>The Fourier Transform II</title><link>https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/</link><pubDate>Mon, 05 Sep 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+3/1_607jh3p1"&gt;Video recording&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="references-for-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#references-for-today"&gt;References for today&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/34517505"&gt;Fourier Analysis and Imaging&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampling_theorem"&gt;Wikipedia on Nyqist-Shannon sampling theorem&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="review-of-last-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#review-of-last-time"&gt;Review &lt;em&gt;of last time&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Defined Fourier transform, inverse
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Note that because \(x\) and \(s\) appear in the argument of the exponential, their product must be dimensionless. This means that they will also have inverse units, e.g., &amp;ldquo;seconds&amp;rdquo; and &amp;ldquo;cycles per second.&amp;rdquo;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Introduced convolution, impluse symbol, and theorems&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="where-are-we-headed-today"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#where-are-we-headed-today"&gt;Where are we headed &lt;em&gt;today&lt;/em&gt;?&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Finish up Fourier transform theorems&lt;/li&gt;
&lt;li&gt;Nyquist sampling theorem&lt;/li&gt;
&lt;li&gt;Discrete Fourier Transform&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="continuing-from-last-time-fourier-transform-theorems"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#continuing-from-last-time-fourier-transform-theorems"&gt;Continuing from last time: &lt;em&gt;Fourier transform theorems&lt;/em&gt;&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="convolutionmultiplication"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#convolutionmultiplication"&gt;Convolution/multiplication&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The convolution of two functions corresponds to the multiplication of their Fourier transforms.&lt;/p&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;and&lt;/p&gt;
&lt;p&gt;$$
g(x) \leftrightharpoons G(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f(x) * g(x) \leftrightharpoons F(s)G(s).
$$&lt;/p&gt;
&lt;p&gt;This is an &lt;em&gt;extremely&lt;/em&gt; useful theorem. At least in my career, this, and concepts related to sampling, have been the ones I have used the most often. You may have already used this theorem (numerically) if you&amp;rsquo;ve ever carried out a convolution operation using &lt;a href="https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.fftconvolve.html"&gt;scipy.signal.fftconvolve&lt;/a&gt; in Python, which can be dramatically faster than directly implementing the convolution, at least for certain array sizes.&lt;/p&gt;




&lt;h3 id="rayleighs-theorem-parsevals-theorem-for-fourier-series"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#rayleighs-theorem-parsevals-theorem-for-fourier-series"&gt;Rayleigh&amp;rsquo;s theorem (Parseval&amp;rsquo;s theorem for Fourier Series)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The amount of energy in a system is the same whether you calculate it in the time domain or in the frequency domain.&lt;/p&gt;
&lt;p&gt;The integral of the mod-squared of a function is equal to the integral of the mod-squared of its spectrum&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty |f(x)|^2\,\mathrm{d}x = \int_{-\infty}^\infty |F(s)|^2\,\mathrm{d}s.
$$&lt;/p&gt;




&lt;h3 id="autocorrelation-theorem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#autocorrelation-theorem"&gt;Autocorrelation theorem&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The autocorrelation of a function is&lt;/p&gt;
&lt;p&gt;$$
f(x) * f(x) = \int_{-\infty}^\infty f^{*}(u) f(u + x)\,\mathrm{d}u
$$&lt;/p&gt;
&lt;p&gt;and it has the Fourier transform
$$
f(x) * f(x) \leftrightharpoons |F(s)|^2.
$$&lt;/p&gt;
&lt;p&gt;Thus, the power spectrum is the Fourier transform of the autocorrelation function. It can also be computed directly by taking the &amp;ldquo;mod-squared&amp;rdquo; of \(F(s)\),&lt;/p&gt;
&lt;p&gt;$$
|F|^2 = F F^*.
$$&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;ve ever worked with (stationary) Gaussian processes (e.g., squared-exponential, Matern, etc&amp;hellip;), you might recognize this relationship between the autocorrelation (the kernel function) and the power spectrum of the Gaussian process.&lt;/p&gt;




&lt;h3 id="the-derivative-theorem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#the-derivative-theorem"&gt;The derivative theorem&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f^\prime(x) \leftrightharpoons i 2 \pi s F(s).
$$&lt;/p&gt;




&lt;h3 id="using-the-transform-pairs"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#using-the-transform-pairs"&gt;Using the transform pairs&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now that we have a few basic transform pairs, and some of the transform theorems, you can mix and match these to build up a library of new transform pairs. You will explore this in the problem set.&lt;/p&gt;




&lt;h3 id="definite-integral"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#definite-integral"&gt;Definite integral&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The zero-valued frequency of a Fourier transform is equal to the definite integral of a function over all space&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty f(x)\,\mathrm{d}x = F(0).
$$&lt;/p&gt;
&lt;p&gt;I.e., to compute the area under the curve, you can just read off the zero-frequency value of the Fourier transform.&lt;/p&gt;




&lt;h3 id="first-moment"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#first-moment"&gt;First moment&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The first moment of \(f(x)\) about the origin is&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty x f(x)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;Using the derivative theorem (not shown), you can determine that the first moment \(\bar{f}\) is&lt;/p&gt;
&lt;p&gt;$$
\bar{f} = \int_{-\infty}^\infty x f(x)\,\mathrm{d}x = \frac{F^\prime(0)}{-2 \pi i}
$$
i.e., the slope of the Fourier transform evaluated at \(s = 0\), times \(-1/2 \pi i\).&lt;/p&gt;




&lt;h3 id="second-moment"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#second-moment"&gt;Second moment&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Second moment (moment of inertia) is given by&lt;/p&gt;
&lt;p&gt;$$
\int_{-\infty}^\infty x^2 f(x)\,\mathrm{d}x = -\frac{1}{4 \pi^2} F^{\prime\prime}(0).
$$&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/second-moment.png"
 alt="Credit: Bracewell Fig 8.4"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 8.4&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Similar arguments extend to variance \(\sigma^2\) as we had mentioned with regards to the uncertainty principle.&lt;/p&gt;




&lt;h2 id="smoothness-and-compactness"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#smoothness-and-compactness"&gt;Smoothness and compactness&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In general, the smoother a function is in the time domain, the more compact its Fourier transform will be in the frequency domain.&lt;/p&gt;
&lt;p&gt;Smoother functions will have a larger number of continuous derivatives. Something like the Gaussian envelope \(\exp(-\pi x^2)\) is &amp;ldquo;as smooth as possible&amp;rdquo; and therefore its Fourier transform (also a Gaussian envelope) is as compact as possible.&lt;/p&gt;




&lt;h2 id="filters-and-transfer-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#filters-and-transfer-functions"&gt;Filters and transfer functions&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="time-domain"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#time-domain"&gt;Time domain&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We can say that we have some (electrical) waveform
$$
V_1(t) = A \cos (2 \pi f t)
$$ which is a single-valued function of time. You can think of this as a voltage time-series or another physical quantity. By definition, the waveform is real.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s put on our electrical/acoustical/mechanical engineering hats for a moment and consider that a &lt;em&gt;filter&lt;/em&gt; is a physical system with an input an and output, e.g., something that is transmitting vibrations or oscillations, like our waveform.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/filter.png"
 alt="How a filter changes the amplitude and phase of an waveform. Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;How a filter changes the amplitude and phase of an waveform. Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If we feed our waveform into a &lt;em&gt;linear&lt;/em&gt; filter, we get output
$$
V_2(t) = B \cos (2 \pi f t + \phi).
$$&lt;/p&gt;
&lt;p&gt;The output is still a waveform, but its amplitude and its phase have changed. These changes are likely frequency dependent, too.&lt;/p&gt;
&lt;p&gt;We can specify the filter by a frequency-dependent quantity \(T(f)\) called the &lt;em&gt;transfer function&lt;/em&gt;. It is a complex-valued function (having both an amplitude and a phase) and is given by
$$
T(f) = \frac{B}{A}e^{i \phi}.
$$&lt;/p&gt;
&lt;p&gt;Interesting, perhaps, but maybe not immediately obviously useful. Let&amp;rsquo;s introduce the &lt;em&gt;spectrum&lt;/em&gt; and then circle back to the transfer function.&lt;/p&gt;




&lt;h3 id="obtaining-v_2-using-the-frequency-domain"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#obtaining-v_2-using-the-frequency-domain"&gt;Obtaining \(V_2\) using the frequency domain&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The &lt;em&gt;spectrum&lt;/em&gt; of the waveform is the Fourier transform of \(V(t)\), which we&amp;rsquo;ll call \(S(f)\) in this section. We&amp;rsquo;ve broken slightly from our \(f \leftrightharpoons F\) notation, but \(V \leftrightharpoons S\) is a classic in the signal processing and electrical engineering fields, so we&amp;rsquo;ll at least build familiarity with it in this example.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;spectrum&amp;rdquo; here is just the Fourier transform quantity, it can definitely be complex-valued.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;spectra&amp;rdquo; that we typically talk about in astrophysics are measurements of the electromagnetic spectrum&amp;mdash;you&amp;rsquo;ve probably never come across as one that&amp;rsquo;s complex-valued, right? What&amp;rsquo;s going on here?&lt;/p&gt;
&lt;p&gt;Consider the &lt;em&gt;units&lt;/em&gt; of a flux measurement \(F_\nu\) of the electromagnetic spectrum. In lecture 1, we covered that the cgs units of flux are
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}.
$$&lt;/p&gt;
&lt;p&gt;The clue is in the \(\mathrm{ergs}\;\mathrm{s}^{-1}\) part, which we could also write in terms of &amp;ldquo;watts&amp;rdquo; if we wanted to be strictly S.I. about it. When we are measuring the electromagnetic spectrum, we are actually measuring the &lt;strong&gt;power&lt;/strong&gt; spectral density, \(|F(\nu)|^2\). The absolute squared means the quantity \(|F(\nu)|^2\) is real-valued, and is the reason why you never hear about measurements of the electromagnetic spectrum containing imaginary values!&lt;/p&gt;
&lt;p&gt;In this course, at least, we&amp;rsquo;ll try to be explicit about which spectrum we&amp;rsquo;re referring to. When we actually mean power spectrum, we&amp;rsquo;ll try to call it as such. Otherwise, &amp;ldquo;spectrum&amp;rdquo; will refer to a quantity like \(S\).&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s revisit our filter example, where we had input and output \(V_1(t)\) and \(V_2(t)\) respectively. From our discussion, we also have&lt;/p&gt;
&lt;p&gt;$$
V_1 \leftrightharpoons S_1
$$
and
$$
V_2 \leftrightharpoons S_2.
$$&lt;/p&gt;
&lt;p&gt;The transfer function concept is &lt;em&gt;especially&lt;/em&gt; useful when we think about the &lt;em&gt;spectrum&lt;/em&gt; of the waveforms, because we have
$$
S_2(f) = T(f) S_1(f).
$$&lt;/p&gt;
&lt;p&gt;I.e., the spectrum of the output waveform is simply the spectrum of the input waveform &lt;em&gt;multiplied&lt;/em&gt; by the transfer function.&lt;/p&gt;
&lt;p&gt;Once you have \(S_2\), then you can get \(V_2(t)\) from
$$
V_2 \leftrightharpoons S_2.
$$&lt;/p&gt;
&lt;p&gt;Two examples are low-pass and high-pass filters.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/Filter_%28signal_processing"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/filter-pass.png"
 alt="Examples of different filter transfer functions \(T(f)\). Credit: Wikipedia/SpinningSpark"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Examples of different filter transfer functions \(T(f)\). Credit: Wikipedia/SpinningSpark&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="obtaining-v_2-using-the-time-domain"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#obtaining-v_2-using-the-time-domain"&gt;Obtaining \(V_2\) using the time domain&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s think of digital signal processing, where you wanted to practically apply a filter to some waveform to produce a new waveform. As we just outlined, you could acquire \(V_1(t)\), Fourier transform it to access its spectrum \(S_1(f)\), multiply by the transfer function \(T_(f)\), and then do the inverse Fourier transform to get \(V_2(t)\). Is there a way to do this &lt;em&gt;directly&lt;/em&gt; in the time domain? What if you don&amp;rsquo;t have the complete waveform all at once?&lt;/p&gt;
&lt;p&gt;The answer is provided by the convolution theorem for Fourier transforms. Since the transfer function is applied via a multiplication in the Fourier domain, we could equivalently carry out the same operation by a convolution in the time domain.&lt;/p&gt;
&lt;p&gt;The convolutional kernel would be
$$
I(t) \leftrightharpoons T(f)
$$
and we&amp;rsquo;d have
$$
V_2(t) = I(t) * V_1(t).
$$&lt;/p&gt;
&lt;p&gt;To summarize,
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/square.png"
 alt="Credit Bracewell, Chapter 9."&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit Bracewell, Chapter 9.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;




&lt;h3 id="determining-it"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#determining-it"&gt;Determining \(I(t)\)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If you had some system already in place, and you wanted to determine \(I(t)\) experimentally, what is one way you could do it? What waveform could you send the system?&lt;/p&gt;
&lt;p&gt;One simple option would be to send
$$
V_1(t) = \delta(t),
$$
then
$$
V_2(t) = I(t) * \delta(t) = I(t).
$$&lt;/p&gt;




&lt;h2 id="nyquist-shannon-sampling"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#nyquist-shannon-sampling"&gt;Nyquist-Shannon sampling&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=FcXZ28BX-xE&amp;amp;ab_channel=SteveBrunton"&gt;Youtube/SteveBrunton&lt;/a&gt; on The Sampling Theorem&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Thus far we have been talking about continuous functions. As astrophysicists, though, we&amp;rsquo;re frequently dealing with discrete data points, which are presumed to be &lt;em&gt;samples&lt;/em&gt; of some unknown function. Maybe you&amp;rsquo;re the one designing the experiment to capture these data points, or maybe you&amp;rsquo;ve just been handed some dataset.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/lines.png"
 alt="Say you are given a set of (noisless) samples that look like this. What do you think the function should look like in between the points? Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;Say you are given a set of (noisless) samples that look like this. What do you think the function should look like in between the points? Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Concisely put, the &lt;em&gt;sampling theorem&lt;/em&gt; states that under a certain condition, a function can be &lt;em&gt;completely&lt;/em&gt; reconstructed from a set of discrete samples&amp;mdash;without information loss. I.e., the set of discrete samples is &lt;em&gt;fully equivalent&lt;/em&gt; to having access to the full set of function values. Today, this sampling theorem is known as the Nyquist-Shannon sampling theorem, the Whittaker–Nyquist–Shannon theorem, or simply &amp;ldquo;the sampling theorem.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;If we were to try to use these data points to actually reconstruct a function, then what sort of constraint would we need to impose on the function? We&amp;rsquo;d want to place some constraint on its spectrum, i.e., that there are no higher frequency components oscillating around faster than our sampling points.&lt;/p&gt;
&lt;p&gt;Before we dive into the derivation of the sampling theorem, let&amp;rsquo;s first take another look at what can go wrong when you undersample a time series.&lt;/p&gt;




&lt;h3 id="aliasing"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#aliasing"&gt;Aliasing&lt;/a&gt;
&lt;/h3&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/sine-wave.png"
 alt="If the true signal is given by the solid line, but we undersample it, then the sine-wave we naively reconstruct from the samples would have the wrong frequency. Here we would say that the higher frequency signal has been aliased into the lower frequency range. Credit: Wikipedia Pluke"&gt;&lt;figcaption&gt;
 &lt;p&gt;If the true signal is given by the solid line, but we &lt;em&gt;undersample&lt;/em&gt; it, then the sine-wave we naively reconstruct from the samples would have the wrong frequency. Here we would say that the higher frequency signal has been aliased into the lower frequency range. Credit: Wikipedia Pluke&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;This is also called the Stroboscopic effect. There are some nice examples online:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=dNVtMmLlnoE&amp;amp;ab_channel=SmarterEveryDay"&gt;Youtube/SmarterEveryDay&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.youtube.com/watch?v=smDpCsVVgPA&amp;amp;ab_channel=Edyourself"&gt;Stationary helicopter&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Exoplanet transits! &lt;a href="https://ui.adsabs.harvard.edu/abs/2010ApJ...722..937D/abstract"&gt;Dawson and Fabrycky 2010&lt;/a&gt; find a shorter period for the exoplanet 55 Cnc e, previously confounded by the timing of the RV observations.&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="derivation-of-sampling-theorem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#derivation-of-sampling-theorem"&gt;Derivation of sampling theorem&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now, let&amp;rsquo;s use our understanding of Fourier transforms and the sampling/replicating function to develop a precise formulation of the sampling theorem.&lt;/p&gt;
&lt;p&gt;Recall that the &amp;ldquo;shah&amp;rdquo; function is an infinite series of delta functions spaced a unit dimension apart, and that it is its own Fourier transform
$$
\mathrm{shah}(x) \leftrightharpoons \mathrm{shah}(s).
$$&lt;/p&gt;
&lt;p&gt;Via the similarity theorem, if the delta functions of the shah get closer in the \(x\) domain, then they spread out in the Fourier domain, and vice-versa.&lt;/p&gt;
&lt;p&gt;We can adjust the spacing of the samples by dilating or shrinking the shah function by some factor. Here, we&amp;rsquo;ll write this as the sampling interval \(\Delta x = \tau\) or the sampling frequency \(1/\tau\).&lt;/p&gt;
&lt;p&gt;According to the similarity theorem, adjusting the sampling frequency in the \(x\) domain has the following effect in the frequency domain
$$
\mathrm{shah}(x/\tau) \leftrightharpoons \tau \mathrm{shah}(\tau s).
$$&lt;/p&gt;
&lt;p&gt;For example, if \(\tau = 0.2\), then we have&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/panel-b.png"
 alt="The shah function is its own Fourier transform. Via the similarity theorem, if we compress the shah function in the time domain (left), we expand it in the Fourier domain (right). Credit: Bracewell, Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;The shah function is its own Fourier transform. Via the similarity theorem, if we compress the shah function in the time domain (left), we expand it in the Fourier domain (right). Credit: Bracewell, Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now let&amp;rsquo;s consider a function and its Fourier transform&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/function.png"
 alt="A generic function (left) and its Fourier transform (right). We say that this function is &amp;lsquo;band-limited&amp;rsquo; because its Fourier transform is 0 for all frequencies below some cutoff frequency \(|s| &amp;lt; s_c\). Credit: Bracewell Fig 10.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;A generic function (left) and its Fourier transform (right). We say that this function is &amp;lsquo;band-limited&amp;rsquo; because its Fourier transform is 0 for all frequencies below some &lt;em&gt;cutoff frequency&lt;/em&gt; \(|s| &amp;lt; s_c\). Credit: Bracewell Fig 10.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;As before, we will use multiplication by the shah function to represent sampling of the function.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/sampled-function.png"
 alt="Left: the sampled version of \(f\), which has the Fourier transform on the right. So long as the sampling frequency exceeds twice the cutoff frequency, the Fourier transform &amp;lsquo;islands&amp;rsquo; do not overlap (top two rows). Credit: Bracewell Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Left: the sampled version of \(f\), which has the Fourier transform on the right. So long as the sampling frequency exceeds twice the cutoff frequency, the Fourier transform &amp;lsquo;islands&amp;rsquo; do not overlap (top two rows). Credit: Bracewell Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The non-overlappingness of the &amp;lsquo;islands&amp;rsquo; is the key to properly sampling a function, and we&amp;rsquo;ll see why in a moment when we talk about reconstruction. But first, let&amp;rsquo;s make a quantitative statement of the sampling theorem (Bracewell):&lt;/p&gt;
&lt;p&gt;If \(s_c\) is the cutoff frequency defining the band-limited nature of the signal, then so long as the function is sampled at equal intervals not exceeding \(\Delta x = 1/(2 s_c)\) then the function is properly sampled, i.e.
$$
\frac{1}{\tau} \geq 2 s_c.
$$&lt;/p&gt;




&lt;h3 id="restoration-of-signal-kernels"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#restoration-of-signal-kernels"&gt;Restoration of signal kernels&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s talk about how we would actually reconstruct the continuous function from a set of samples. Let&amp;rsquo;s re-examine our plot of the Fourier domain&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/reconstruction.png"
 alt="Credit: Bracewell Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The &amp;ldquo;function&amp;rdquo; on the left is technically not the same (continuous) function that we started with, it is a discrete representation of it. We did just say, though, that if the function was band-limited, then these samples contained all of the same information as if we had access to the full function. So how do we go from these samples back to the full function?&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s look at the Fourier side of this plot and compare it to the original Fourier side. The main difference is that &lt;em&gt;this&lt;/em&gt; Fourier plot has repeating &amp;lsquo;islands&amp;rsquo; at progressively higher frequencies, essentially to infinity. How can we get rid of these higher frequency islands?&lt;/p&gt;
&lt;p&gt;The answer is to multiply by a boxcar function in Fourier domain, completely truncating these higher order terms. Then, we can do the inverse Fourier transform and recover the original, continuous function.&lt;/p&gt;
&lt;p&gt;What is the analogous operation for the time-domain? This is the same thing as we discussed with the transfer function. Since it was a multiplication in the Fourier domain, it is a convolution in the time domain. And the convolutional kernel is the Fourier transform of the boxcar, which is a sinc function.&lt;/p&gt;
&lt;p&gt;So, to &lt;em&gt;exactly&lt;/em&gt; reconstruct a band-limited function from a set of samples, we do sinc-interpolation.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.dsprelated.com/freebooks/pasp/Windowed_Sinc_Interpolation.html"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/sinc-interpolation.png"
 alt="Credit: DSP related."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: DSP related.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="undersampling-and-aliasing"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#undersampling-and-aliasing"&gt;Undersampling and aliasing&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If we didn&amp;rsquo;t sample the function at a sufficiently high rate, then we would have overlapping islands. Essentially, the higher frequency components of the Fourier islands are &amp;ldquo;folded-over&amp;rdquo; back into the range of frequencies &lt;em&gt;we thought&lt;/em&gt; was band-limited, resulting in a corrupted signal.&lt;/p&gt;
&lt;p&gt;In an alias, a higher frequency signal is masquerading as a lower-frequency signal.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/aliased.png"
 alt="Credit: Bracewell Fig 10.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Bracewell Fig 10.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="compressed-sensing"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#compressed-sensing"&gt;Compressed sensing&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;You may have heard of &amp;ldquo;compressed sensing,&amp;rdquo; which is one of major signal processing results of the last few decades. The idea is that you can reconstruct a functional form using far fewer samples than required for the Nyquist rate, using some dictionary of functional forms, or knowledege that the signal may be sparse. You can, indeed, perfectly reconstruct the signal through optimization using the \(L_1\) norm. If you don&amp;rsquo;t want to make the assumption that your signal is sparse, though, it&amp;rsquo;s a good idea to sample at the Nyquist rate.&lt;/p&gt;




&lt;h2 id="fourier-series"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#fourier-series"&gt;Fourier series&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;You probably first encountered Fourier series as part of your calculus course and later on as part of a partial differential equations course.
Say we have some periodic function \(g(x)\), then the Fourier series associated with this is
$$
a_0 + \sum_1^\infty (a_n \cos 2 \pi n f x + b_n \sin 2 \pi n f x)
$$
where the Fourier coefficients are determined by
$$
a_0 = \frac{1}{T} \int_{-T/2}^{T/2} g(x) \,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
a_n = \frac{1}{T} \int_{-T/2}^{T/2} g(x) \cos 2 \pi n f x \,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
b_n = \frac{1}{T} \int_{-T/2}^{T/2} g(x) \sin 2 \pi n f x \,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;i.e., we&amp;rsquo;ve projected the function onto its basis set of sines and cosines.&lt;/p&gt;
&lt;p&gt;Already, I&amp;rsquo;m sure you are starting to see the close connection with what we&amp;rsquo;ve discussed of the Fourier transform. Traditionally, Fourier series are used as a jumping off point for the discussion of the Fourier transform.&lt;/p&gt;
&lt;p&gt;In the last lecture, however, we signaled our intention to take the opposite approach, whereby we skipped over Fourier series and &lt;em&gt;started&lt;/em&gt; with the idea that Fourier transforms exist because we observe physical systems which exhibit their behavior. Now, let&amp;rsquo;s unify the discussion and demonstrate the Fourier series as an extreme situation of the Fourier transform.&lt;/p&gt;




&lt;h3 id="fourier-transform-of-sine-and-cosine"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/#fourier-transform-of-sine-and-cosine"&gt;Fourier transform of sine and cosine&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s put together a number of the theorems that we&amp;rsquo;ve discussed to build up our understanding of what a Fourier series looks like in the Fourier domain.&lt;/p&gt;
&lt;p&gt;In the previous lecture we introduced the Fourier transform theorem for a delta function located at the origin&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \delta(0) \exp (-i 2 \pi x s)\,\mathrm{d}x = 1
$$
which is a constant.&lt;/p&gt;
&lt;p&gt;We also introduced the shift theorem, which says
$$
f(x - a) \leftrightharpoons \exp(- 2 \pi i a s) F(s).
$$&lt;/p&gt;
&lt;p&gt;We can couple these together and write a relationship
$$
\delta(x - a) \leftrightharpoons \exp(-2 \pi i a s).
$$&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s see if we can use this Fourier pair to derive the Fourier pairs for cosine and sine. In your quantum physics or partial differential equations classes, you probably used the Euler identity to write cosine and sine like
$$
\cos 2 \pi a s = \frac{e^{i 2 \pi a s} + e^{-i 2 \pi a s}}{2}
$$
and
$$
\sin 2 \pi a s = \frac{e^{i 2 \pi a s} - e^{-i 2 \pi a s}}{2i}
$$&lt;/p&gt;
&lt;p&gt;So we can do a bit of rearranging and arrive at&lt;/p&gt;
&lt;p&gt;$$
\cos \pi x \leftrightharpoons \mathrm{even}(x) = \frac{1}{2}\delta \left (x + \frac{1}{2} \right) + \frac{1}{2}\delta \left (x - \frac{1}{2} \right)
$$
and
$$
\sin \pi x \leftrightharpoons i\mathrm{odd}(x) = i \frac{1}{2}\delta \left (x + \frac{1}{2}\right) - i \frac{1}{2}\delta \left (x - \frac{1}{2}\right).
$$&lt;/p&gt;
&lt;p&gt;where these symbols are the even and odd impulse pairs.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/even-odd-impulse.png"
 alt="The Fourier transform pairs of cosine and sine as the even and odd impulse pairs, respectively. Credit: Bracewell Fig 6.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;The Fourier transform pairs of cosine and sine as the even and odd impulse pairs, respectively. Credit: Bracewell Fig 6.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now we&amp;rsquo;re well on our way to determining the Fourier spectrum of a Fourier series. The Fourier series is just a sum of sines and cosines at different frequencies. The Fourier transform is a linear operator, so we can just add together the contributions from each component in the Fourier domain
$$
a_0 + \sum_1^\infty (a_n \cos 2 \pi n f x + b_n \sin 2 \pi n f x)
$$&lt;/p&gt;
&lt;p&gt;We arrive at the result that the spectrum of a Fourier series is a collection of delta functions whose locations and amplitudes correspond to the frequencies and values of the Fourier coefficients, respectively.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.tutorialspoint.com/what-is-fourier-spectrum-theory-and-example#:~:text=The%20graph%20plotted%20between%20the,spectrum%20of%20a%20periodic%20signal.&amp;amp;text=Amplitude%20Spectrum%20%E2%88%92%20The%20amplitude%20spectrum,of%20Fourier%20coefficients%20versus%20frequency"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/line-spectra.png"
 alt="The amplitude of the line spectra corresponding to a Fourier series. Credit: TutorialsPoint"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The amplitude of the line spectra corresponding to a Fourier series. Credit: TutorialsPoint&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now that we&amp;rsquo;ve derived the Fourier spectrum of a Fourier series, we can see at least two reasons why this represents an extreme situation of the Fourier transform:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The input waveform is strictly periodic&lt;/li&gt;
&lt;li&gt;The input waveform is infinite in duration&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;As we talked about in the last lecture, these conditions are violated in the real world. If we limit the duration of the sine wave to a finite duration (say, by multiplication of a truncated Gaussian (because technically the Gaussian is also non-zero over an infinite domain), which we call a &lt;em&gt;window function&lt;/em&gt;), then we see what happens in the Fourier domain: the delta functions are broadened by convolution with the Fourier transform of the window function.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/03-fourier-transform-ii/broadening-line.png"
 alt="Left: the dotted line represents a broad window function to eventually make the waveform finite in duration. Right: this has the effect of broadening the delta functions by convolution with the Fourier transform of the window function. Credit: Bracewell Fig 10.12"&gt;&lt;figcaption&gt;
 &lt;p&gt;Left: the dotted line represents a broad window function to eventually make the waveform finite in duration. Right: this has the effect of broadening the delta functions by convolution with the Fourier transform of the window function. Credit: Bracewell Fig 10.12&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
</description></item><item><title>The Fourier Transform I</title><link>https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/</link><pubDate>Tue, 30 Aug 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+2/1_xc62zimg"&gt;Zoom recording&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Reference Materials for this lecture&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson, particularly Appendix 2.1&lt;/li&gt;
&lt;li&gt;&lt;a href="https://catalog.libraries.psu.edu/catalog/34517505"&gt;Fourier Analysis and Imaging&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Useful (and entertaining!) introductions to the topic&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Youtube: &lt;a href="https://www.youtube.com/watch?v=spUNpyF58BY&amp;amp;ab_channel=3Blue1Brown"&gt;3Blue1Brown&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="complex-numbers"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#complex-numbers"&gt;Complex Numbers&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://en.wikipedia.org/wiki/Complex_number#Polar_complex_plane"&gt;Wikipedia Reference&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;A complex number \(z\) is one in the form of \(z = a + b i\), where \(i\) is the imaginary unit. The imaginary unit satisfies the equation
$$
i^2 = -1.
$$&lt;/p&gt;
&lt;p&gt;So a complex number has both real (\(a\)) and imaginary (\(b\)) components to it. We can represent this as a plot on the Cartesian plane:&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/File:Complex_number_illustration.svg"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/complex-cartesian.png"
 alt="Representation of a complex number on the Cartesian plane. Credit: Wolfkeeper"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Representation of a complex number on the Cartesian plane. Credit: Wolfkeeper&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Alternatively, we can also represent a complex number on the polar plane, using an amplitude \(r\) and phase \(\varphi\):&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/File:Complex_number_illustration_modarg.svg"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/complex-polar.png"
 alt="Representation of a complex number on the polar plane. Credit: Kan8eDie, based on Wolfkeeper"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Representation of a complex number on the polar plane. Credit: Kan8eDie, based on Wolfkeeper&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;$$
z = r e^{i \varphi} = r (\cos \varphi + i \sin \varphi).
$$&lt;/p&gt;
&lt;p&gt;This is closely related to Euler&amp;rsquo;s formula
$$
e^{i x} = \cos x + i \sin x
$$
for a complex sinusoid. It&amp;rsquo;s useful to keep in mind Euler&amp;rsquo;s identity:
$$
e^{i \pi} + 1 = 0.
$$&lt;/p&gt;
&lt;p&gt;It&amp;rsquo;s possible to convert from the Cartesian form to polar form and vice-versa:
$$
r = |z| = \sqrt{a^2 + b^2}
$$
and
$$
\varphi = \arg(z) = \arg(a + bi)
$$
which is most easily carried out using the &lt;code&gt;arctan2&lt;/code&gt; function, to avoid quadrant ambiguities.
$$
\varphi = \mathrm{arctan2}(b, a).
$$&lt;/p&gt;
&lt;p&gt;You can go from polar back to Cartesian by writing
$$
z = r e^{i \varphi} = r (\cos \varphi + i \sin \varphi)
$$
and then doing
$$
a = \Re(z) = r \cos \varphi
$$
and
$$
b = \Im(z) = r \sin \varphi.
$$&lt;/p&gt;




&lt;h2 id="the-fourier-transform"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#the-fourier-transform"&gt;The Fourier Transform&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;References include a mix of&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ch 2 of &lt;a href="https://catalog.libraries.psu.edu/catalog/2010095"&gt;The Fourier Transform and its Applications&lt;/a&gt; by R. Bracewell&lt;/li&gt;
&lt;li&gt;Appendix 2.1 of &lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;Interferometry and Synthesis in Radio Astronomy&lt;/a&gt; by Thompson, Moran, and Swenson&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;First, we&amp;rsquo;ll introduce the equations and then explain what&amp;rsquo;s going on.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forward transform&lt;/strong&gt;:
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$
also called the &amp;ldquo;minus-\(i\)&amp;rdquo; transform.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inverse transform&lt;/strong&gt;:
$$
f(x) = \int_{-\infty}^{\infty} F(s) e^{i 2 \pi x s}\,\mathrm{d}s
$$
also called the &amp;ldquo;plus-\(i\)&amp;rdquo; transform.&lt;/p&gt;
&lt;p&gt;Note that there are alternate conventions for the Fourier transform pairs, which vary as to whether the \(2 \pi\) factor appears in the exponent or as a pre-factor. We prefer the notation we&amp;rsquo;ve provided here because we find it much easier to keep track of \(2\pi\) factors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Successive transforms&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;We can show that
$$
f(x) = \int_{-\infty}^{\infty} \left [ \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x \right ] e^{i 2 \pi x s}\,\mathrm{d}s,
$$
i.e., successive transformations yield back the original function. We don&amp;rsquo;t have time to walk through the proof, but it&amp;rsquo;s available in &lt;a href="https://catalog.libraries.psu.edu/catalog/20789467"&gt;TMS Section A2.1&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Therefore, we have
$$
f(x) \leftrightharpoons F(s),
$$
where the \(\leftrightharpoons\) denotes a Fourier transform pair. Sometimes you might also see the notation \(\leftrightarrow\). Generally, if we have functions like \(f\) or \(g\), then we denote their Fourier transform pairs with captial letters (e.g., \(F\), \(G\))).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How to think about the domains&lt;/strong&gt;
The Fourier transform maps domains from \(x\) to \(s\), where \(s\) has units of &amp;ldquo;cycles per unit of \(x\)&amp;rdquo;. E.g., if \(x\) is in units of time (seconds), then \(s\) has units of cycles/second, commonly known as hertz. The FT can also be applied to spatial distance coordinates (e.g., \(x\) in meters) and spatial coordinates on the sky (e.g., \(x\) in arcseconds).&lt;/p&gt;
&lt;p&gt;So let&amp;rsquo;s say someone is playing a chord on a musical instrument. If \(f(x)\) represents the time-series of pressure values near your ear, then \(F(s)\) represents the notes that are being played (we would probably be most interested in something like \(|F(s)|^2\) for power as a function of frequency, like a spectrogram).&lt;/p&gt;
&lt;p&gt;&lt;a href="https://musiclab.chromeexperiments.com/spectrogram/"&gt;Musical Spectrogram link&lt;/a&gt;&lt;/p&gt;




&lt;h3 id="conditions-on-existence"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#conditions-on-existence"&gt;Conditions on existence&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Given the strong evidence via physical systems that the Fourier transform of particular time-series exists (e.g., spectrums for waveforms, antenna radiation patterns), there are actually some mathematical functions whose Fourier transforms do not exist.&lt;/p&gt;
&lt;p&gt;Physical possibility (in physical systems) is a sufficient condition for the existence of its transform.&lt;/p&gt;
&lt;p&gt;Sometimes, though, we consider waveforms like&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(\sin(t)\): harmonic wave&lt;/li&gt;
&lt;li&gt;\(H(t)\): Heaviside step&lt;/li&gt;
&lt;li&gt;\(\delta(t)\): impulse&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Strictly speaking, none of these functions has a Fourier transform. What do we mean that they do not have Fourier transforms?
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$
does not converge for all values of \(s\). Consider \(\sin(t)\) integrated from \((-\infty, \infty)\)&amp;hellip; it&amp;rsquo;ll just keep oscillating about 0.&lt;/p&gt;
&lt;p&gt;None of these waveforms are actually physically possible, though, because a waveform \(\sin(t)\) would have need to been switched on an infinite time ago, a step function would need to be maintained for an infinite time, and an impulse would need to be infinitely large for an infinitely short time.&lt;/p&gt;
&lt;p&gt;So, what are the conditions for the existence of Fourier transforms.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The integral of \(|f(x)|\) from \(-\infty\) to \(\infty\) exists.&lt;/li&gt;
&lt;li&gt;Any discontinuities in \(f(x)\) are finite.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;In a physical circumstance, these conditions would be violated when there is infinite energy.&lt;/p&gt;




&lt;h3 id="transforms-in-the-limit"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#transforms-in-the-limit"&gt;Transforms in the limit&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Though we just outlined functional forms whose Fourier transforms do not &lt;em&gt;strictly&lt;/em&gt; exist, we can find a way to think such that the transforms of these functions do exist in a practical sense. This is by considering them &lt;em&gt;in the limit&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Consider a periodic function whose transform would strictly not exist, such as \(P(x) = \sin(x) \), since
$$
\int_{-\infty}^\infty |P(x)|\,\mathrm{d}x
$$
would not converge.&lt;/p&gt;
&lt;p&gt;We could modify this function ever so slightly by multiplication with a &lt;em&gt;very&lt;/em&gt; broad Gaussian envelope \(\exp(-\alpha x^2)\), where \(\alpha\) is a small positive number, then this modified version may have a transform. As we let \(\alpha \rightarrow 0\), then we approach \(P\) in the limit.
$$
\int_{-\infty}^\infty |e^{-\alpha x^2} P(x)|\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;As \(\alpha \rightarrow 0\) the transform may still not exist for all \(s\), however, we can still be quite productive with the sequence of transforms that do exist as we approach this limit. Therefore, we can practically use the Fourier transform for all physical systems that we might consider. We&amp;rsquo;ll revisit this in more detail when we talk about Fourier series, line spectra, and the sampling theorem.&lt;/p&gt;




&lt;h3 id="oddness-evenness-and-complex-conjugates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#oddness-evenness-and-complex-conjugates"&gt;Oddness, Evenness, and Complex Conjugates&lt;/a&gt;
&lt;/h3&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/even-odd.png"
 alt="An even function \(E(x)\) and an odd function \(O(x)\), followed by their sum. Credit: The Fourier Transform and Its Applications, Bracewell, Figs 2.2 and 2.3."&gt;&lt;figcaption&gt;
 &lt;p&gt;An even function \(E(x)\) and an odd function \(O(x)\), followed by their sum. Credit: The Fourier Transform and Its Applications, Bracewell, Figs 2.2 and 2.3.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;&lt;strong&gt;Even functions&lt;/strong&gt; have
$$
E(-x) = E(x)
$$
and are symmetrical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Odd functions&lt;/strong&gt; have
$$
O(-x) = -O(x)
$$
and are antisymmetrical.&lt;/p&gt;
&lt;p&gt;The sum of even and odd functions is, in general, neither even nor odd.&lt;/p&gt;
&lt;p&gt;Any function \(f(x)\) can be split unambiguously into it&amp;rsquo;s odd and even parts, though, where
$$
E(x) = \frac{1}{2} \left [ f(x) + f(-x) \right]
$$
and
$$
O(x) = \frac{1}{2} \left [ f(x) - f(-x) \right]
$$
and so we have
$$
f(x) = E(x) + O(x),
$$
where both \(E\) and \(O\) are in general complex-valued.&lt;/p&gt;
&lt;p&gt;Evenness and oddness are very useful because we can these definitions to re-write the Fourier transform as
$$
F(s) = 2 \int_0^\infty E(x) \cos(2 \pi x s)\,\mathrm{d}x - 2 i \int_0^\infty O(x) \sin (2 \pi x s)\,\mathrm{d}x.
$$&lt;/p&gt;
&lt;p&gt;We see that&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;If a function is even, its transform is even&lt;/li&gt;
&lt;li&gt;If a function is odd, its transform is odd&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;and other results summarized by Bracewell in Chapter 2:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/ft-relationships.png"
 alt="If a function has the characteristics in the left column, then its Fourier transform has the characteristics in the right column. Credit: Bracewell Ch. 2"&gt;&lt;figcaption&gt;
 &lt;p&gt;If a function has the characteristics in the left column, then its Fourier transform has the characteristics in the right column. Credit: Bracewell Ch. 2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;These relationships are extremely useful for quickly ascertaining the basic nature of the Fourier transform of any function you may encounter, as well as planning how to numerically compute transforms using &lt;code&gt;fft&lt;/code&gt; or &lt;code&gt;rfft&lt;/code&gt; packages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Complex conjugates&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If we have the complex conjugate of function \(f(x)\), denoted by \(f^*(x)\), then we have&lt;/p&gt;
&lt;p&gt;$$
f^{*}(x) \leftrightharpoons F^{*}(-s)
$$&lt;/p&gt;




&lt;h2 id="transforms-of-some-simple-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#transforms-of-some-simple-functions"&gt;Transforms of some simple functions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s practice by taking the Fourier transforms of some functions.&lt;/p&gt;




&lt;h3 id="boxcar"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#boxcar"&gt;Boxcar&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The rectangle or &amp;ldquo;boxcar&amp;rdquo; function is
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/rectangle.png"
 alt="The rectangle, or boxcar function. Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;The rectangle, or boxcar function. Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s calculate the Fourier transform. The function itself is simple, so this is mainly an exercise in choosing the right limits
$$
F(s) = \int_{-\infty}^{\infty} f(x) e^{-i 2 \pi x s}\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-1/2}^{1/2} e^{-i 2 \pi x s}\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;We can use Euler&amp;rsquo;s formula to write
$$
F(s) = \int_{-1/2}^{1/2} \cos(2 \pi x s) + i \sin(2 \pi x s) \,\mathrm{d}x
$$
and visually seen that the \(\sin\) term would eventually cancel itself out, or we could have relied upon the fact that we know \(\Pi(x)\) is an even function (\(O(x) = 0\)) and used
$$
F(s) = 2 \int_0^\infty E(x) \cos(2 \pi x s)\,\mathrm{d}x - 2 i \int_0^\infty O(x) \sin (2 \pi x s)\,\mathrm{d}x.
$$
to yield
$$
F(s) = 2 \int_0^{1/2} \cos(2 \pi x s)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = 2 \Big |_0^{1/2} \frac{\sin 2 \pi x s}{2 \pi s} = \frac{\sin \pi s}{\pi s} = \mathrm{sinc}(s)
$$&lt;/p&gt;
&lt;p&gt;Note that we (and Bracewell) define
$$
\mathrm{sinc}(s) = \frac{\sin \pi s}{\pi s}
$$
this is called the normalized sinc function, and (IMO) is the most useful because of it&amp;rsquo;s nice Fourier pair relationships. There is also the &amp;ldquo;unnormalized&amp;rdquo; sinc function, which doesn&amp;rsquo;t have the factors of \(\pi\) in it, but we won&amp;rsquo;t use that in this course. The &lt;strong&gt;normalized sinc function&lt;/strong&gt; has the properties that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;its peak is equal to 1 \(\mathrm{sinc}(0) = 1\)&lt;/li&gt;
&lt;li&gt;its &amp;ldquo;nulls&amp;rdquo; are located at non-zero integer values of \(n\) for \(\mathrm{sinc}(n)\)&lt;/li&gt;
&lt;li&gt;its integral from \(-\infty,\infty\) is equal to 1&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So we have the Fourier pair:
$$
\Pi(x) \leftrightharpoons \mathrm{sinc}(s)
$$
The unit rectangle (or boxcar) function has the Fourier transform pair of a normalized sinc function.&lt;/p&gt;
&lt;p&gt;This is also the same relationship that we introduced in the first lecture: the far field electric field pattern (sinc) is the Fourier transform of the electric field illuminating the aperture of the telescope (boxcar).&lt;/p&gt;




&lt;h3 id="gaussian"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#gaussian"&gt;Gaussian&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;How about the Fourier transform of a Gaussian function?
$$
f(x) = \exp \left ( -\frac{x^2}{2 a ^2} \right )
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \exp \left ( -\frac{x^2}{2 a ^2} \right ) \exp (-i 2 \pi x s)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \exp \left ( -\frac{x^2}{2 a ^2} -i 2 \pi x s \right )\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;Usually, when I see something like this, my standard approach is to start browsing books of integrals like &lt;a href="https://en.wikipedia.org/wiki/Gradshteyn_and_Ryzhik"&gt;Gradshteyn and Ryzhik&lt;/a&gt; for ideas about how I might rearrange the integrand and successfully evaluate the integral. I&amp;rsquo;d usually just go for this. But, in this case, we can actually do something using a trick you probably learned in jr. high school, called &lt;a href="https://www.mathsisfun.com/algebra/completing-square.html"&gt;completing the square&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Ok, so we&amp;rsquo;ve got terms in the exponent with \(x^2\) and \(x\), and an equation that looks like
$$
a x^2 + bx + c = 0
$$
which we want to turn into something that looks like
$$
a(x + d)^2 + e = 0.
$$&lt;/p&gt;
&lt;p&gt;The answer is that
$$
d = \frac{b}{2a}
$$
and
$$
e = c - \frac{b^2}{4 a}
$$
such that our rearranged exponent becomes
$$
-\left [ \frac{x^2}{2 a ^2} + i 2 \pi x s \right ] = -[(x - 2i \pi a^2 s)^2 + 4 \pi^2 a^4 s^2]/2a^2
$$
Why was this useful? Well, the integral is over \(x\), so we can pull out all terms that do not depend on \(x\), giving us a rearranged integral of
$$
F(s) = \exp(-2 \pi^2 a^2 s^2 ) \int_{-\infty}^\infty \exp \left (-\frac{(x - 2 i \pi a^2 s)^2}{2 a^2} \right)\,\mathrm{d}x
$$&lt;/p&gt;
&lt;p&gt;Here it is helpful to remember your Gaussian integration formulas such that
$$
\int_{-\infty}^\infty e^{-a(x + b)^2}\,\mathrm{d}x = \sqrt{\frac{\pi}{a}}.
$$&lt;/p&gt;
&lt;p&gt;Thus, the integral contributes another factor of \(\sqrt{2 \pi} a\) and the final result is&lt;/p&gt;
&lt;p&gt;$$
F(s) = \sqrt{2 \pi} a \exp(-2 \pi^2 a^2 s^2 )
$$&lt;/p&gt;
&lt;p&gt;What functional form is this? This is another Gaussian, though the normalization and standard deviation are a bit different! So, we see that the Gaussian function is a Fourier transform pair with itself.&lt;/p&gt;
&lt;p&gt;Hopefully now you have a taste of how to compute Fourier transforms. At its most basic, it&amp;rsquo;s just a matter of setting up and evaluating the integral. For many function forms, you can use integration strategies to arrive at analytic solutions. As a practical matter, in the next lecture we&amp;rsquo;ll see how we can evaluate this integral numerically.&lt;/p&gt;




&lt;h2 id="convolution"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#convolution"&gt;Convolution&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The convolution of two functions \(f(x)\) and \(g(x)\) is defined as
$$
\int_{-\infty}^\infty f(u) g(x - u)\,\mathrm{d}u
$$
and is frequently written using the \(*\) symbol. The convolution produces a new function, so we have
$$
h(x) = f(x) * g(x).
$$&lt;/p&gt;
&lt;p&gt;Convolution as a process is very useful to think of graphically&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/convolution-1.png"
 alt="A graphical representation of the convolution of two functions \(f(x)\) and \(g(x)\). The \(g\) function is flipped, shifted to \(x\), and then multiplied against \(f\). The value of the convolution \(h(x)\) is given by the integral of the multiplied product. Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;A graphical representation of the convolution of two functions \(f(x)\) and \(g(x)\). The \(g\) function is flipped, shifted to \(x\), and then multiplied against \(f\). The value of the convolution \(h(x)\) is given by the integral of the multiplied product. Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/convolution-2.png"
 alt="In general, convolution by most functions (e.g., boxcars, Gaussians, etc&amp;hellip;) results in a smoothing out of high-frequency structure. Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;In general, convolution by most functions (e.g., boxcars, Gaussians, etc&amp;hellip;) results in a smoothing out of high-frequency structure. Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/convolution-3.png"
 alt="Convolution can also be thought of as a superposition of characteristic contributions. I.e., the final function \(h(x)\) has grabbed value from nearby regions of \(f(x)\), modulated by the envelope of \(g(x)\). This paradigm is very useful for understanding interpolation, smoothing, and kernel density estimation (KDE). Credit: Bracewell Ch. 3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Convolution can also be thought of as a superposition of characteristic contributions. I.e., the final function \(h(x)\) has grabbed value from nearby regions of \(f(x)\), modulated by the envelope of \(g(x)\). This paradigm is very useful for understanding interpolation, smoothing, and kernel density estimation (KDE). Credit: Bracewell Ch. 3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Convolution is commutative&lt;/p&gt;
&lt;p&gt;$$
f * g = g * f
$$&lt;/p&gt;
&lt;p&gt;and associative&lt;/p&gt;
&lt;p&gt;$$
f * (g * h) = (f * g) * h
$$&lt;/p&gt;
&lt;p&gt;and distributive over addition&lt;/p&gt;
&lt;p&gt;$$
f * (g + h) = f * g + f * h.
$$&lt;/p&gt;
&lt;p&gt;It is a &lt;a href="https://undergroundmathematics.org/glossary/linear-operator#:~:text=A%20function%20f%20is%20called,x%20and%20all%20constants%20c."&gt;linear operator&lt;/a&gt;, just like the Fourier transform.&lt;/p&gt;




&lt;h2 id="the-impulse-symbol"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#the-impulse-symbol"&gt;The impulse symbol&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We&amp;rsquo;ll develop a notation for an intense (unit-area) pulse so brief that the measuring equipment is unable to distinguish it from pulses yet briefer still. You may be quite familiar with this concept as a &amp;ldquo;delta-function,&amp;rdquo; especially in the context of quantum physics. Physically speaking, things like &amp;ldquo;point masses&amp;rdquo;, &amp;ldquo;point charges,&amp;rdquo; and (astrophysically speaking) &amp;ldquo;point sources&amp;rdquo; do not physically exist, but they are very useful concepts. The only important attribute of an impulse is &lt;em&gt;how it reacts under integration&lt;/em&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(\delta(x) = 0\) for \(x \ne 0\)&lt;/li&gt;
&lt;li&gt;\(\int_{-\infty}^\infty \delta(x)\,\mathrm{d}x = 1\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;And, there is a close relationship between the impulse symbol and the unit step function \(H(x)\) such that
$$
\int_{-\infty}^x \delta(x^\prime)\,\mathrm{d}x^\prime = H(x).
$$&lt;/p&gt;
&lt;p&gt;Another very important property of the impulse function is its &amp;ldquo;sifting property&amp;rdquo; (TMS A2.11), such that
$$
f(a) = \int_{-\infty}^{\infty} f(x) \delta(x - a)\,\mathrm{d}x^\prime.
$$
i.e., the integral of function \(f(x)\) times a delta-function located at \(a\) will give the value of the function evaluated at \(a\), \(f(a)\).&lt;/p&gt;
&lt;p&gt;The Fourier transform of a delta function (centered on 0) is&lt;/p&gt;
&lt;p&gt;$$
F(s) = \int_{-\infty}^{\infty} \delta(0) \exp (-i 2 \pi x s)\,\mathrm{d}x = 1.
$$
This is yet another important Fourier pair
$$
\delta(x) \leftrightharpoons \mathrm{constant\;amplitude\,\forall\, s}.
$$&lt;/p&gt;




&lt;h2 id="the-sampling-or-replicating-symbol-shah-function"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#the-sampling-or-replicating-symbol-shah-function"&gt;The Sampling or Replicating Symbol &amp;ldquo;Shah function&amp;rdquo;&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;This is an infinite sequence of unit impulses, given by
$$
\mathrm{shah}(x) = \sum_{n=-\infty}^\infty \delta(x - n)
$$&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/shah.png"
 alt="The replicating function. Credit: Bracewell Fig 5.4"&gt;&lt;figcaption&gt;
 &lt;p&gt;The replicating function. Credit: Bracewell Fig 5.4&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Also sometimes called a &amp;ldquo;Dirac Comb.&amp;rdquo; There is a generalization of the sifting property, such that if you multiply a function by a shah, you are effectively sampling it at unit intervals.&lt;/p&gt;
&lt;p&gt;You can use it to sample \(f(x)\) (by multiplication)&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/sampling.png"
 alt="Sampling property of the shah function by multiplication. Credit: Bracewell Fig 5.5"&gt;&lt;figcaption&gt;
 &lt;p&gt;Sampling property of the shah function by multiplication. Credit: Bracewell Fig 5.5&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;And you can use it to replicate \(f(x)\) (by convolution)&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/replicating.png"
 alt="Replicating property of the shah function by convolution. Credit: Bracewell Fig 5.6"&gt;&lt;figcaption&gt;
 &lt;p&gt;Replicating property of the shah function by convolution. Credit: Bracewell Fig 5.6&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The unit shah function is also its own Fourier transform
$$
\mathrm{shah}(x) \leftrightharpoons \mathrm{shah}(s)
$$&lt;/p&gt;




&lt;h2 id="fourier-transform-theorems-properties-similarity-convolution-multiplication"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#fourier-transform-theorems-properties-similarity-convolution-multiplication"&gt;Fourier transform theorems properties: similarity, convolution, multiplication&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;There are several useful properties of the Fourier transform that you&amp;rsquo;ll want to familiarize yourself with. See Bracewell Ch. 6 or TMS A2.1.2.&lt;/p&gt;




&lt;h3 id="similarity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#similarity"&gt;Similarity&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f(ax) \leftrightharpoons \frac{1}{|a|} F \left (\frac{s}{a} \right).
$$&lt;/p&gt;
&lt;p&gt;I.e., applied to waveforms and spectra, a compression of the time scale corresponds to an expansion of the frequency scale.&lt;/p&gt;




&lt;h4 id="fourier-transforms-and-the-heisenberg-uncertainty-principle"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#fourier-transforms-and-the-heisenberg-uncertainty-principle"&gt;Fourier transforms and the Heisenberg uncertainty principle&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;In a signal-processing sense, this manifests as an inability to precisely specify a signal in &lt;em&gt;both&lt;/em&gt; the time and frequency domains at the same time. As you decrease the variance of a function (i.e., make it more concentrated and thus localized) in one domain, you increase the variance of it in the other domain (i.e., make it more extended and thus dispersed).&lt;/p&gt;
&lt;p&gt;In quantum mechanics, this same concept is at play in the Heisenberg Uncertainty principle, where probability distributions (i.e., wavefunctions) governing position and momentum are related by the Fourier transform. It&amp;rsquo;s impossible to know both position and momentum precisely.&lt;/p&gt;




&lt;h3 id="shift"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/02-fourier-transform-i/#shift"&gt;Shift&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If&lt;/p&gt;
&lt;p&gt;$$
f(x) \leftrightharpoons F(s)
$$&lt;/p&gt;
&lt;p&gt;then&lt;/p&gt;
&lt;p&gt;$$
f(x - a) \leftrightharpoons \exp(- 2 \pi i a s) F(s).
$$&lt;/p&gt;
&lt;p&gt;If you shift a function, then there are no changes in the &lt;em&gt;amplitude&lt;/em&gt; of the Fourier transform, but, there are changes to its phase, dependent on &lt;em&gt;s&lt;/em&gt;. The higher the frequency, the greater the change in phase angle. In radio astronomy, it&amp;rsquo;s common to hear of this as a translation in the R.A./Dec. plane results in a &lt;em&gt;phase shift&lt;/em&gt; in the visibility plane.&lt;/p&gt;</description></item><item><title>Introduction and Course Overview</title><link>https://iancze.github.io/courses/astro589/lectures/01-introduction/</link><pubDate>Tue, 21 Jun 2022 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro589/lectures/01-introduction/</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+589A+Lecture+1/1_jr8qrgi1"&gt;Zoom recording&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="references-and-resources-for-this-lecture"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#references-and-resources-for-this-lecture"&gt;References and Resources for this lecture&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Full reference information can always be found in the &lt;a href="https://iancze.github.io/courses/astro589/syllabus/"&gt;syllabus&lt;/a&gt;, under &amp;ldquo;References Materials.&amp;rdquo;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Essential Radio Astronomy&lt;/em&gt; &lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch1.html#S3"&gt;Ch 1&lt;/a&gt;: emission mechanisms, relevant astrophysical objects&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Tools of Radio Astronomy&lt;/em&gt;: radio windows, units, flux densities&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Interferometry and Synthesis in Radio Astronomy&lt;/em&gt;: single-dish observations, units, flux densities&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="course-overview"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#course-overview"&gt;Course Overview&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Welcome to Astro 589, a graduate seminar on radio astronomy and interferometric imaging!&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://iancze.github.io/courses/astro589/syllabus/"&gt;Syllabus&lt;/a&gt; overview regarding format, HW assignments, group project including proposal dates and presentation dates&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="astrophysics-at-radio-wavelengths"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#astrophysics-at-radio-wavelengths"&gt;Astrophysics at radio wavelengths&lt;/a&gt;
&lt;/h1&gt;




&lt;h2 id="emission-mechanisms"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#emission-mechanisms"&gt;Emission mechanisms&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Radio synchrotron (continuum), non-thermal emission from relativistic electrons in magnetic fields&lt;/li&gt;
&lt;li&gt;Bremsstrahlung (a.k.a. free-free) emission (continuum), thermal emission from ionized gas (H II regions)&lt;/li&gt;
&lt;li&gt;Thermal emission from cold (&amp;lt; 100 K) media, like dust (continuum)&lt;/li&gt;
&lt;li&gt;Atomic hyperfine splitting (&amp;ldquo;21-cm&amp;rdquo; line corresponding to neutral hydrogen)&lt;/li&gt;
&lt;li&gt;Molecular emission lines, primarily from rotational transitions (e.g., CO \(J=2-1\))&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch1.html#S1"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/atmospheric_windows.jpg"
 alt="Atmospheric windows for astronomy. Credit: ESA/Hubble (F. Granato) and Essential Radio Astronomy."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Atmospheric windows for astronomy. Credit: ESA/Hubble (F. Granato) and Essential Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The earth&amp;rsquo;s atmosphere is very transparent in the radio region of the electromagnetic spectrum, especially compared to optical windows. Only towards the microwave region (wavelengths around 1 mm), does the atmospheric transmission start to decline.&lt;/p&gt;




&lt;h2 id="astrophysical-objects"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#astrophysical-objects"&gt;Astrophysical objects&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In truth, almost everything these days!&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;quasars&lt;/li&gt;
&lt;li&gt;gamma ray burst (GRB) afterglows&lt;/li&gt;
&lt;li&gt;fast radio bursts (FRBs)&lt;/li&gt;
&lt;li&gt;pulsars&lt;/li&gt;
&lt;li&gt;supernovae remnants&lt;/li&gt;
&lt;li&gt;cosmic microwave background (CMB)&lt;/li&gt;
&lt;li&gt;galaxies, including molecules at high redshift sources&lt;/li&gt;
&lt;li&gt;dust/interstellar medium&lt;/li&gt;
&lt;li&gt;the Sun&lt;/li&gt;
&lt;li&gt;planets (e.g., Jupiter, Uranus)&lt;/li&gt;
&lt;li&gt;protoplanetary disks&lt;/li&gt;
&lt;li&gt;molecular clouds (molecular emission)&lt;/li&gt;
&lt;li&gt;black hole accretion disks (EHT)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.nasa.gov/mission_pages/hubble/science/hercules-a.html"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/hercules.jpg"
 alt="Radio jets from the elliptical galaxy Hercules A (overlaid with an optical image from Hubble). Karl Jansky VLA. Credit: NASA, ESA, S. Baum and C. O&amp;rsquo;Dea (RIT), R. Perley and W. Cotton (NRAO/AUI/NSF), and the Hubble Heritage Team (STScI/AURA)."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Radio jets from the elliptical galaxy Hercules A (overlaid with an optical image from Hubble). Karl Jansky VLA. Credit: NASA, ESA, S. Baum and C. O&amp;rsquo;Dea (RIT), R. Perley and W. Cotton (NRAO/AUI/NSF), and the Hubble Heritage Team (STScI/AURA).&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://public.nrao.edu/gallery/hl-tau-birth-of-planets-revealed-in-astonishing-detail-2/"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/HLTau_ALMA.jpg"
 alt="The protoplanetary disk around HL Tau, imaged using the Atacama Large Millimeter Array. Credit: ALMA(ESO/NAOJ/NRAO); C. Brogan, B. Saxton (NRAO/AUI/NSF)"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The protoplanetary disk around HL Tau, imaged using the Atacama Large Millimeter Array. Credit: ALMA(ESO/NAOJ/NRAO); C. Brogan, B. Saxton (NRAO/AUI/NSF)&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://www.almaobservatory.org/en/press-releases/planetary-rings-of-uranus-glow-in-cold-light/"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/uranus-rings.png"
 alt="The rings of Uranus seen by ALMA (thermal emission from 77 K material). Credit: ALMA (ESO/NAOJ/NRAO); E. Molter and I. de Pater."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The rings of Uranus seen by ALMA (thermal emission from 77 K material). Credit: ALMA (ESO/NAOJ/NRAO); E. Molter and I. de Pater.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h1 id="single-dish-radio-telescopes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#single-dish-radio-telescopes"&gt;Single-dish radio telescopes&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;In this section, we&amp;rsquo;ll cover existing single-dish radio telescopes and refresh our memory of basic telescope performance.&lt;/p&gt;
&lt;p&gt;The same
$$
\theta \approx \frac{\lambda}{D}
$$
applies for radio antennas. Take a \(\lambda = 1\;\mathrm{cm}\) observation, for example. Compared to an optical \(\lambda = 500\;\mathrm{nm}\) telescope the same size, the resolution will be a factor of
$$
\frac{1\;\mathrm{cm}}{500\;\mathrm{nm}} = 20,000
$$
worse. Yikes!&lt;/p&gt;
&lt;p&gt;Radio astronomers are constantly trying to find ways to increase angular (spatial) resolution.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;One way is to build bigger telescopes, such as the Green Bank Telescope (100m diameter)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://public.nrao.edu/gallery/green-bank-telescope/"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/gbt.jpg"
 alt="The Green Bank Telescope (100m diameter) operates at radio wavelegths. Credit: NRAO/AUI/NSF"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Green Bank Telescope (100m diameter) operates at radio wavelegths. Credit: NRAO/AUI/NSF&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;Another way is to work at higher frequencies (shorter wavelengths), e.g. sub-mm radio astronomy (IRAM 30m telescope)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.iram-institute.org/EN/30-meter-telescope.php"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/iram.jpg"
 alt="The IRAM 30m diameter telescope, which operates at sub-mm wavelengths. Credit: Wikipedia/IRAM-gre"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The IRAM 30m diameter telescope, which operates at sub-mm wavelengths. Credit: Wikipedia/IRAM-gre&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;A final way is to use &lt;em&gt;interferometry&lt;/em&gt;, sometimes also at sub-mm wavelengths (ALMA), which will be the main focus of this course&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It&amp;rsquo;s easier to build larger telescopes at longer wavelengths because the tolerances required for the reflecting surface are less strict than at optical wavelengths. Though typically one must keep surface tolerances to within
$$
\sigma = \frac{\lambda_\mathrm{min}}{16},
$$
otherwise the efficiency of the antenna starts to decline substantially. For the 100 m diameter GBT operating at it&amp;rsquo;s highest frequency (100 GHz) or 3 mm, this translates to \(200\;\mu\mathrm{m}\), which is the thickness of two sheets of paper! That&amp;rsquo;s quite an engineering challenge, and is the reason why large, steerable dishes are difficult to build.&lt;/p&gt;
&lt;p&gt;Keeping telescopes fixed is one way to build a little bit bigger, such as &lt;a href="https://en.wikipedia.org/wiki/Five-hundred-meter_Aperture_Spherical_Telescope"&gt;FAST&lt;/a&gt;, which is a five hundred meter diameter fixed telescope in China. See also Arecibo, which unfortunately collapsed in December 2020. Eventually, though, the materials/engineering cost to building large single dish telescopes becomes prohibitive.&lt;/p&gt;




&lt;h2 id="single-dish-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#single-dish-observations"&gt;Single dish observations&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The &amp;ldquo;beam&amp;rdquo; of a receiving antenna, or power pattern as a function of direction, can be calculated using the reciprocity theorems for transmitting and receiving antennas. These state that the far field electric field pattern \(f(l)\) is the Fourier transform (much more in lectures 2 and 3!) of the electric field illuminating the aperture of the telescope \(g(u)\).&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S1"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/uniform-dish.png"
 alt="A schematic illustration of (top): Uniformly illuminated aperture (middle): The electric field pattern of the antenna, as a function of direction (bottom): The power pattern of the antenna, as a function of direction. Credit: Essential Radio Astronomy"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A schematic illustration of (top): Uniformly illuminated aperture (middle): The electric field pattern of the antenna, as a function of direction (bottom): The power pattern of the antenna, as a function of direction. Credit: Essential Radio Astronomy&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;For large apertures, the nulls at \( l = \pm 1, 2, \ldots\) appear at the angles \(\theta \approx \lambda/D, 2 \lambda/D, \ldots\). In two dimensions, for a circular aperture, this is an Airy pattern.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/beam-pattern.png"
 alt="A beam power pattern plotten in polar coordinates, demonstrating that the antenna can pick up power from sidelobes at range of angles. Credit: Tools of Radio Astronomy."&gt;&lt;figcaption&gt;
 &lt;p&gt;A beam power pattern plotten in polar coordinates, demonstrating that the antenna can pick up power from sidelobes at range of angles. Credit: Tools of Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;This is an idealized representation, but is still helpful. The beam can pick up power through sidelobes at a range of angles. Directional antennas help concentrate power in the main beam, but antennas with secondary stages (and thus supports, like most telescopes) create opportunities for ground radiation to reflect into the receiver.&lt;/p&gt;
&lt;p&gt;You can think of single-dish telescopes (unless they have a sophisticated, multi-pixel receiver) essentially as single-pixel devices. So to make a map of the sky, you would need to raster scan the telescope across the region of interest, reading out antenna temperature as a function of RA, Dec. To make a good (i.e., scientifically accurate) map, you should focus on Nyquist sampling the sky to a uniform sensitivity, usually through a hexagonal pattern of dithering. More advanced instruments may have an array of &amp;ldquo;feeds&amp;rdquo; in a focal plane (mirroring a set of &amp;ldquo;pixels&amp;rdquo;), but this is still a small number of pixels compared to a typical CCD (e.g., 25 or 36 compared to \(2046^2\)).&lt;/p&gt;




&lt;h2 id="image-units"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#image-units"&gt;Image Units&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="radiative-transfer-recap"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#radiative-transfer-recap"&gt;Radiative transfer recap&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;What are the units of the sky, and the images we make as representations of it?&lt;/p&gt;
&lt;p&gt;One of the most useful quantities from radiative transfer is \(I_\nu\).&lt;/p&gt;
&lt;p&gt;\(I_\nu \) is the &lt;em&gt;specific intensity&lt;/em&gt; of radiation, you can think of it as the energy carried along by an infinitesimal &amp;ldquo;bundle&amp;rdquo; of rays.&lt;/p&gt;
&lt;p&gt;It has dimensions of:
$$
\mathrm{energy}\; (\mathrm{time})^{-1} \;(\mathrm{area})^{-1} \;(\mathrm{solid\,angle})^{-1} \; (\mathrm{frequency})^{-1}
$$
in CGS units, we would write
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{ster}^{-1}\;\mathrm{Hz}^{-1}
$$
In astronomical settings, I&amp;rsquo;ve always seen \(I_\nu\) referred to as the &amp;ldquo;specific intensity.&amp;rdquo; In non-astronomy settings, I&amp;rsquo;ve seen &amp;ldquo;spectral intensity.&amp;rdquo; If \(I_\nu\) is integrated over all frequencies, it&amp;rsquo;s called the radiant intensity \(I\).&lt;/p&gt;
&lt;p&gt;\(I_\nu \) can be a little mind-bending to think about&amp;hellip; it can be a function of&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;3D space \(\vec{x}\)&lt;/li&gt;
&lt;li&gt;direction \(\vec{\Omega}\)&lt;/li&gt;
&lt;li&gt;frequency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Intensity itself is &lt;em&gt;not&lt;/em&gt; a vector quantity; rather it is a scalar field that &lt;em&gt;is&lt;/em&gt; function of position and direction \(I_\nu(\vec{x}, \vec{\Omega})\). Rybicki and Lightman write the angular direction vector as \(\vec{\Omega}\) and the solid angle surrounding that vector as \(\mathrm{d}\Omega\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/rybicky_lightman_1.2.png"
 alt="The geometry surrounding the concept of specific intensity. The normal vector is \(\vec{\Omega}\), the position \(\vec{x}\) in 3D space corresponds to the location of the \(dA\) patch. The Credit: Radiative Processes, Figure 1.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;The geometry surrounding the concept of specific intensity. The normal vector is \(\vec{\Omega}\), the position \(\vec{x}\) in 3D space corresponds to the location of the \(dA\) patch. The Credit: Radiative Processes, Figure 1.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If we have a defined reference frame, we would probably write \(\vec{\Omega}\) as a vector in spherical coordinates and define the components along \(\hat{\phi}\) and \(\hat{\theta}\) and let \(\mathrm{d}\Omega = d\phi \sin \theta d\theta\).&lt;/p&gt;
&lt;p&gt;When we are making astrophysical &lt;em&gt;observations&lt;/em&gt; from the earth, though,
we are making or acquiring images of regions of the celestial sky. So we generally talk of \(I_\nu(\alpha, \delta)\), where \(\alpha\) and \(\delta\) are R.A. and declination offsets from some direction, respectively. We&amp;rsquo;re always looking from the same place (at least compared to the size of the universe), so we don&amp;rsquo;t worry about specifying position within 3D space. But if we went to the Andromeda Galaxy and started mapping the celestial sky, we would need to, then. In the end though, images are have the same units because they represent specific intensity. It&amp;rsquo;s very common to refer to \(I_\nu(\alpha, \delta)\) as the &lt;em&gt;surface brightness&lt;/em&gt;.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/solid-angle.png"
 alt="When we&amp;rsquo;re discussing images of astronomical sources, we&amp;rsquo;re usually using RA \(\alpha\) and Dec \(\delta\). A solid angle simply describes the area on a unit sphere (e.g., the sky), the area itself need not be circular. The Very Large Array, located in Socorro, NM. Credit: NRAO"&gt;&lt;figcaption&gt;
 &lt;p&gt;When we&amp;rsquo;re discussing images of astronomical sources, we&amp;rsquo;re usually using RA \(\alpha\) and Dec \(\delta\). A solid angle simply describes the area on a unit sphere (e.g., the sky), the area itself need not be circular. The Very Large Array, located in Socorro, NM. Credit: NRAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="flux"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#flux"&gt;Flux&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Once you&amp;rsquo;ve defined \(I_\nu\), then it&amp;rsquo;s relatively easy to calculate quantities like energy density, flux, momentum, etc, as integrals of the specific intensity field.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Flux&lt;/strong&gt; is where you integrate out the angular dependence:&lt;/p&gt;
&lt;p&gt;$$
F_\nu = \int I_\nu \cos \theta d \Omega
$$
(intensity passing through some differential area \(dA\), lowered by the effective angle).&lt;/p&gt;
&lt;p&gt;\(F_\nu\) has units of
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}
$$
(i.e., angular dependence has been integrated out).&lt;/p&gt;
&lt;p&gt;Most astrophysical sources produce significantly less energy in radio waves compared to higher frequency bands, and so the raw CGS unit can be quite cumbersome. To make this easier, astronomers use a unit called the &amp;ldquo;Jansky,&amp;rdquo; which is defined as&lt;/p&gt;
&lt;p&gt;$$
1\,\mathrm{Jy} = 10^{-23}~\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}
$$&lt;/p&gt;
&lt;p&gt;The Jansky is a unit of flux.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do we report specific intensities/surface brightnesses for radio sources, then?&lt;/strong&gt; We can, reintroduce the &amp;ldquo;per solid angle&amp;rdquo; to the Jansky, for example
$$
\mathrm{Jy}\;\mathrm{arcsec}^{-2}.
$$&lt;/p&gt;
&lt;p&gt;Later on the course, we&amp;rsquo;ll talk about \(\mathrm{Jy}\;\mathrm{beam}^{-1}\), which another unit of surface brightness/specific intensity.&lt;/p&gt;
&lt;p&gt;Other units of surface brightness that you might encounter at other wavelengths include \(\mathrm{mag}\;\mathrm{arcsec}^{-2}\) (optical) and \(\mathrm{MJy}\;\mathrm{sr}^{-1}\) (infrared).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Questions for review&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;What is the name of \(I_\nu\), and what are its units?&lt;/li&gt;
&lt;li&gt;What is the name of \(F_\nu\), and what are its units?&lt;/li&gt;
&lt;li&gt;If we made an astronomical observation of a &amp;ldquo;point source,&amp;rdquo; would we report \(I_\nu\) or \(F_\nu\)?&lt;/li&gt;
&lt;li&gt;What about for a spatially resolved source?&lt;/li&gt;
&lt;li&gt;Is a Jansky a unit for \(I_\nu\) or \(F_\nu\)?&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/point-source-resolved-source.png"
 alt="Using \(F_\nu\) and \(I_\nu\) to represent point sources and spatially resolved sources, respectively. Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;Using \(F_\nu\) and \(I_\nu\) to represent point sources and spatially resolved sources, respectively. Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="the-many-temperatures-of-radio-astronomy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#the-many-temperatures-of-radio-astronomy"&gt;The many temperatures of radio astronomy&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="from-cosmic-sources-only"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#from-cosmic-sources-only"&gt;From Cosmic Sources only&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;brightness temperature&lt;/strong&gt;: \(T_b\), the temperature corresponding to the specific intensity if we used the full form of the Planck formula
$$
I_\nu = \frac{2 h \nu^3}{c^2} \frac{1}{\exp{(h \nu / k T_b)} - 1}
$$&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;antenna temperature&lt;/strong&gt;: the temperature corresponding to the specific intensity if we were in the Rayleigh-Jeans domain. Added benefit that specific intensity is &lt;em&gt;linearly related&lt;/em&gt; to antenna temperature and makes it easy to substitute one for the other.
$$
T_A(\nu) = \frac{c^2}{2 k \nu^2} I_\nu
$$&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In the field of radio astronomy, be aware that one frequently combines temperatures in other interesting ways. One can express random noise power in terms of an effective temperature
$$
P = k T \Delta \nu
$$
where \(\Delta \nu\) is the bandwidth of the observation. Here the power is equal to the noise power delivered to a &lt;strong&gt;matched load&lt;/strong&gt; by a resistor at physical temperature \(T\). By matched load, we mean we connect a resistor to the input terminals of a linear amplifier. The fact that this resistor has some temperature (i.e., we haven&amp;rsquo;t cooled it to absolute zero&amp;hellip;) means that the thermal motion of the electrons will produce a random, variable current \(i(t)\) input to the amplifier. The mean value of this current is zero, but the root mean squared value is non-zero, and this represents a non-zero power. I.e., you can draw (some) power from a resistor at room temperature, purely from thermal motions. The situation is not dissimilar to the random walk of a particle in Brownian motion. For more details, see &lt;em&gt;Tools of Radio Astronomy&lt;/em&gt;, Chapter 1.8.&lt;/p&gt;




&lt;h3 id="including-noise-sources"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#including-noise-sources"&gt;Including noise sources&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Antenna temperature&lt;/strong&gt; \(T_A\) the component of the power received by the antenna from &lt;em&gt;cosmic sources&lt;/em&gt;. It has the same interpretation as before (though we&amp;rsquo;ll talk about beam dilution in a second).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Receiver temperature&lt;/strong&gt; \(T_R\) the component of the power from internal noise of the receiver components themselves, ground radiation, atmospheric emission, etc&amp;hellip;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;System temperature&lt;/strong&gt; \(T_S = T_A + T_R\) is the sum of receiver temperature and antenna temperature. It&amp;rsquo;s the one power number coming out of the backend of your telescope. It&amp;rsquo;s up to you to calibrate \(T_R\) accurately enough to measure \(T_A\).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In any observation, you will have your cosmic signal of interest and several contributions of noise (see &lt;em&gt;Essential Radio Astronomy&lt;/em&gt;, Ch. 3.6.1)
$$
T_S=T_\mathrm{cmb}+T_\mathrm{rsb}+T_A + [1−\exp(−\tau_A)] T_\mathrm{atm}+T_\mathrm{spill}+T_r+ \ldots
$$
such as the CMB, other galactic background sources, the atmosphere, spillover radiation from the ground, the temperature of the radiometer itself (hopefully cryogenically cooled), etc.&lt;/p&gt;
&lt;p&gt;In the limit that \(T_A \ll T_S\) (most astronomy situations, unfortunately!), we have
$$
S/N \approx C \frac{T_A}{T_S} \sqrt{\Delta \nu \Delta t}
$$
where \(C\) is a constant of proportionality greater than or equal to 1, and \(\Delta t\) is the integration time. If we let \(\Delta \nu \approx 1\;\mathrm{GHz}\) and \(\Delta t \approx 1\;\mathrm{h}\), then we can get \(\sqrt{\Delta \nu \Delta t} \approx 10^6\), allowing us to detect a signal which is less than \(10^{-6}\) the system noise. A great illustration of this capability is the COBE satellite that studied CMB anisotropies with brightness temperatures \(&amp;lt; 10^{-7}\) that of the system temperature. To achieve these contrasts, however, it&amp;rsquo;s important to keep systematics under control, otherwise the S/N scaling won&amp;rsquo;t hold!&lt;/p&gt;




&lt;h2 id="beam-dilution"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#beam-dilution"&gt;Beam dilution&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In previous lectures, we&amp;rsquo;ve been talking about specific intensity \(I_\nu(\Omega)\) as a &amp;ldquo;known&amp;rdquo; quantity of direction (e.g., R.A., Dec.) and used antenna temperature \(T_A\) as a linear proxy for its specification. For the following discussion, we&amp;rsquo;re going to move into the realm of observations, and discuss the ways \(T_A\) can be an unfaithful proxy for the &amp;ldquo;true&amp;rdquo; specific intensity distribution or brightness temperature. we&amp;rsquo;ll use the symbol \(T_b(\Omega)\) to denote the &amp;ldquo;true&amp;rdquo; brightness/antenna temperature, assuming we&amp;rsquo;re in the Rayleigh-Jeans limit, and redefine \(T_A\) to mean the response of the telescope to the cosmic radiation.&lt;/p&gt;
&lt;p&gt;When we&amp;rsquo;re doing observations, we don&amp;rsquo;t always have access to the highest resolution version of \(T_b(\Omega)\), but rather we have access to a quantity which is the true \(T_b(\Omega)\) convolved with the beam of the telescope, which is the implication of \(T_A\) for this discussion.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/beam-dilution.png"
 alt="When astrophysical sources are insufficiently resolved, our measurements of \(I_\nu\) do not trace the true sky distribution, specifically features are smeared out over a spatial extent and peak intensities are reduced. This means that using these insufficiently resolved measurements of \(T_A\) will not accurately trace the true underlying temperatures of the astrophysical source (even if the emission from the source is actually thermal in origin). Of course, we never have infinite spatial resolution, so there will always be structure on scales beyond that of our observations. Credit: Ian Czekala"&gt;&lt;figcaption&gt;
 &lt;p&gt;When astrophysical sources are insufficiently resolved, our measurements of \(I_\nu\) do not trace the true sky distribution, specifically features are smeared out over a spatial extent and peak intensities are reduced. This means that using these insufficiently resolved measurements of \(T_A\) will not accurately trace the true underlying temperatures of the astrophysical source (even if the emission from the source is actually thermal in origin). Of course, we never have infinite spatial resolution, so there will always be structure on scales beyond that of our observations. Credit: Ian Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h4 id="large-fully-resolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#large-fully-resolved-source"&gt;large (fully resolved) source&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;In the limit that we are observing a source that subtends a solid angle much larger than the beam of the antenna,
$$
\Omega_S &amp;gt; \Omega_A
$$
the convolution of the beam doesn&amp;rsquo;t matter, we&amp;rsquo;re still sensing approximately the same \(T_b(\Omega)\) such that
$$
T_A(\Omega) \approx T_b(\Omega).
$$
If the source is in LTE, then we also have that \(T_A \approx T\).&lt;/p&gt;




&lt;h3 id="small-unresolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#small-unresolved-source"&gt;small (unresolved) source&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;If the source is more compact than the antenna beam,
$$
\Omega_S &amp;lt; \Omega_A,
$$
then the measured antenna temperature is basically the &amp;ldquo;true&amp;rdquo; intensity averaged over the area of the main beam. This lowers the measured antenna temperature by a factor
$$
\frac{T_A}{T_b} = \frac{\Omega_S}{\Omega_A}
$$
where the ratio \(\frac{\Omega_S}{\Omega_A}\) is called the &lt;strong&gt;beam filling factor&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;For example, you could have a compact source with \(T_b = 10^4\) K, but if it only fills 1% of the beam solid angle then you would measure an antenna temperature of 100 K. If you took your observations at face-value (and assumed LTE), then you would incorrectly conclude that the source is 100x cooler than it actually is.&lt;/p&gt;
&lt;p&gt;Beam dilution also applies to observations of sources that have &lt;em&gt;structure&lt;/em&gt; on spatial scales below the observable limit, which, to be honest, is going to be most astrophysical sources of interest. For example, consider a gas filament in a star-forming region.&lt;/p&gt;
&lt;p&gt;Radio-bright, spatially concentrated regions will be &amp;ldquo;smeared out&amp;rdquo; by the beam. If you wanted to use antenna temperature (and assume LTE) to estimate the physical conditions of the gas filament, you do so at the peril of measuring incorrect temperatures. The unfortunate reality here is that, without higher resolution images to guide you (which sometimes exist at optical or infrared wavelengths), it&amp;rsquo;s quite difficult to estimate how badly your measurements are affected by beam dilution.&lt;/p&gt;
&lt;p&gt;For more useful single-dish guidance, see &lt;a href="https://www.atnf.csiro.au/research/radio-school/2011/talks/Parkes-school-Fundamental-II.pdf"&gt;these notes&lt;/a&gt; by James Jackson, or &lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S1.SS6"&gt;Ch 3.1.6&lt;/a&gt; of &lt;em&gt;Essential Radio Astronomy&lt;/em&gt;.&lt;/p&gt;




&lt;h1 id="introduction-to-interferometric-arrays-alma-vla-sma-vlba"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro589/lectures/01-introduction/#introduction-to-interferometric-arrays-alma-vla-sma-vlba"&gt;Introduction to interferometric arrays, ALMA, VLA, SMA, VLBA&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Now that we&amp;rsquo;ve covered some of the fundamentals around radio telescopes and single-dish antennas, we&amp;rsquo;ll move on to discussing how we combine the signals from multiple antennas to do interferometry. Here are some of he interferometers operating today:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/alma.jpg"
 alt="The Atacama Large (Sub)millimeter Array, an interferometric array of 66 antennas operating at sub-millimeter wavelengths. The largest antennas in the array are only 12m in diameter, yet through interferometry, the array is able to obtain far higher spatial resolution than the largest single-dish antennas. Credit: NRAO/ESO/NAOJ/JAO"&gt;&lt;figcaption&gt;
 &lt;p&gt;The Atacama Large (Sub)millimeter Array, an interferometric array of 66 antennas operating at sub-millimeter wavelengths. The largest antennas in the array are only 12m in diameter, yet through interferometry, the array is able to obtain far higher spatial resolution than the largest single-dish antennas. Credit: NRAO/ESO/NAOJ/JAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://public.nrao.edu/telescopes/vla/"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/VLA.jpg"
 alt="The Very Large Array, located in Socorro, NM. Credit: NRAO"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Very Large Array, located in Socorro, NM. Credit: NRAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/Very_Long_Baseline_Array"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/VLBA_St_Croix.jpg"
 alt="One antenna at the Eastern end of the Very Long Baseline Array (VLBA), St. Croix, U.S. Virgin Islands. Credit: Cumulus Clouds"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;One antenna at the Eastern end of the Very Long Baseline Array (VLBA), St. Croix, U.S. Virgin Islands. Credit: Cumulus Clouds&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://www.iram-institute.org/EN/noema-project.php?ContentID=9&amp;amp;rub=9&amp;amp;srub=0&amp;amp;ssrub=0&amp;amp;sssrub=0"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/NOEMA.jpg"
 alt="The NOEMA, located in the French Alps on the Plateau du Bure. Credit: IRAM/Rebus"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The NOEMA, located in the French Alps on the Plateau du Bure. Credit: IRAM/Rebus&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://lweb.cfa.harvard.edu/sma/"&gt;&lt;img src="https://iancze.github.io/courses/astro589/lectures/01-introduction/SMA.jpg"
 alt="The Submillimeter Array (SMA), located on Mauna Kea, Hawaii. Credit: I. Czekala"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Submillimeter Array (SMA), located on Mauna Kea, Hawaii. Credit: I. Czekala&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
</description></item><item><title>Astrochemistry</title><link>https://iancze.github.io/courses/astro542/lectures/35-astrochemistry/</link><pubDate>Wed, 08 Dec 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/35-astrochemistry/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+December+8/1_oluy4593"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;</description></item><item><title>Exoplanets and Planet Formation</title><link>https://iancze.github.io/courses/astro542/lectures/33-exoplanets-planet-formation/</link><pubDate>Sat, 04 Dec 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/33-exoplanets-planet-formation/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Dec+3rd/1_bpz5707q"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;</description></item><item><title>Circumstellar Disks</title><link>https://iancze.github.io/courses/astro542/lectures/32-circumstellar-disks/</link><pubDate>Wed, 24 Nov 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/32-circumstellar-disks/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+November+29/1_klingyqs"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;</description></item><item><title>Star Formation: Observations</title><link>https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/</link><pubDate>Wed, 17 Nov 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Nov+17/1_3ezwwaxf"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch 42&lt;/li&gt;
&lt;li&gt;Stahler and Palla (2005) (comprehensive textbook)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="terminology-refresher"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/#terminology-refresher"&gt;Terminology refresher&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&amp;ldquo;Dark cloud&amp;rdquo; defined via magnitudes of optical extinction (\(A_V &amp;gt; 5\,\mathrm{mag}\))&lt;/li&gt;
&lt;li&gt;&amp;ldquo;Cores&amp;rdquo; are self-gravitating density peaks within an isolated dark cloud, which have masses from \(0.3 - 10\,M_\odot\), each is likely to form a single star or a binary star&lt;/li&gt;
&lt;li&gt;Within giant molecular clouds (GMCs), &lt;strong&gt;clump&lt;/strong&gt; refers to self-gravitating regions with masses as large as \(10^3\,M_\odot\).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;clumps&lt;/strong&gt; may or may not be forming stars. Those that are are called &lt;em&gt;star-forming clumps&lt;/em&gt;. Each clump with generally contain a number of cores.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="core-collapse-to-form-stars"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/#core-collapse-to-form-stars"&gt;Core collapse to form stars&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;When the core becomes gravitationally unstable, it will begin to collapse.&lt;/p&gt;
&lt;p&gt;The free-fall timescale is
$$
\tau = \sqrt{\frac{3 \pi}{32 G \rho}}
$$&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In initial stages, radiative cooling in the molecular lines keeps the gas cool. This means that gas pressure remains unimportant during this phase so the matter can move inward nearly in free-fall.&lt;/li&gt;
&lt;li&gt;In interior of the core, the density is higher, and so the free-fall time is shorter there. This leads the core to collapse in an &amp;ldquo;inside-out&amp;rdquo; manner, where the center collapses on itself first, and the outer material falls onto the central matter later.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;As we discussed in the previous lecture, molecular clouds appear to have magnetic energies comparable to the kinetic energy (from turbulent motions) and an abundance of angular momentum that needs to be shed before collapse can complete.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;The fact that stars do form implies that ambipolar diffusion (or some other process) is able to reduce the magnetic field to below some critical value such that it no longer prevents gravitational collapse.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Ambipolar diffusion&lt;/strong&gt;: Normally, we assume that the magnetic field is &amp;ldquo;frozen&amp;rdquo; into the plasma. Indeed, the magnetic field is coupled to charged particles (electrons, ions, and charged dust grains). But, the plasma can drift relative to the neutral gas as they become uncoupled. In a molecular cloud, where the fractional ionization is very low, neutral particles will only rarely encounter charged particles, so they aren&amp;rsquo;t completely hindered in their collapse. The magnetic force on the plasma is balanced against the force resulting from collisions of charged particles with neutrals. And so we get a timescale for the magnetic field to &amp;ldquo;slip&amp;rdquo; out of the clump. For high densities \(n_\mathrm{H} \gtrsim 10^5\,\mathrm{cm}^{-3}\) the timescale is short enough \(\lesssim 3 \times 10^6\,\mathrm{yr}\) such that this mechanism may be able to reduce the magnetic flux in a contracting clump.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Infalling gas will generally have nonzero angular momentum, and as long as it remains relatively cold, it will collapse to form a rotationally supported disk. The material with the lowest specific angular momentum will be collected in a central protostar.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The dominant sources of energy are&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;gravitational energy released as material is added to the protostar and it contracts&lt;/li&gt;
&lt;li&gt;energy released when the protostar starts to burn deuterium, and then hydrogen (main sequence)&lt;/li&gt;
&lt;/ol&gt;




&lt;h2 id="protostar-classes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/#protostar-classes"&gt;Protostar classes&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Protostellar systems (protostars and their disks) are conventionally divided into four different classes based on the overall shape of their infrared spectrum, characterized by their spectral index
$$
\alpha \equiv \frac{d \log(\lambda F_\lambda)}{d \log \lambda}.
$$
i.e.,
$$
\nu F_\nu \propto \nu^{-\alpha}.
$$
Observationally, this is usually calculated using observations done at 2.2 microns (K-band) and 10 microns (N-band), which are observable from the ground.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2019PhDT........31V/abstract"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/vanthoff_thesis.png"
 alt="Schematic overview of the different stages of low-mass star formation (left) and the observational classification based on the spectral energy distribution (right). The embedded phase is characterized by Stage 0 and Stage 1. Credit: Merel van&amp;rsquo;t Hoff, Ph.D. thesis."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Schematic overview of the different stages of low-mass star formation (left) and the observational classification based on the spectral energy distribution (right). The embedded phase is characterized by Stage 0 and Stage 1. Credit: Merel van&amp;rsquo;t Hoff, Ph.D. thesis.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="initial-mass-function-imf"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/#initial-mass-function-imf"&gt;Initial Mass Function (IMF)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If we go to a large star-forming region like the Orion Nebula Cluster, we can tabulate all of the stellar masses that we see and estimate the distribution of initial stellar masses, called the &lt;strong&gt;initial mass function&lt;/strong&gt; (IMF).&lt;/p&gt;
&lt;p&gt;Starting with Salpeter in 1955, many people have studied the IMF in different regions of the Milky Way and other galaxies. There is no immediate reason why we should expect the IMF to be universal (e.g., metallicity? turbulence? magnetic fields?), but studies seem to show remarkable uniformity from region to region. Thus far, any systematic differences appear to be very small.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/fig_42_1.jpg"
 alt="Stellar initial mass function (IMF) as estimated by Kroupa (2001) and Chabrier (2003). The y-axis is \(d N / d \ln M\), the number of stars formed per logarithmic interval in stellar mass, normalized to the value at \(M = 1\,M_\odot\). Credit: Draine Figure 42.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Stellar initial mass function (IMF) as estimated by Kroupa (2001) and Chabrier (2003). The y-axis is \(d N / d \ln M\), the number of stars formed per logarithmic interval in stellar mass, normalized to the value at \(M = 1\,M_\odot\). Credit: Draine Figure 42.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;It&amp;rsquo;s hard to estimate the IMF at the high mass end because high-mass stars are rare (and in any SF region, you may only have a few). It&amp;rsquo;s hard to estimate the IMF at the low mass end because low-mass stars are faint. But over the range \(0.01 to 50\,M_\odot\), there appears to be reasonable agreement between studies.&lt;/p&gt;
&lt;p&gt;The mass per logarithmic mass interval peaks around \(0.5\,M_\odot\), meaning that solar-mass stars are not the most common.&lt;/p&gt;
&lt;p&gt;The IMF can also be used to estimate core-collapse supernova rates. If all \(M &amp;gt; 8\,M_\odot\) stars become Type II supernovae, then the Milky Way star formation rate of \(1.3\,M_\odot\,\mathrm{yr}^{-1}\) corresponds to a Type II supernova rate of about one every 70 yrs.&lt;/p&gt;




&lt;h2 id="star-formation-rates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/#star-formation-rates"&gt;Star Formation Rates&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;How to estimate how many stars are being formed in our Galaxy?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Massive stars are highly luminous and create H II regions, which can be observed at large distances.&lt;/li&gt;
&lt;li&gt;We can use observed number of high-mass stars and theoretical estimates of stellar lifetimes to estimate the rate at which massive stars are being formed&lt;/li&gt;
&lt;li&gt;Then use the IMF to estimate the total rate at which stars are being formed&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Related to this question is how to calculate the rate of ionizing photons in our Galaxy.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;N II emission (related to H II abundance)&lt;/li&gt;
&lt;li&gt;COBE data of [N II] 205 micron line&lt;/li&gt;
&lt;li&gt;Free-free radio emission (WMAP)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;using these combined values, the total rate of star formation in the Milky Way (averaged over the past 3 Myr, which is the lifetime of early O-type stars), is about one solar mass per year.&lt;/p&gt;




&lt;h2 id="schmidt-kennicutt-law"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/30-star-formation-observations/#schmidt-kennicutt-law"&gt;Schmidt-Kennicutt Law&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The total mass of molecular gas in the MW is \(10^9\,M_\odot\). If we say the typical density is \(n_\mathrm{H} \approx 50\,\mathrm{cm}^{-3}\), then the free-fall time of that gas would be about \(6 \times 10^6\,\mathrm{yr}\), so the maximum rate at which stars could be made would be
$$
\dot{M}&lt;em&gt;\mathrm{ff} = \frac{M&lt;/em&gt;\mathrm{tot}}{\tau_\mathrm{ff}} \approx 200\,M_\odot\,\mathrm{yr}^{-1}
$$
the actual SF rate is two orders of magnitude below this. Why?&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;most of the mass in giant molecular clouds is not undergoing free-fall collapse&lt;/li&gt;
&lt;li&gt;even in the regions that do collapse, only a fraction of the gas ends up in stars&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Big question in ISM theory is &lt;em&gt;why the SF rate has the observed value&lt;/em&gt; that it does. This is complicated because it involves&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;excitation and damping of MHD turbulence in molecular clouds&lt;/li&gt;
&lt;li&gt;transport of angular momentum out of contracting regions&lt;/li&gt;
&lt;li&gt;ambipolar diffusion to remove magnetic flux from contracting regions&lt;/li&gt;
&lt;li&gt;important effects of &amp;ldquo;feedback&amp;rdquo;&amp;ndash;how outflows and radiation from protostars and stars on the surrounding gas, either stimulating or suppressing further star formation.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Star-formation is the result of gravitational collapse, so one expects the specific star formation rate (SF rate per unit of gas mass) to be larger in higher density regions.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Schmidt (1959)&lt;/em&gt; proposed that the star formation rate per volume varied as a power of the local density \(\rho\). The volume density is difficult to determine from afar, so &lt;em&gt;Kennicut (1998)&lt;/em&gt; examined the relationship between global star formation in a galaxy and \(\Sigma_\mathrm{gas,disk}\), which is the gas surface density averaged over the &amp;ldquo;optical disk&amp;rdquo; of the galaxy. He found that the star formation rate per unit area varied as
$$
\Sigma_\mathrm{SFR, disk} = (2.5 \pm 0.7) \times 10^{-4} \left ( \frac{\Sigma_\mathrm{gas, disk}}{M_\odot\,\mathrm{pc}^{-2}} \right)\,M_\odot\,\mathrm{kpc}^{-2}\,\mathrm{yr}^{-1}
$$&lt;/p&gt;
&lt;p&gt;This is referred to as the &lt;strong&gt;Schmidt-Kennicut&lt;/strong&gt; Law, and describes the &lt;em&gt;global star formation rate&lt;/em&gt; of a galaxy. It works remarkably well from the low gas surface densities of gas-poor spiral disks to the very high surface densities in the cores of luminous starburst galaxies.&lt;/p&gt;
&lt;p&gt;You can create an even tighter relationship if you consider the surface density of &lt;em&gt;molecular gas&lt;/em&gt; and average over smaller volumes (for nearby, well-resolved galaxies).&lt;/p&gt;</description></item><item><title>Gravitational Collapse and Star Formation: Theory</title><link>https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/</link><pubDate>Mon, 15 Nov 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Nov+15/1_4xd4rf28"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch 41&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Gravity is the force responsible for gathering gas into self-gravitating structures, ranging in size from stars to giant molecular cloud complexes.&lt;/p&gt;
&lt;p&gt;Star formation involves extreme &lt;em&gt;compression&lt;/em&gt;: drawing a gas cloud from size \(10^{18}\) cm down to the size of a star \(10^{11}\) cm.&lt;/p&gt;
&lt;p&gt;This lecture deals with the conditions necessary for gravitational collapse to occur (and be consistent w/ observations).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Gravity must overcome the resistance of pressure (both gas pressure and magnetic pressure)&lt;/li&gt;
&lt;li&gt;Angular momentum must be transferred to nearby material&lt;/li&gt;
&lt;li&gt;Most magnetic field lines initially present in the gas must &lt;em&gt;not&lt;/em&gt; be swept into the forming protostar&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="gravitational-instability"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/#gravitational-instability"&gt;Gravitational instability&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Jeans instability: non-rotating and unmagnetized gas.&lt;/p&gt;
&lt;p&gt;Approach: write down the fluid equations expressing conservation of mass and momentum, and the equation for the gravitational potential.&lt;/p&gt;
&lt;p&gt;Then, solve for what happens to a perturbation. And define the criterion under which the perturbation grows rather than damps, that defines the &lt;strong&gt;Jeans instability&lt;/strong&gt; and a corresponding &lt;strong&gt;Jeans mass&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;$$
\frac{\partial \rho}{\partial t} + \nabla \cdot (\rho \boldsymbol{v}) = 0
$$&lt;/p&gt;
&lt;p&gt;$$
\frac{\partial v}{\partial t} + (\boldsymbol{v} \cdot \nabla) \boldsymbol{v} = -\frac{1}{\rho} \nabla p - \nabla \phi
$$&lt;/p&gt;
&lt;p&gt;$$
\nabla^2 \phi = 4 \pi G \rho
$$&lt;/p&gt;
&lt;p&gt;See Draine Ch 35 for more.&lt;/p&gt;
&lt;p&gt;\(\phi\) is the scalar potential for the gravitational field.&lt;/p&gt;
&lt;p&gt;Suppose we have an equilibrium steady state solution with \(\rho_0(\boldsymbol{r})\), \(\boldsymbol{v}_0(\boldsymbol{r})\), \(p_0(\boldsymbol{r})\), and \(\phi_0(\boldsymbol{r})\).&lt;/p&gt;
&lt;p&gt;And then, we&amp;rsquo;ll determine the conditions under which the equilibrium solution is susceptible to gravitational collapse by introducing a small perturbation
$$
\boldsymbol{v} = \boldsymbol{v}_0 + \boldsymbol{v}_1
$$&lt;/p&gt;
&lt;p&gt;$$
\rho = \rho_0 + \rho_1
$$&lt;/p&gt;
&lt;p&gt;etc..&lt;/p&gt;
&lt;p&gt;Plugging these into the equations, we get
$$
\frac{\partial \rho_1}{\partial t} + \boldsymbol{v}_0 \cdot \nabla \rho_1 + \boldsymbol{v}_1 \cdot \nabla \rho_0 = - \rho_1 \nabla \cdot \boldsymbol{v}_0 - \rho_0 \nabla \cdot \boldsymbol{v}_1
$$
and
$$
\frac{\partial \boldsymbol{v}_1}{\partial t} + (\boldsymbol{v}_0 \cdot \nabla) \boldsymbol{v}_1 + (\boldsymbol{v}_1 \cdot \nabla) \boldsymbol{v}_0 = \frac{\rho_1}{\rho_0^2} \nabla p_0 - \frac{1}{\rho_0} \nabla p_1 - \nabla \phi_1
$$
and
$$
\nabla^2 \phi_1 = 4 \pi G \rho_1.
$$&lt;/p&gt;
&lt;p&gt;Then, if we consider an &lt;strong&gt;isothermal&lt;/strong&gt; gas, which has \(p = \rho c_s^2\), we reduce the second equation to
$$
\frac{\partial \boldsymbol{v}_1}{\partial t} + (\boldsymbol{v}_0 \cdot \nabla) \boldsymbol{v}_1 + (\boldsymbol{v}_1 \cdot \nabla) \boldsymbol{v}_0 = - c_s^2 \nabla \left ( \frac{\rho_1}{\rho_0} \right) - \nabla \phi_1,
$$
which gives us three equations for three unknowns \(\rho_1, \boldsymbol{v}_1, \phi_1\).&lt;/p&gt;
&lt;p&gt;Jeans (1928) considered the problem of a uniform, stationary gas, having \(\nabla \rho_0 = 0\), \(\nabla \phi_0 = 0\), \(\boldsymbol{v}_0 = 0\).&lt;/p&gt;
&lt;p&gt;We can rearrange this to
$$
\frac{\partial^2 \rho_1}{\partial t^2} = c_s^2 \nabla^2 \rho_1 + (4 \pi G \rho_0) \rho_1
$$&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s consider perturbations in plane-wave form
$$
\rho_1 \propto \exp [i (\boldsymbol{k} \cdot \boldsymbol{r} - \omega t)]
$$
which has the dispersion relation
$$
\omega^2 = k^2 c_s^2 - 4 \pi G \rho_0
$$
if we define
$$
k_\mathrm{J}^2 \equiv (4 \pi G \rho_0)/c_s^2,
$$
then we have the dispersion relation
$$
\omega^2 = (k^2 - k_\mathrm{J}^2) c_s^2.
$$&lt;/p&gt;
&lt;p&gt;So, \(\omega\) is real if and only if \(k \geq k_\mathrm{J}\), i.e., the wave number is larger than the Jeans wavenumber.&lt;/p&gt;
&lt;p&gt;Otherwise, if \(k &amp;lt; k_\mathrm{J}\), then \(\omega\) becomes imaginary, which corresponds to exponential growth.&lt;/p&gt;
&lt;p&gt;So the &lt;strong&gt;Jeans instability&lt;/strong&gt; occurs for wavelengths
$$
\lambda &amp;gt; \lambda_\mathrm{J} \equiv \frac{2 \pi}{k_\mathrm{J}} = \left ( \frac{\pi c_s^2}{G \rho_0} \right)^{1/2}.
$$&lt;/p&gt;
&lt;p&gt;Basically, perturbations greater than this scale grow.&lt;/p&gt;
&lt;p&gt;We can define a corresponding &lt;strong&gt;Jeans mass&lt;/strong&gt;
$$
M_\mathrm{J} \equiv \frac{4 \pi}{3} \rho_0 \left ( \frac{\lambda_\mathrm{J}}{2} \right)^3
$$
which, when you plug in typical densities of quiescent dark clouds, you get something around a third of a solar mass.&lt;/p&gt;
&lt;p&gt;The exponential &amp;ldquo;growth time&amp;rdquo; is
$$
\tau_\mathrm{J} = \frac{1}{k_\mathrm{J} c_s} \approx 10^4\,\mathrm{yrs}.
$$&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Flaws in the preceeding Jeans analysis&lt;/strong&gt;: the assumption that \(\nabla \phi_0 = 0\) everywhere is completely unphysical, because it implies that \(\nabla^2 \phi_0 = 0\) everywhere, which implies \(\rho_0 = 0\) everywhere, i.e., we&amp;rsquo;re in a vacuum.&lt;/p&gt;
&lt;p&gt;But, for finite systems, rigorous analyses yield instability criteria that are close to Jeans&amp;rsquo;s.&lt;/p&gt;




&lt;h2 id="parker-instability"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/#parker-instability"&gt;Parker instability&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We won&amp;rsquo;t have time to talk about it today, but there is an analogous analysis called &lt;strong&gt;Parker Instability&lt;/strong&gt; which treats magnetic fields.&lt;/p&gt;




&lt;h2 id="nonrotating-nonmagnetized-isothermal-core"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/#nonrotating-nonmagnetized-isothermal-core"&gt;Nonrotating, nonmagnetized isothermal core&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider a spherical &amp;ldquo;core&amp;rdquo; with mass \(M\), radius \(R\), having external pressure \(p_0\) at the surface. The gravitational energy can be written as&lt;/p&gt;
&lt;p&gt;$$
E_\mathrm{grav} = - \frac{3}{5} a \frac{GM^2}{R}
$$
where \(a\) is a dimensionless factor that is 1 for uniform density and is \(a &amp;gt; 1\) if the density is centrally peaked.&lt;/p&gt;
&lt;p&gt;If the gas is in equilibrium with \(v=0\), then the virial theorem requires that
$$
0 = 3 M c_s^2 - 4 \pi p_0 R^3 - \frac{3}{5}a \frac{G M^2}{R}.
$$&lt;/p&gt;
&lt;p&gt;The external pressure \(p_0\) must be given by
$$
p_0 = \frac{1}{4 \pi R^3} \left [ 3 M c_s^2 - \frac{3}{5} a \frac{G M^2}{R} \right].
$$&lt;/p&gt;
&lt;p&gt;If the external pressure \(p_0\) is small, then equilibrium has
$$
R \approx \frac{a G M}{5 c_s^2}.
$$&lt;/p&gt;
&lt;p&gt;For a fixed mass \(M\), the external pressure has a maximum allowed value such that the equation holds.&lt;/p&gt;
&lt;p&gt;We can turn this around, and instead ask for a given pressure \(p_0\), what is the maximum core mass that can be in equilibrium?
$$
M_\mathrm{BE}(p_0) = 0.26 \left ( \frac{T}{10\,\mathrm{K}} \right)^2 \left (\frac{10^6\,\mathrm{cm}^{-3}\,\mathrm{K}}{p_0/k} \right)^{1/2}\, M_\odot
$$
where \(M_\mathrm{BE}\) is the &lt;strong&gt;Bonner-Ebert&lt;/strong&gt; mass, i.e., pressure-bounded isothermal sphere. The Bonner-Ebert mass differs from the Jeans mass by only order unity.&lt;/p&gt;
&lt;p&gt;Only cores with \(M &amp;gt; M_\mathrm{BE}\) are unstable to collapse, which provides a nice explanation to the fact that &amp;ldquo;typical&amp;rdquo; stars have masses of \(\sim 1 M_\odot\)&amp;mdash;only cores larger than this are unstable to collapse and will become stars.&lt;/p&gt;




&lt;h2 id="angular-momentum-problem"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/29-star-formation-theory/#angular-momentum-problem"&gt;Angular momentum problem&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Self-gravitating cores will generally have non-zero angular momentum.&lt;/p&gt;
&lt;p&gt;But, if the cloud were to contract while preserving constant angular momentum, then the rotational kinetic energy will stop contraction of a cloud at a radius of \(10^{16}\,\mathrm{cm}\), which is about five orders of magnitude larger than the size of a star!&lt;/p&gt;
&lt;p&gt;So, it&amp;rsquo;s clear that a collapsing cloud cannot contract to anything approaching the size of a star if angular momentum is conserved.&lt;/p&gt;
&lt;p&gt;In order for the core to continue contracting, needs to transfer nearly all of its angular momentum to nearby material. The two mechanisms that can do this are&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;gravitational torques: non-axisymmetric density patters in the gas (e.g., spiral arms), then gravitational torques could remove angular momentum from the clump on a time scale of order the dynamical time. But, it&amp;rsquo;s unclear how non-axisymmetric a collapsing clump might be.&lt;/li&gt;
&lt;li&gt;magnetic torques: if the magnetic field is contributing to the support of the system against its self-gravity, then the magnetic braking time can be of order the dynamical time.&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Supernovae Remnants and the Hot Ionized Medium</title><link>https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/</link><pubDate>Sun, 07 Nov 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Nov+12/1_zci0znd8"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch. 5.2&lt;/li&gt;
&lt;li&gt;Draine Ch. 39 and 40&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Following our discussion of the physics of shock-fronts, let&amp;rsquo;s talk about how the energy deposition gets there in the first place.&lt;/p&gt;
&lt;p&gt;Explosions are a great way to deposit a lot of energy into a tiny volume of gas over a short period of time. The first nuclear fission bombs provided an urgent need for study, and independently, Leonid Sedov and Geoffrey Taylor found analytic self-similar solutions for the expansion of a non-radiative spherical shock front.&lt;/p&gt;
&lt;p&gt;What does self-similar mean? Basically, &lt;em&gt;scale-invariance&lt;/em&gt;, like a power law or fractal nature.&lt;/p&gt;
&lt;p&gt;The &lt;strong&gt;Sedov-Taylor&lt;/strong&gt; solution can be applied to the explosion of fission bombs \(10^{20}\,\mathrm{ergs}\) or supernovae \(10^{51}\,\mathrm{erg}\).&lt;/p&gt;
&lt;p&gt;There are two types of supernovae&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;core-collapse&lt;/strong&gt;: powered by gravitational collapse of the dense core of an evolved massive star&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;thermonuclear&lt;/strong&gt;: powered by runaway nuclear fusion in a degenerate white dwarf&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="supernovae-remnants"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/#supernovae-remnants"&gt;Supernovae Remnants&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s consider a core-collapse for now.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Such a core-collapse explosion produces \(\sim 10^{53}\,\mathrm{erg}\) in neutrinos. Because neutrinos have a gigantic mean-free path for interactions with low-density baryonic matter, this energy is mostly lost and &lt;em&gt;not&lt;/em&gt; deposited in the ISM.&lt;/li&gt;
&lt;li&gt;The SNe also produces about \(\sim 10^{49}\,\mathrm{ergs}\) in photons, which is also mostly lost.&lt;/li&gt;
&lt;li&gt;The main energy that is deposited is in the form of kinetic energy of the expanding ejecta.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Let&amp;rsquo;s take \(M_\mathrm{ej} \sim 10\,M_\odot\) as typical ejecta mass, and \(u_\mathrm{ej} \sim 3000\,\mathrm{km}\,\mathrm{s}^{-1}\), which implies a kinetic energy of
$$
E = M_\mathrm{ej} u_\mathrm{ej}^2 \sim 10^{51}\,\mathrm{erg}.
$$&lt;/p&gt;
&lt;p&gt;[Though thermonuclear SNe have a lower ejecta mass, their ejecta speed is higher, and so they have approximately a similar kinetic energy.]&lt;/p&gt;




&lt;h3 id="free-expansion-phase"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/#free-expansion-phase"&gt;Free-expansion phase&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The first stage after an SNe explosion is the &lt;em&gt;free-expansion&lt;/em&gt; phase, where the ejecta travel ballistically outward at nearly constant speed.&lt;/p&gt;
&lt;p&gt;The ejecta are only slowed down significantly when they have swept up a mass of ISM gas comparable to the mass of the ejecta, which occurs at
$$
r_\mathrm{sweep} = \left ( \frac{3 M_\mathrm{ej}}{4 \pi \rho_1} \right)^{1/3} = 4.1\,\mathrm{pc} \left ( \frac{M_\mathrm{ej}}{10\,M_\odot} \right)^{1/3} \left ( \frac{n_\mathrm{H}}{1\,\mathrm{cm}^{-3}} \right)^{-1/3}.
$$&lt;/p&gt;
&lt;p&gt;For most conditions, we expect this to last about 1000 yrs.&lt;/p&gt;
&lt;p&gt;The initial speed of the ejecta (3000 km/s) is much greater than the sound speed in the ISM gas (~ 1 km/s), so this creates a shockwave, like a bullet moving through the Earth&amp;rsquo;s atmosphere (Mach number &amp;raquo; 1).&lt;/p&gt;
&lt;p&gt;Immediately behind the shock front, the shock-heated gas has a very high pressure \(P_2\). But, the gas of the ejecta is also becoming less dense as it expands (and less hot). When the pressure within the ejecta becomes small relative to the pressure of the recently shock-heated interstellar gas, a reverse shock propagates backward toward the center of the SNe remnant, shock-heating the ejecta.&lt;/p&gt;
&lt;p&gt;Cartoon of the free-expansion phase:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;strong, spherical shock is expanding outward into the ISM&lt;/li&gt;
&lt;li&gt;behind the shock, is a layer of hot, relatively dense gas, consisting of mixed ejecta and interstellar gas&lt;/li&gt;
&lt;li&gt;interior to that is a core of hot, low-density gas consisting of ejecta shock-heated by the reverse shock&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Real SNe remnants may be more complicated, for example, the Crab Nebula seems to have speed up its expansion, possibly accelerated by the magnetized pulsar at the heart of the nebula.&lt;/p&gt;




&lt;h3 id="sedov-taylor-phase"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/#sedov-taylor-phase"&gt;Sedov-Taylor Phase&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;After the free-expansion phase is over, the remnant transitions into the &lt;strong&gt;Sedov-Taylor&lt;/strong&gt; phase, also considered the &lt;strong&gt;blastwave phase&lt;/strong&gt;. In this phase,&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;the amount of shocked ISM gas is much greater than the original mass of the ejecta, so the shock front no longer depends on \(M_\mathrm{ej}\).&lt;/li&gt;
&lt;li&gt;The amount of energy radiated by photons is much smaller than the initial kinetic energy \(E\), though, so things still depend on the initial \(E\) deposited.&lt;/li&gt;
&lt;li&gt;\(P_2 \gg P_1\), so properties of the shock don&amp;rsquo;t depend on surrounding medium&lt;/li&gt;
&lt;li&gt;Properties of shock &lt;em&gt;do&lt;/em&gt; depend on \(\rho_1\), the &lt;em&gt;density&lt;/em&gt; of the surrounding medium&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The two important quantities, \(E\) and \(\rho_1\), cannot be combined to form a characteristic length scale or timescale for the problem. And, it&amp;rsquo;s not relevant to introduce physical constants because&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;gravity is negligible: \(G\)&lt;/li&gt;
&lt;li&gt;the shock is non-relativistic and non-radiative: \(c\)&lt;/li&gt;
&lt;li&gt;classical (not quantum): \(\hbar\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So the conclusion is that the solution to the shock propagation is scale-free, or &lt;strong&gt;self-similar&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;One class of self-similar solutions are power-laws, so let&amp;rsquo;s try writing the radius as a function of time like
$$
r_\mathrm{sh}(t) = A E^\alpha \rho_1^\beta t^\eta
$$
where \(A\) is a dimensionless pre-factor. It turns out that in order to get an expression with the correct units, the exponents are such that
$$
r_\mathrm{sh} = A \left ( \frac{E t^2}{\rho_1} \right)^{1/5}.
$$&lt;/p&gt;
&lt;p&gt;And we can solve for the expansion speed of the shock front
$$
u_\mathrm{sh} = \frac{2}{5} A \left ( \frac{E}{\rho_1 t^3} \right)^{1/5} = \frac{2}{5} \frac{r_\mathrm{sh}}{t} \propto r_\mathrm{sh}^{-3/2}
$$&lt;/p&gt;
&lt;p&gt;These equations can be applied to supernova remnants in the Sedov-Taylor phase and, along with a distance estimate, be used to calculate the age of the remnant.&lt;/p&gt;
&lt;p&gt;Sedov and Taylor both found solutions for the density, velocity, and pressure as a function of time
$$
\rho(r, t) = \rho_1 f(x),
$$
and
$$
u(r, t) = \frac{r_\mathrm{sh}(t)}{t} g(x),
$$
and
$$
P(r, t) = \frac{\rho_1 r_\mathrm{sh}^2(t)}{t^2} h(x),
$$
where \(f, g, h\) are dimensionless functions of the dimensionless radius \(x = r/r_\mathrm{sh}(t)\). Using the conservation equations of momentum, energy, and entropy for a spherically symmetric system, you get three differential equations that you can solve for \(f, g, h\) as a function of \(x\), and there you go. It&amp;rsquo;s difficult to do in practice.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/Fig5-4.jpg"
 alt="Sedov-Taylor solution for a spherical blastwave with \(\gamma = 5/3\). The density, velocity, and pressure are normalized by their immediate post-shock values \(\rho_2, u_2, P_2\). The plotted velocity \(u\) is relative to the center of the blastwave. Credit: Ryden and Pogge Fig 5.4."&gt;&lt;figcaption&gt;
 &lt;p&gt;Sedov-Taylor solution for a spherical blastwave with \(\gamma = 5/3\). The density, velocity, and pressure are normalized by their immediate post-shock values \(\rho_2, u_2, P_2\). The plotted velocity \(u\) is relative to the center of the blastwave. Credit: Ryden and Pogge Fig 5.4.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Notice the steep rise in density as a function of radius.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Q&lt;/strong&gt;: What does this imply about the structure of a supernova remnant?&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;A&lt;/strong&gt;: most of the mass lies in a relatively thin outer layer. About 50% of the remnants mass likes within the outermost 17% of its volume.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The Sedov-Taylor phase lasts until the amount of energy radiated by the cooling gas is comparable to the initial energy \(E\). Because free-free cooling at very high temperatures is not &lt;em&gt;that&lt;/em&gt; effective, the gas won&amp;rsquo;t cool that much until \(T_2\), the temperature in the hot dense region just behind the shock, drops below \(2 \times 10^7\) K, at which point collisionally excited line radiation will take over.&lt;/p&gt;
&lt;p&gt;This gives us a handle on understanding supernova remnants that have ages between 1,000 yr and 40,000 yr. Once radiative losses become important at \(t \sim t_\mathrm{cool}\), a dense shell forms behind the shock (as seen in the following figure), and the supernova remnant enters the &lt;strong&gt;snowplow phase&lt;/strong&gt;.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/Fig5-5.jpg"
 alt="An expanding supernova remnant makes the transition from the Sedov-Taylor (blastwave) phase to the snowplow phase. Credit: Ryden and Pogge Fig 5.5, Following Shu 1992."&gt;&lt;figcaption&gt;
 &lt;p&gt;An expanding supernova remnant makes the transition from the Sedov-Taylor (blastwave) phase to the snowplow phase. Credit: Ryden and Pogge Fig 5.5, Following Shu 1992.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="snowplow-phase"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/28-winds-sne-grb/#snowplow-phase"&gt;Snowplow phase&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;In the snowplow phase, the dense shell contains most of the mass of the SNe remnant, so its mass at time \(t \sim t_\mathrm{cool}\) is
$$
M_\mathrm{sh} \sim \frac{4 \pi}{3} \rho_1 r_\mathrm{cool}^3 \sim 1600\,M_\odot
$$&lt;/p&gt;
&lt;p&gt;This phase is called the snowplow phase because, just as a snowplow does, it scoops up additional mass as it expands.&lt;/p&gt;
&lt;p&gt;In this phase, most of the mass is in the dense shell. The inner region still contains gas at very high temperatures. Even though it&amp;rsquo;s low density, this gas has high pressure, which helps push the dense shell outward. So, this is called the &lt;strong&gt;pressure-assisted snowplow phase&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s briefly analyze the forces at play. Let \(P_i\) be the mean pressure within the hot, low-density interior of the SNe remnant. This gas cannot cool effectively, so its pressure only drops through adiabatic expansion
$$
P V^\gamma = \mathrm{const}
$$
meaning the pressure drops as
$$
P_i \propto V^{-\gamma} \propto r_\mathrm{sh}^{-3 \gamma}.
$$
Positing that there exists a power-law solution, and using conservation of momentum, we can approximate the expansion of the shell in this phase as
$$
r_\mathrm{sh}(t) = r_\mathrm{cool} \left ( \frac{t}{t_\mathrm{cool}} \right)^{2/7}
$$
(order of magnitude 58 pc in this phase).&lt;/p&gt;
&lt;p&gt;And expansion speed
$$
u_\mathrm{sh} = \frac{2}{7} \frac{r_\mathrm{cool}}{t_\mathrm{cool}} \left ( \frac{t}{t_\mathrm{cool}} \right)^{-5/7}
$$
of order 16 km/s at 1 Myr.&lt;/p&gt;
&lt;p&gt;Eventually, the expansion speed drops to the sound speed of the surrounding medium, and the shell degenerates into an expanding sound wave, of order 35 Myr after the SNe, and shell radius 160 pc.&lt;/p&gt;
&lt;p&gt;So, hopefully this combination of the free-expansion phase, the Sedov-Taylor phase, and the snowplow phase has provided a schematic for understanding how the hot SNe blow hot, low-density bubbles surrounded by dense shells. Which, over timescales of 35 Myr, grow to be 300 pc in diameter. These bubbles from SNe remnants are the origin of the hot ionized medium.&lt;/p&gt;</description></item><item><title>Fluids and Shocks</title><link>https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/</link><pubDate>Thu, 04 Nov 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Nov+10/1_v4bhqjbe"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch 5.1&lt;/li&gt;
&lt;li&gt;Draine Ch 34, 35, 36&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Hot Ionized Medium&lt;/strong&gt; occupies about half the volume of the ISM, but, because it is low-density, it is only a few percent by mass.&lt;/p&gt;
&lt;p&gt;The HIM is hot because it has been shock-heated by supernovae explosions, which means it consists of a set of bubbles. For example, we live in the Local Bubble, which is 50 pc in radius and is thought to have been created by one or more SNe that went off 2 Myr ago.&lt;/p&gt;
&lt;p&gt;The gas in the local bubble has \(T \sim 10^6\) K and density \(n_H \sim 0.004\;\mathrm{cm}^{-3}\).&lt;/p&gt;
&lt;p&gt;The bubble does have cooler and denser regions within itself, of course. The Sun is near the edge of the Local Interstellar Cloud, which is 10 pc across and has properties consistent with the warm ionized medium (WIM, \(T\sim 8000\) K, \(n_H \sim 0.2\;\mathrm{cm}^{-3}\)).&lt;/p&gt;
&lt;p&gt;In this lecture, we&amp;rsquo;re going to dig into how shock fronts (in particular, those from SNe) can heat the gas of the ISM.&lt;/p&gt;




&lt;h2 id="shock-front"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#shock-front"&gt;Shock front&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;A &lt;strong&gt;shock front&lt;/strong&gt; is a thin boundary between two regions of gas with very different densities, pressures, and bulk velocities. It will naturally tend to form from a sound wave travelling through gas.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/Fig5-1.jpg"
 alt="The steepening of a sound wave to form a shock. At Time 3, the sound wave is unable to be triple-valued; instead it develops a nearly discontinuous jump in pressure. Credit: Ryden and Pogge Fig 5.1."&gt;&lt;figcaption&gt;
 &lt;p&gt;The steepening of a sound wave to form a shock. At Time 3, the sound wave is unable to be triple-valued; instead it develops a nearly discontinuous jump in pressure. Credit: Ryden and Pogge Fig 5.1.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;For an adiabatic process, like the propagation of a low-amplitude sound wave, the equation of state is given by
$$
P(\rho) = P_0 \left ( \frac{\rho}{\rho_0} \right)^\gamma.
$$
The adiabatic index \(\gamma\) is 5/3 for a monatomic gas and 7/5 for a cool gas of diatomic molecules like \(H_2\).&lt;/p&gt;
&lt;p&gt;The speed of propagation is
$$
c_s = \left ( \frac{\gamma P}{\rho} \right)^{1/2} \propto \rho^{(\gamma - 1)/2}.
$$
As long as \(\gamma &amp;gt; 1\), sound will travel faster in a denser gas.&lt;/p&gt;
&lt;p&gt;Sound waves are pressure and density waves. So the crest of the wave, where the density and pressure are highest, travels faster than the trough of the wave. When the crest catches up with the trough, the gradient in density between the two steepens.&lt;/p&gt;
&lt;p&gt;In an ocean wave, this leads to the wave breaking. But since a sound wave doesn&amp;rsquo;t have the extra dimension&amp;mdash;at any one position there can only be one density and one pressure. So a sound wave steepens until a shock front forms.&lt;/p&gt;




&lt;h3 id="shock-fronts-on-earth"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#shock-fronts-on-earth"&gt;Shock fronts on earth&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let the dimensionless amplitude of a sound wave be
$$
\delta \equiv \frac{P_\mathrm{crest} - P_\mathrm{trough}}{P_\mathrm{crest} + P_\mathrm{trough}}
$$
then the distance it will travel before steepening into a shock will be \(d \sim \lambda/\delta\), where \(\lambda\) is the wavelength.&lt;/p&gt;
&lt;p&gt;Why do we not hear shock waves all the time from ordinary sounds? First, the amplitude of these fall off as the inverse square of distance, so they attenuate quickly. And, even a loud ambulance siren at the distance of 3m has a sound wave of amplitude \(\delta \sim 2 \times 10^{-4}\). But that signal would also need to travel about 2 km before steeping into a shock front.&lt;/p&gt;
&lt;p&gt;High amplitude pressure fluctuations with \(\delta \sim 1\), like those from stun grenades and lightning bolts, will rapidly steepen into shocks.&lt;/p&gt;




&lt;h2 id="shock-physics"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#shock-physics"&gt;Shock physics&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s examine a plane parallel steady-state shock, where \(u\) is the speed&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/Fig5-2.jpg"
 alt="Geometry of a plane parallel steady-state shock."&gt;&lt;figcaption&gt;
 &lt;p&gt;Geometry of a plane parallel steady-state shock.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;We need to obey mass conservation
$$
\frac{d}{dx} (\rho u) = 0
$$
and momentum conservation
$$
\frac{d}{dx}(\rho u^2 + P) = 0
$$
and energy conservation
$$
\frac{d}{dx} \left ( u \rho [u^2/2 + \epsilon] + u P\right) = 0
$$
where \(\epsilon\) is the specific internal energy of the gas, including thermal energy and rotational or vibrational energy of the molecules. In general,
$$
\epsilon = \frac{1}{\gamma - 1} \frac{kT}{m} = \frac{1}{\gamma - 1}\frac{P}{\rho}
$$
where \(m\) is the mass per particle. [Here we&amp;rsquo;re assuming the gas doesn&amp;rsquo;t lose energy by radiating photons, the shock is non-radiative.]&lt;/p&gt;
&lt;p&gt;By applying these conservation laws across the shock front, we end up with the &lt;strong&gt;Rankine-Hugoniot&lt;/strong&gt; conditions:&lt;/p&gt;
&lt;p&gt;$$
\rho_1 u_1 = \rho_2 u_2
$$
and
$$
\rho_1 u_1^2 + P_1 = \rho_2 u_2^2 + P_2
$$
and
$$
u_1 \rho_1 [u_1^2/2 + \epsilon_1] + u_1 P_1 = u_2 \rho_2 [u_2/2 + \epsilon_2] + u_2 P_2.
$$&lt;/p&gt;
&lt;p&gt;The &lt;strong&gt;Mach number&lt;/strong&gt; is a dimensionless number that is useful for describing shock fronts
$$
\mathcal{M} \equiv \frac{u_1}{c_1} = \left ( \frac{\rho_1 u_1^2}{\gamma P_1} \right)^{1/2}
$$
where \(c_1\) is the sound speed in the pre-shocked gas.&lt;/p&gt;
&lt;p&gt;We can use the Rankine-Huginoit conditions to solve for the density, pressure, and temperature ratios across the shock front:
$$
\frac{\rho_2}{\rho_1} = \frac{u_1}{u_2}
$$
and
$$
\frac{P_2}{P_1} = \frac{2 \gamma \mathcal{M}_1^2 - (\gamma - 1)}{\gamma + 1}
$$
and
$$
\frac{T_2}{T_1} = \frac{[(\gamma - 1) \mathcal{M}_1^2 + 2][2 \gamma \mathcal{M}_1^2 - (\gamma - 1)]}{(\gamma + 1)^2 \mathcal{M}_1^2}
$$&lt;/p&gt;
&lt;p&gt;When post-shock density, pressure, and temperature values are greater than their pre-shock values, that is equivalent to having a Mach number greater than 1. A &lt;strong&gt;strong shock&lt;/strong&gt; is when \(\mathcal{M} \gg 1\). In this regime,
$$
\frac{\rho_2}{\rho_1} \approx \frac{\gamma + 1}{\gamma - 1}
$$
and
$$
P_2 \approx \frac{2}{\gamma + 1} \rho_1 u_1^2
$$
and
$$
T_2 \approx \frac{2(\gamma -1 )}{(\gamma + 1)^2} \frac{m}{k} u_1^2.
$$
In the strong-shock regime, the density ratio has a finite value, which depends on the adiabatic index. For a monoatomic gas with \(\gamma = 5/3\), the post-shock gas can be at most four times the density of the pre-shock gas.&lt;/p&gt;
&lt;p&gt;Shocks can produce very high densities and temperatures, though. An interstellar shock front with propagation speed \(u_1 \sim 1000\,\mathrm{km}\,\mathrm{s}^{-1}\) produces shock-heated gas with
$$
T_2 \approx 1.1 \times 10^7\,\mathrm{K} \left ( \frac{u_1}{1000\,\mathrm{km}\,\mathrm{s}^{-1}} \right)^2.
$$&lt;/p&gt;
&lt;p&gt;In the frame of the shock, shock-fronts convert supersonic gas into subsonic gas. They increase the density, pressure, and temperature while decreasing the bulk velocity relative to the shock front. Shocks act to generate &lt;em&gt;entropy&lt;/em&gt;, with the increase in specific entropy \(s_2 - s_1 \propto \ln \mathcal{M}_1\) for strong shocks.&lt;/p&gt;




&lt;h2 id="radiating-and-cooling"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#radiating-and-cooling"&gt;Radiating and Cooling&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The hot shocked gas is out of equilibrium and starts cooling. So a shock is followed by a &lt;em&gt;radiative zone&lt;/em&gt; where the shock-heated gas is cooling by emitting photons.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/Fig5-3.jpg"
 alt="The structure of a plane parallel radiative shock. Credit: Ryden and Pogge Fig 5.3."&gt;&lt;figcaption&gt;
 &lt;p&gt;The structure of a plane parallel radiative shock. Credit: Ryden and Pogge Fig 5.3.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="strong-fast-hot-shocks"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#strong-fast-hot-shocks"&gt;Strong, fast, hot shocks&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;For temperatures \(T &amp;gt; 2 \times 10^7\,\mathrm{K}\), the cooling is dominated by free-free (Bremsstrahlung) emission. Assuming fully ionized hydrogen gas, we get a free-free cooling time of
$$
t_\mathrm{cool} \approx 24\,\mathrm{Myr}\,\left ( \frac{T}{10^7\,\mathrm{K}} \right)^{1/2} \left ( \frac{n_\mathrm{H}}{1\,\mathrm{cm}^{-3}} \right)^{-1}
$$
We can recast the post-shock temperature in terms of the shock velocity, and find that in a cooling time, the gas will move a distance
$$
R_\mathrm{cool} \sim u_2 t_\mathrm{cool} \sim \frac{u_1}{4} t_\mathrm{cool} \sim 6\,\mathrm{kpc} \left( \frac{u_1}{1000\,\mathrm{km}\,\mathrm{s}^{-1}} \right)^2 \left ( \frac{n_\mathrm{H}}{1\,\mathrm{cm}^{-3}} \right)^{-1}
$$.&lt;/p&gt;
&lt;p&gt;This is the approximate thickness of the radiative zone for a strong shock with high temperatures. This is a long distance, especially compared to the scale height of interstellar gas in our galaxy. So, the hot gas produced by high-speed shocks &lt;em&gt;doesn&amp;rsquo;t have enough time to cool&lt;/em&gt; before the shock runs out of gas to shock.&lt;/p&gt;




&lt;h3 id="slower-cooler-shocks"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#slower-cooler-shocks"&gt;Slower, cooler shocks&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;At slightly lower temperatures, \(10^5\,\mathrm{K} &amp;lt; T &amp;lt; 2\times10^7\,\mathrm{K}\), which correspond to shock speeds of \(90\,\mathrm{km}\,\mathrm{s}^{-1} &amp;lt; u_1 &amp;lt; 1300\,\mathrm{km}\,\mathrm{s}^{-1}\), collisionally-excited line emission does the cooling.&lt;/p&gt;
&lt;p&gt;This results in a cooling time of
$$
t_\mathrm{cool} \approx 0.10 \,\mathrm{Myr}\, \left(\frac{u_1}{300\,\mathrm{km}\,\mathrm{s}^{-1}} \right)^{3.46} \left ( \frac{n_\mathrm{H}}{1\,\mathrm{cm}^{-3}} \right)^{-1}.
$$
And we can calculate the thickness of the radiative zone as
$$
R_\mathrm{cool} \sim \frac{u_1}{4} t_\mathrm{cool} \sim 8\,\mathrm{pc} \left ( \frac{u_1}{300\,\mathrm{km}\,\mathrm{s}^{-1}} \right)^{4.46} \left( \frac{n_\mathrm{H}}{1\,\mathrm{cm}^{-3}} \right)^{-1}
$$.
These shorter time and length scales mean that radiative cooling is more effective at changing the structure of slower shocks.&lt;/p&gt;




&lt;h3 id="isothermal-shocks"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/27-fluid-dynamics/#isothermal-shocks"&gt;Isothermal shocks&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Often, it&amp;rsquo;s the case that the thermal equilibrium temperature \(T_3\) that the gas reaches after passing through the radiative zone is nearly equal to the initial temperature \(T_1\). This is called an &lt;strong&gt;isothermal shock&lt;/strong&gt;, and we can re-solve the Rankine-Hugoniot conditions to find that the increase in density is
$$
\frac{\rho_3}{\rho_1} = \left ( \frac{u_1}{c_1} \right)^2 = \mathcal{M}_T^2
$$
where \(\mathcal{M}_T\) is the isothermal Mach number of the shock. So, radiative shocks can reach arbitrarily high compression as their Mach number approaches infinity.&lt;/p&gt;</description></item><item><title>Molecular Clouds, Observations, Chemistry, and Ionization</title><link>https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/</link><pubDate>Mon, 08 Nov 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Nov+8/1_s6az9bt5"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 32, 33&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 7&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="taxonomy-and-astronomy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#taxonomy-and-astronomy"&gt;Taxonomy and Astronomy&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;First, let&amp;rsquo;s try to bring some order to the various scales and densities at which molecular hydrogen can be found in our Galaxy.&lt;/p&gt;
&lt;p&gt;In general, individual clouds are separated into categories based upon their &lt;em&gt;optical&lt;/em&gt; appearance: diffuse, translucent, or dark, depending on the magnitudes of visual extinction \(A_V\) through the cloud.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Diffuse molecular cloud&lt;/strong&gt;: \(A_V &amp;lt; 1 \) mag&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Translucent molecular cloud&lt;/strong&gt;: \(1 &amp;lt; A_V &amp;lt; 5 \) mag&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Dark cloud&lt;/strong&gt;: \(5 &amp;lt; A_V &amp;lt; 20 \) mag&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Infrared Dark cloud&lt;/strong&gt;: 20 up to \(&amp;gt; 100 \) mag&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Diffuse&lt;/strong&gt; and &lt;strong&gt;translucent&lt;/strong&gt; clouds have sufficient UV radiation to keep gas-phase carbon photoionized throughout the cloud. These types of clouds are usually pressure-confined.&lt;/p&gt;
&lt;p&gt;The typical &lt;strong&gt;dark cloud&lt;/strong&gt; is self-gravitating, and some contain regions with \(A_V\) up to 20 mag.&lt;/p&gt;
&lt;p&gt;There are some examples of dark clouds that are opaque to background PAHs even at 8 microns wavelengths, (indicating \(A_V &amp;gt; 100\) mag) these are called &lt;strong&gt;infrared dark clouds&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;These terms describe the surface density of the cloud in terms of visual extinction. But, molecular clouds are not a one-parameter family, and there are other terms to describe cloud properties (not always uniformly employed)!&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;We can distinguish a &lt;strong&gt;giant molecular cloud (GMC)&lt;/strong&gt; from a &lt;strong&gt;dark cloud&lt;/strong&gt; based on total mass.&lt;/li&gt;
&lt;li&gt;Groups of distinct clouds are called &lt;strong&gt;cloud complexes&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;Structures within a cloud (self-gravitating) are called &lt;strong&gt;clumps&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Clumps may or may not be forming stars. If they are, they are called &lt;strong&gt;star-forming clumps&lt;/strong&gt;.
&lt;strong&gt;Cores&lt;/strong&gt; are density peaks within star-forming clumps that will form a &lt;em&gt;single star&lt;/em&gt; (or a binary star).&lt;/p&gt;
&lt;p&gt;Molecular clouds are sometimes found in isolation, but in most cases they are grouped together into &lt;strong&gt;complexes&lt;/strong&gt;. Large complexes hav substructure, though, so the distinction between &amp;ldquo;cloud&amp;rdquo; and &amp;ldquo;cloud complex&amp;rdquo; is somewhat arbitrary. Observers use intensities, radial velocities, and thermal emission from dust to delineate boundaries.&lt;/p&gt;
&lt;p&gt;Draine Table 32.3 has a nice summary of the terminology used for cloud complexes and their components.&lt;/p&gt;
&lt;p&gt;Most molecular mass is found in large clouds known as &amp;ldquo;giant molecular clouds&amp;rdquo; or GMCs, which have masses ranging from \(10^3\,M_\odot\) to \(\sim 2 \times 10^5\,M_\odot\).&lt;/p&gt;
&lt;p&gt;A &lt;strong&gt;GMC complex&lt;/strong&gt; is a gravitationally-bound group of GMCs (as well as smaller clouds) with total mass \(&amp;gt; 10^{5.3}\,M_\odot\). The nearest GMC complex is the Orion Molecular Cloud (OMC) complex. Here is a map of the structure&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/fig_32_1.jpg"
 alt="Schematic showing the boundaries of molecular clouds in the Orion region. Three GMCs form the Orion GMC complex: Orion A, B, and the Northern Filament. Credit: Draine Figure 32.1, after Maddalena et al. 1986."&gt;&lt;figcaption&gt;
 &lt;p&gt;Schematic showing the boundaries of molecular clouds in the Orion region. Three GMCs form the Orion GMC complex: Orion A, B, and the Northern Filament. Credit: Draine Figure 32.1, after Maddalena et al. 1986.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/dame.jpg"
 alt="Locations of prominent molecular clouds along the Milky Way. Dame et al. 2001"&gt;&lt;figcaption&gt;
 &lt;p&gt;Locations of prominent molecular clouds along the Milky Way. Dame et al. 2001&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="star-counts"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#star-counts"&gt;Star counts&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Molecular clouds were originally discovered by star counts. First, Herschel noticed in 1785 that there were patches along the Milky Way where very few stars were seen, but he incorrectly attributed this to an actual underdensity.&lt;/p&gt;
&lt;p&gt;Now, we know that this is from the obscuration by dusty clouds.&lt;/p&gt;
&lt;p&gt;Modern studies including J, H, K to optical surveys can now provide a very good map of dusty. In fact, coupling stellar measurements with spectral models, and in some cases distances, allows us to make a 3D map of the dust distribution in our Galaxy.&lt;/p&gt;




&lt;h2 id="molecular-radio-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#molecular-radio-lines"&gt;Molecular radio lines&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Most common way to study molecular gas is through the \(J = 1 \rightarrow 0\) transition of carbon monoxide. The CO luminosity of a cloud is approximately proportional to the total mass of the cloud.&lt;/p&gt;
&lt;p&gt;Additionally, using velocity information + assumed Galactic rotation curve (and conversion from CO to molecular Hydrogen) allows us to infer the surface density of molecular hydrogen over the Milky Way disk.&lt;/p&gt;




&lt;h2 id="gamma-rays"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#gamma-rays"&gt;Gamma rays&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The ISM is actually an appreciable source of gamma rays.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;cosmic rays colliding with interstellar gas (pion decay and bremsstrahlung)&lt;/li&gt;
&lt;li&gt;inverse Compton&lt;/li&gt;
&lt;li&gt;pair annihilation&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="size-linewidth-relation-in-molecular-clouds"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#size-linewidth-relation-in-molecular-clouds"&gt;Size-Linewidth Relation in Molecular Clouds&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;See this &lt;a href="https://astrobites.org/2012/11/18/astrophysical-classics-larsons-laws/"&gt;astrobite&lt;/a&gt; by Adele Plunkett.&lt;/p&gt;
&lt;p&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/1981MNRAS.194..809L/abstract"&gt;Larson 1981&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Power-laws describing molecular clouds.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Turbulence: velocity dispersion is proportional to cloud size&lt;/li&gt;
&lt;li&gt;Gravity: velocity dispersion is proportional to cloud mass&lt;/li&gt;
&lt;li&gt;Density: cloud size is inversely proportional to density&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;For larger clouds, turbulent velocities exceed local sound speed, meaning that the turbulent motions are supersonic.&lt;/p&gt;
&lt;p&gt;The density peaks within a molecular cloud are self-gravitating. If we consider only the kinetic energy associated with fluid motions (neglect magnetic fields and pressure), we can estimate the clump mass using the virial theorem.&lt;/p&gt;
&lt;p&gt;With modern data, laws still hold up reasonably well.&lt;/p&gt;




&lt;h2 id="magnetic-fields-is-molecular-clouds"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#magnetic-fields-is-molecular-clouds"&gt;Magnetic Fields is Molecular Clouds&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Virial estimate of cloud mass assumed that it was turbulence alone that was supporting the cloud. But, strong evidence from Zeeman 21-cm observations that there is significant B field in H I gas.&lt;/p&gt;




&lt;h2 id="energy-dissipation-in-molecular-clouds"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#energy-dissipation-in-molecular-clouds"&gt;Energy dissipation in molecular clouds&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;For clouds bigger than 0.02 pc, turbulent motions are supersonic. Strongly supersonic for clouds larger than 1 pc.&lt;/p&gt;
&lt;p&gt;Ongoing research area.&lt;/p&gt;
&lt;p&gt;Shocked gas would quickly cool and then radiate away kinetic energy, and the clouds would collapse quickly.&lt;/p&gt;
&lt;p&gt;So what is the source of additional energy that is injected into the turbulent motions? Outflows from protostars could be one possibility.&lt;/p&gt;
&lt;p&gt;Another is that the time scale for shock waves to damp is longer than expected (but not backed up by simulations).&lt;/p&gt;




&lt;h1 id="molecular-clouds-chemistry-and-ionization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/26-molecular-clouds/#molecular-clouds-chemistry-and-ionization"&gt;Molecular Clouds: Chemistry and Ionization&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;In the Milky Way, about 22% of interstellar gas is in molecular clouds, and most of the hydrogen there is in the form of \(H_2\).&lt;/p&gt;
&lt;p&gt;Cosmic Rays guarantee that there are always &lt;em&gt;some&lt;/em&gt; ions present in the gas, and in the outer layers of molecular clouds there may be enough UV radiation to photoionize metal species.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s walk through the various types of reactions that can be important&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;photoionization&lt;/strong&gt;:
$$
AB + h \nu \rightarrow AB^+ + e^-
$$
Many molecules can be photoionized w/ photons with \(h \nu &amp;lt; 13.6\) eV (but not \(H_2\), which has an ionization potential of 15.43). Thus, molecular hydrogen will not be photoionized even in H I regions.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;photodissociation&lt;/strong&gt;:
$$
AB + h \nu \rightarrow AB^* \rightarrow A + B
$$
For species like \(H_2\) and CO, photoexcitation leading to dissociation occurs via &lt;em&gt;lines&lt;/em&gt; rather than continuum. This key fact allows these species to self-shield, or be partially shielded (like CO) if there is accidental overlap between important absorption lines with strong lines of \(H_2\).&lt;/li&gt;
&lt;li&gt;Neutral-neutral exchange reactions
$$
AB + C \rightarrow AC + B
$$
In a molecular cloud, most species are neutral, and so neutral-neutral collisions are frequent. Frequently, there will be an energy barrier that must be overcome for a reaction to proceed, i.e., the ABC complex must pass through an intermediate stat that has higher energy level than the initial AB + C, even for exothermic reactions. Interestingly, this isn&amp;rsquo;t the case for CO formation, which has a negligible energy barrier and can form at the low temperatures of molecular clouds
$$
C + OH \rightarrow CO + H
$$&lt;/li&gt;
&lt;li&gt;Ion-neutral exchange reactions
$$
AB^+ + C \rightarrow AC^+ + B
$$
Exothermic reactions of this type generally lack energy barriers, so the reaction can proceed even at low temperatures. And, the induced dipole interaction boosts the rate somewhat.&lt;/li&gt;
&lt;li&gt;Radiative association reactions (as we discussed in the context of molecular hydrogen). Basically, the excited complex needs to emit a photon before the complex flies apart on the timescale of a single vibrational period.&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Molecular Hydrogen</title><link>https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/</link><pubDate>Sat, 23 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Nov+1/1_7tqdibo1"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch. 7.4&lt;/li&gt;
&lt;li&gt;Draine Ch. 31&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In this lecture, we&amp;rsquo;re going to take a deep dive into molecular hydrogen, which will set us up for the next lecture on molecular clouds, observation, chemistry, and ionization.&lt;/p&gt;




&lt;h2 id="formation-mechanisms"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#formation-mechanisms"&gt;Formation Mechanisms&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="direct"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#direct"&gt;Direct&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Why does molecular hydrogen exist in the ISM at all? The direct formation
$$
H + H \rightarrow H_2 + h \nu
$$
has such a small rate coefficient that the reaction can be ignored completely in ISM chemistry. The three-body reaction
$$
3H \rightarrow H_2 + H + KE
$$
can occur, but the rate is also negligible at interstellar or intergalactic densities. (This is possible at the densities of a protostar or protoplanetary disk, however).&lt;/p&gt;
&lt;p&gt;The process by which two hydrogen atoms bond in the ISM is called &lt;strong&gt;direct radiative association&lt;/strong&gt;
$$
H + H \rightarrow H_2^* \rightarrow H_2 + h \nu.
$$&lt;/p&gt;
&lt;p&gt;Two hydrogen atoms meet to create an excited hydrogen molecule \(H_2^*\) that is unbound; it needs to emit a photon with enough energy to leave it in a bound state, otherwise it will break apart.&lt;/p&gt;
&lt;p&gt;The lifetime of the excited hydrogen molecule (until it breaks apart) is exceptionally short, approximately one vibration period \(\sim 2 \pi / \omega_0 \sim 10^{-14}\) s.&lt;/p&gt;
&lt;p&gt;Because the hydrogen molecule has no electric dipole moment, the probability that it will emit a photon is small. One possibility is via rotational quadrupole transitions, which have \(A_{ul} \sim 10^{-11}\) s. So to form a hydrogen molecule via this, you multiple the probability to emit during the lifetime of the excited hydrogen molecule, and get a probability of
$$
p \sim A_{ul} (2 \pi / \omega_0) \sim 10^{-25}
$$
of emitting a photon before it falls apart. This leads to a miniscule rate coefficient \(k_\mathrm{dra} 10^{-23}\;\mathrm{cm}\;\mathrm{s}^{-1}\), and therefore at densities typical of the interstellar medium.&lt;/p&gt;




&lt;h3 id="pure-gas-not-that-fast"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#pure-gas-not-that-fast"&gt;Pure gas (not that fast)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Terminology:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;recombination&lt;/strong&gt;: when a free electron binds to a positive ion&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;attachment&lt;/strong&gt;: when a free electron binds to a neutral atom or negative ion&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;ionization&lt;/strong&gt;: when an electron is stripped from a neutral atom or positive ion&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;detachment&lt;/strong&gt;: when an electron is stripped from a negative ion&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In pure gas, \(H_2\) is made via a two-step process:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Produce negative hydrogen ions by radiative attachment&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;$$
H^0 + e^- \rightarrow H^- + h \nu
$$&lt;/p&gt;
&lt;p&gt;This is a slow process (not as slow as direct radiative association, though), with rate coefficient \(k_\mathrm{ra} \approx 10^{-16}\).&lt;/p&gt;
&lt;p&gt;The next step (is more rapid) is the production of molecular hydrogen by associative detachment
$$
H^- + H^0 \rightarrow H_2 + e^-
$$
this step has a rate coefficient of \(k_\mathrm{ad} \approx 10^{-9}\,\mathrm{cm}^3\,\mathrm{s}^{-1}).&lt;/p&gt;
&lt;p&gt;This two-step process is hindered by the fact that the intermediate product \(H^-\) is a very fragile ion, it only takes 0.77 eV to detach one of the electrons. In most of interstellar space, the photodetachment rate of \(H^-\) is
$$
\zeta_\mathrm{det} \approx 10^{-7}\; \mathrm{s}^{-1} \approx 8.5 \;\mathrm{yr}^{-1}.
$$&lt;/p&gt;
&lt;p&gt;So, associative detachment (and the resulting production of hydrogen molecules) will dominate over photodetachment when
$$
n_\mathrm{H\,I} &amp;gt; \frac{\zeta_\mathrm{det}}{k_\mathrm{ad}} \approx 70\,\mathrm{cm}^3
$$
That&amp;rsquo;s pretty dense, so in most of the ISM, an \(H^-\) ion will undergo photodetachment before it has the opportunity to make an \(H_2\) molecule.&lt;/p&gt;
&lt;p&gt;Without any dust, though, this is dominant channel of molecular hydrogen formation (such as in the early universe).&lt;/p&gt;




&lt;h3 id="grain-assisted-grain-catalysis"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#grain-assisted-grain-catalysis"&gt;Grain assisted (grain catalysis)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Whenever dust grains are present, they form the dominant mechanism for making molecular hydrogen, called grain catalysis.&lt;/p&gt;
&lt;p&gt;Compared to low-density interstellar gas, the surface of a dust grain is a frenzied lab of chemical activity. Let&amp;rsquo;s unpack why.&lt;/p&gt;
&lt;p&gt;Terminology&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;adsorption&lt;/strong&gt;: atoms sticking to the surface of a grain&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;desoprtion&lt;/strong&gt;: atoms (or molecules) escaping from the surface of a grain&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;When a hydrogen atom collides with a grain, there is some probability that it will stick, with the modulation that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;slower atoms are more likely to stick&lt;/li&gt;
&lt;li&gt;hot grains are less sticky&lt;/li&gt;
&lt;li&gt;smaller grains are less sticky&lt;/li&gt;
&lt;li&gt;some dependence on composition&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For typical molecular cloud conditions, the probability of sticking (adsorption) is about 30% for a 0.1 micron grain. That&amp;rsquo;s pretty high!&lt;/p&gt;
&lt;p&gt;Once the hydrogen atom is stuck to the surface of the grain, it&amp;rsquo;s held there by the relatively weak van der Waals force. The binding energy is large enough such that the atom doesn&amp;rsquo;t immediately evaporate from the grain&amp;rsquo;s surface, however, it&amp;rsquo;s also small enough such that the atom can skitter its way across the grain surface in a thermally driven random walk. This is called surface diffusion.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/Fig7-4.jpg"
 alt="Atoms adsorb onto the surface of a grain, diffuse until they meet another atom, and sometimes form a molecule. Credit: Ryden and Pogge Figure 7.4, following Dulieu et al. 2013"&gt;&lt;figcaption&gt;
 &lt;p&gt;Atoms adsorb onto the surface of a grain, diffuse until they meet another atom, and sometimes form a molecule. Credit: Ryden and Pogge Figure 7.4, following Dulieu et al. 2013&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If, during the random walk, the atom encounters another H atom, they can bond to form a hydrogen molecule. The energy released by this reaction will allow the new molecule to escape from the surface (desorption).&lt;/p&gt;
&lt;p&gt;To calculate the formation rate, we need to define a grain cross section, assume some grain size distribution, and average over the velocity distribution of hydrogen atoms (thermalized), and then incorporate a probability that it sticks to the dust grain.&lt;/p&gt;
&lt;p&gt;Carrying out these calculations using state of the art assumptions leads to the conclusion that dust grains are converting atomic hydrogen into molecular hydrogen on a &lt;em&gt;characteristic timescale of 13 Myr&lt;/em&gt;.&lt;/p&gt;




&lt;h2 id="destroying-molecular-hydrogen"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#destroying-molecular-hydrogen"&gt;Destroying Molecular Hydrogen&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Photodissociation is the main way by which molecular hydrogen is destroyed. Because we are dealing with a hydrogen molecule, though, the path is a bit more complicated than you would otherwise expect.&lt;/p&gt;
&lt;p&gt;First, let&amp;rsquo;s look at the potential of the molecule.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/Fig7-1.jpg"
 alt="We can look at the potential of the \(\mathrm{H}_2\) molecule as a function of \(r\), the distance between the two protons. The bound vibrational energy levels \(v = 0 \rightarrow 13\) are depicted as horizontal lines. Credit: Ryden and Pogge Fig 7.1, Data from Sharp 1971"&gt;&lt;figcaption&gt;
 &lt;p&gt;We can look at the potential of the \(\mathrm{H}_2\) molecule as a function of \(r\), the distance between the two protons. The bound vibrational energy levels \(v = 0 \rightarrow 13\) are depicted as horizontal lines. Credit: Ryden and Pogge Fig 7.1, Data from Sharp 1971&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In theory, we could just hit a hydrogen molecule with a photon with energy \(h \nu &amp;gt; 4.5 \)eV, lifting it to an unbound potential energy. The problem is that this transition is an electric quadrupole transition, which has a small transition probability.&lt;/p&gt;
&lt;p&gt;The main process by which molecular hydrogen is dissociated is a two-step process. Let&amp;rsquo;s look at the potential energy curve for the ground electronic state of molecular hydrogen, as well as for the first and second excited electronic states.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/Fig7-5.jpg"
 alt="Schematic diagram of the potential energy curves for the ground electronic states and the first and second excited electronic states of molecular hydrogen. Credit: Ryden and Pogge 7.5, data from Sharp 1971."&gt;&lt;figcaption&gt;
 &lt;p&gt;Schematic diagram of the potential energy curves for the ground electronic states and the first and second excited electronic states of molecular hydrogen. Credit: Ryden and Pogge 7.5, data from Sharp 1971.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;Difference between ground state, \(v = 0, J=0\) and first electronic excited state \(v = 0, J=0\) is 11.18 eV, slightly higher than the 10.2 eV Lyman alpha line. This is a permitted transition with a large transition probability.&lt;/li&gt;
&lt;li&gt;Once in the first excited state, the transitions between various vibrational and rotational in this state and the various vibrational and rotational states in the electronic ground state are called &lt;em&gt;Lyman band&lt;/em&gt;.&lt;/li&gt;
&lt;li&gt;The transitions between the second electronic state and ground electronic state are called &lt;em&gt;Werner band&lt;/em&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Both Lyman and Werner band bands lie in energy range 11.18 to 13.60 eV.&lt;/p&gt;
&lt;p&gt;Once in the first electronic state, it can transition back down to the ground electronic state. But, about 12% of the time, it can also transition to a vibrational state with \(v &amp;gt; 14\), which is unbound, the hydrogen atoms fly apart and dissociation is complete.&lt;/p&gt;




&lt;h2 id="relevant-timescales-and-densities"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#relevant-timescales-and-densities"&gt;Relevant timescales and densities&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;The first step, radiative excitation of \(\mathrm{H}_2\) to an excited electronic state, requires a photon with energy in the wavelength range 912 to 1108 angstroms. In the solar neighborhood, these photons are produced by hot stars.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Because of the Lyman and Werner levels, the cross section for a hydrogen molecule is a complicated function of energy. But, using the energy density of the solar neighborhood and incorporating the dissociation fraction, the rate of dissociation is
$$
\zeta_\mathrm{dis} \approx 4.2 \times 10^{-11}\,\mathrm{s}^{-1}.
$$&lt;/p&gt;
&lt;p&gt;So the typical timescale for photodissociation of molecular hydrogen is \(1/\zeta_\mathrm{dis}\), or about \(10^3\) yr, about 4 orders of magnitude quicker than the \(10^7\) yr timescale for creation of molecular hydrogen by grain surface catalysis.&lt;/p&gt;
&lt;p&gt;The equilibrium abundance of molecular hydrogen is determined by a balance between the &lt;em&gt;slow&lt;/em&gt; creation of \(\mathrm{H}_2\) on the surface of grains and the &lt;em&gt;quick&lt;/em&gt; destruction of molecular hydrogen by UV photons from stars. If you work out the equilibrium densities and assume that the UV radiation field inside molecular clouds were the same as outside of it, you find that molecular hydrogen should be pretty rare, only dominating over atomic hydrogen when the density was greater than&lt;/p&gt;
&lt;p&gt;$$
n_\mathrm{H} \sim 10^6\,\mathrm{cm}^{-3}
$$&lt;/p&gt;
&lt;p&gt;But, observations tell us that molecular hydrogen formation dominates over atomic hydrogen at densities as low as \(n_\mathrm{H} \sim 300\,\mathrm{cm}^{-3}\). Why?&lt;/p&gt;
&lt;p&gt;This is because of &lt;strong&gt;self-shielding&lt;/strong&gt;, whereby UV photons in the range \(h \nu = 11.18 \rightarrow 13.60\) eV are absorbed by hydrogen molecules in the outer layer of the cloud, preventing them from reaching hydrogen molecules in the cloud center.
(Remember, these &amp;ldquo;shield&amp;rdquo; hydrogen molecules aren&amp;rsquo;t immediately dissociated&amp;hellip; the dissociation only occurs some fraction of the time when the decay reaction is down to a \(v &amp;gt; 14\) state, which is unbound).&lt;/p&gt;
&lt;p&gt;Self-shielding becomes important when you have molecular hydrogen column density greater than \(10^{14}\,\mathrm{cm}^{-2}\). The interiors of molecular clouds are mostly saved from photodissociation by a self-sacrificing layer of gas about 0.1 parsec thick, and with density \(n_\mathrm{H} \sim 300\,\mathrm{cm}^{-3}\).&lt;/p&gt;
&lt;p&gt;Things change, though, when newly formed stars are of spectral types O, B, or A and are hot enough to pour out photons with energies \(h \nu &amp;gt; 11.18\) eV, which form photodissociation regions around the hot stars.&lt;/p&gt;




&lt;h2 id="photodissociation-regions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/#photodissociation-regions"&gt;Photodissociation regions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Have an obvious parallel to H II regions, and in fact you can have a complicated structure of ionized, neutral atoms, and molecular hydrogen.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/Fig7-6.jpg"
 alt="Schematic diagram of a photodissociation region. The shielded molecular gas is on the left, the source of UV photons is on the right. The numbers at the top of the diagram represent the column density of hydrogen measured from the ionization front. Credit: Ryden and Pogge Fig 7.6"&gt;&lt;figcaption&gt;
 &lt;p&gt;Schematic diagram of a photodissociation region. The shielded molecular gas is on the left, the source of UV photons is on the right. The numbers at the top of the diagram represent the column density of hydrogen measured from the ionization front. Credit: Ryden and Pogge Fig 7.6&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://webbtelescope.org/contents/media/images/2021/024/01F5KKSNNM6YWR7MNMDJ2NEBCN?news=true"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/25-molecular-hydrogen/anatomy_of_h.png"
 alt="This graphic depicts the stratified nature of a photodissociation region (PDR) such as the Orion Bar. Once thought to be homogenous areas of warm gas and dust, PDRs are now known to contain complex structure and four distinct zones. The box at the left shows a portion of the Orion Bar within the Orion Nebula. The box at the top right illustrates a massive star-forming region whose blasts of ultraviolet radiation are affecting a PDR. The box at the bottom right zooms in on a PDR to depict its four, distinct zones: 1) the molecular zone, a cold and dense region where the gas is in the form of molecules and where stars could form; 2) the dissociation front, where the molecules break apart into atoms as the temperature rises; 3) the ionization front, where the gas is stripped of electrons, becoming ionized, as the temperature increases dramatically; and 4) the fully ionized flow of gas into a region of atomic, ionized hydrogen. For the first time, Webb will be able to separate and study these different zones&amp;rsquo; physical conditions. Credits: ILLUSTRATION: NASA, ESA, CSA, Jason Champion (CNRS), Pam Jeffries (STScI), PDRs4ALL ERS Team"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;This graphic depicts the stratified nature of a photodissociation region (PDR) such as the Orion Bar. Once thought to be homogenous areas of warm gas and dust, PDRs are now known to contain complex structure and four distinct zones. The box at the left shows a portion of the Orion Bar within the Orion Nebula. The box at the top right illustrates a massive star-forming region whose blasts of ultraviolet radiation are affecting a PDR. The box at the bottom right zooms in on a PDR to depict its four, distinct zones: 1) the molecular zone, a cold and dense region where the gas is in the form of molecules and where stars could form; 2) the dissociation front, where the molecules break apart into atoms as the temperature rises; 3) the ionization front, where the gas is stripped of electrons, becoming ionized, as the temperature increases dramatically; and 4) the fully ionized flow of gas into a region of atomic, ionized hydrogen. For the first time, Webb will be able to separate and study these different zones&amp;rsquo; physical conditions. Credits: ILLUSTRATION: NASA, ESA, CSA, Jason Champion (CNRS), Pam Jeffries (STScI), PDRs4ALL ERS Team&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
</description></item><item><title>HI Clouds: Observations, Heating and Cooling</title><link>https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/</link><pubDate>Mon, 25 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+29/1_vfna4zcb"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 29, 30&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch 1.3 and 1.4&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="h-i-clouds-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#h-i-clouds-observations"&gt;H I Clouds: Observations&lt;/a&gt;
&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;60% of gas in Milky Way is in H I regions (predominantly atomic H).&lt;/li&gt;
&lt;li&gt;Gas can be surveyed by 21-cm line (emission and absorption), absorption lines in spectra of stars, and infrared emission from dust mixed w/ hydrogen&lt;/li&gt;
&lt;li&gt;magnetic field in gas visualized through starlight polarized by aligned dust grains and in some cases Zeeman effect&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="h-i-21-cm-line-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#h-i-21-cm-line-observations"&gt;H I 21-cm line observations&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We covered a lot of this material when talking about excitation and atomic levels, but the following is a more observational picture.&lt;/p&gt;
&lt;p&gt;One of the important pieces we derived from that lecture is that &lt;em&gt;as long as 21-cm is optically thin&lt;/em&gt;, a measurement of the emission is a &lt;em&gt;direct&lt;/em&gt; measure of the total column density of H I.&lt;/p&gt;
&lt;p&gt;We also saw how if we have a background quasar, we could determine the spin temperature, which is normally close to the kinetic temperature of the gas.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s look at the following figure&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/fig_29_1.jpg"
 alt="Left panels: observed H I emission (OFF) and absorption (ON, towards quasar). Lower right: spin temperature as a function of LSR velocity. Tick marks of 0, 1, 2, 4 kpc show LSR velocity expected for a gas at those distances, assuming a Galactic rotation curve. Upper right: column density per velocity bin under different assumptions regarding the relative foreground/background locations of cold absorbing gas and warm gas seen only in emission. Credit: Draine 29.1, Originally Dickey et al. 1978"&gt;&lt;figcaption&gt;
 &lt;p&gt;Left panels: observed H I emission (OFF) and absorption (ON, towards quasar). Lower right: spin temperature as a function of LSR velocity. Tick marks of 0, 1, 2, 4 kpc show LSR velocity expected for a gas at those distances, assuming a Galactic rotation curve. Upper right: column density per velocity bin under different assumptions regarding the relative foreground/background locations of cold absorbing gas and warm gas seen only in emission. Credit: Draine 29.1, Originally Dickey et al. 1978&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Surveys like these support the idea that (in the solar neighborhood), interstellar H I is found primarily in two distinct phases: cold neutral medium (cold absorbing gas) and warm neutral medium (warm gas seen only in emission).&lt;/p&gt;




&lt;h2 id="distribution-of-the-h-i"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#distribution-of-the-h-i"&gt;Distribution of the H I&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Differential rotation of gas in the Galactic disk means that regions of the Galaxy at different distances from the Sun will have different radial velocities. Therefore the 21 cm intensity vs. radial velocity chart can be used to map out distribution of H I in our Galaxy.&lt;/p&gt;




&lt;h2 id="zeeman-effect"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#zeeman-effect"&gt;Zeeman effect&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;For the weak (5 microGauss) fields in the diffuse ISM, the Zeeman effect results in a small frequency shift between the left and right circularly polarized 21-cm emission. Observations are in general difficult.&lt;/p&gt;




&lt;h2 id="infrared-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#infrared-emission"&gt;Infrared emission&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Dust is a tracer of the ISM because H I gas is dusty. In the dust chapter, we talked about the correlation between 100 micron emission and H I column density, as measured from 21-cm observations.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/fig_29_4.jpg"
 alt="Credit: Draine Fig 29.4, originally Arendt et al. 1998."&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Fig 29.4, originally Arendt et al. 1998.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The good correlation tells us that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;dust and gas are well-mixed&lt;/li&gt;
&lt;li&gt;starlight heating the dust must be fairly uniform&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="heating-and-cooling-in-h-i-clouds"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#heating-and-cooling-in-h-i-clouds"&gt;Heating and cooling in H I clouds&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Most of the interstellar gas in the Milky Way is neutral, and 78% of the neutral hydrogen is atomic (not molecular).&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s examine the heating and cooling processes of H I so that we can estimate what temperatures we&amp;rsquo;d expect it to be at. We will find that, under some circumstances, the ISM can be thought of as two distinct phases in pressure equilibrium.&lt;/p&gt;




&lt;h2 id="heating"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#heating"&gt;Heating&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Heating mechanisms include&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;ionization by CRs&lt;/li&gt;
&lt;li&gt;Photoionization of H and He by x-rays&lt;/li&gt;
&lt;li&gt;Photoionization of dust grains by starlight UV&lt;/li&gt;
&lt;li&gt;Photoionization of metals like C, Mg, Si, Fe, etc. by starlight UV&lt;/li&gt;
&lt;li&gt;Heating by shock waves and other MHD phenomena&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="photoelectric-heating-by-dust"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#photoelectric-heating-by-dust"&gt;Photoelectric heating by dust&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Photoelectrons emitted by dust grains dominate the heating of diffuse H I in the Milky Way. Large neutral carbonaceous grains can in principle be photoionized by photons with energies down to 4.5 eV.&lt;/p&gt;
&lt;p&gt;Small grains account for most of the UV absorption and photoelectric yields are enhanced for small grains, so therefore the photoelectric heating rate is dominated by photoelectrons from very small grains, including the PAHs.&lt;/p&gt;




&lt;h2 id="cooling-via-c-ii-158-microns-o-i-63-microns-and-other-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#cooling-via-c-ii-158-microns-o-i-63-microns-and-other-lines"&gt;Cooling via [C II] 158 microns, [O I] 63 microns, and other lines&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;For temperatures above 10^4 K, the cooling is dominated by two fine structure lines [C II] 158 microns, [O I] 63 microns.&lt;/p&gt;
&lt;p&gt;The critical densities for these lines are high, implying that collisional deexcitation is unimportant in the diffuse ISM of the Milky way.&lt;/p&gt;




&lt;h2 id="two-phases-for-h-i-in-the-ism"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/#two-phases-for-h-i-in-the-ism"&gt;Two phases for H I in the ISM&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now let&amp;rsquo;s investigate what thermal equilibrium looks like in an H I region. As before, we require a balance between heating and cooling
$$
\Gamma(T_\mathrm{eq}) = \Lambda(T_\mathrm{eq}).
$$&lt;/p&gt;
&lt;p&gt;First, let&amp;rsquo;s examine how the steady state temperature \(T_\mathrm{eq}\) varies as a function of density \(n_H\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/fig_30_2a.jpg"
 alt="The steady state temperature as a function of density, for gas heated by cosmic rays and photoelectric heating by dust grains. Two lines of constant \(n_H T\) are shown. Credit: Draine Fig 30.2a"&gt;&lt;figcaption&gt;
 &lt;p&gt;The steady state temperature as a function of density, for gas heated by cosmic rays and photoelectric heating by dust grains. Two lines of constant \(n_H T\) are shown. Credit: Draine Fig 30.2a&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;As we see, the equilibrium temperature is a monotonically decreasing function of \(n_H \). This is because of the line-cooling mechanisms, which are throttled by how quickly atoms can be collisionally excited (to be line-radiated away).&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s examine the solution for equilibrium temperature when we fix the &lt;em&gt;pressure&lt;/em&gt;
$$
p = n k T.
$$
Fixing the pressure is saying that things are in &lt;em&gt;dynamic equilibrium&lt;/em&gt;, because if there was a pressure imbalance, then things would move to equalize pressure (the way wind blows from a high pressure system to a low pressure system on earth).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;We see that at &lt;em&gt;low pressures&lt;/em&gt; the heating balances cooling at 6000 K (which correspond to WNM conditions).&lt;/li&gt;
&lt;li&gt;We see that at &lt;em&gt;high pressure&lt;/em&gt; the heating balances cooling at 100 K (corresponding to CNM conditions).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;When you combine a heating gain \(\Gamma\) that is independent of gas temperature with a cooling function \(\Lambda(T)\) that is not a simple power law, you get multiple phases of the ISM in pressure equilibrium with each other.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/24-hi-heating-cooling/fig_30_2b.jpg"
 alt="The steady state temperature as a function of thermal pressure \(p\). For pressures in the range between the two dotted lines, there are three possible equilibria temperatures: a high temperature WNM solution, a low temperature CNM solution, and an intermediate temperature equilibrium that is unstable. Credit: Draine Fig 30.2b"&gt;&lt;figcaption&gt;
 &lt;p&gt;The steady state temperature as a function of thermal pressure \(p\). For pressures in the range between the two dotted lines, there are three possible equilibria temperatures: a high temperature WNM solution, a low temperature CNM solution, and an intermediate temperature equilibrium that is unstable. Credit: Draine Fig 30.2b&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;At intermediate pressures, for a given pressure, there are three possible solutions for equilibrium temperature. The upper and lower solutions are stable (if perturbed away from equilibrium temperature, it will return to it). But the intermediate solution is unstable:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;if perturbed upward, it will warm to the stable WNM setup&lt;/li&gt;
&lt;li&gt;if perturbed downward, it will cool to the stable CNM setup&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So, given our best estimates of cosmic ray ionization, photoelectric heating, and cooling processes in the diffuse ISM, we conclude that an ISM in thermal equilibrium and dynamic equilibrium (equal pressure) would have two distinct phases, provided the pressure is in between the two vertical lines (otherwise would just have one phase).&lt;/p&gt;
&lt;p&gt;This is called the &amp;ldquo;two-phase&amp;rdquo; ISM model, and was first developed by George Field in 1969. Later revisited a number of times with differing assumptions about grain photoelectric heating, grain-assisted recombination, inelastic collisional cross-sections for cooling processes, and coolant abundances. So, the &amp;ldquo;two-phase&amp;rdquo; model has a long and storied history in ISM study. McKee and Ostriker 1977 expanded this to the &amp;ldquo;three-phase&amp;rdquo; ISM, with the hot ionized medium, where bremsstrahlung is the chief cooling process.&lt;/p&gt;
&lt;p&gt;Observationally, the different phases will have different emission spectra.&lt;/p&gt;
&lt;p&gt;First, the dust emission at 100 microns will be the same for both phases, since the dust is heated by the same radiation field. But, the line emission will vary considerably. For the CNM, the strongest coolant is [C II] 150 microns. For the WMN, the strongest coolant is [O I] 63.2 microns.&lt;/p&gt;</description></item><item><title>Heating and Cooling of HII regions</title><link>https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/</link><pubDate>Mon, 25 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+27/1_rgnhel5n"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 27&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 4.3&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Now that we&amp;rsquo;ve taken some time to introduce and discuss dust, we&amp;rsquo;ll return to H II regions and discuss heating and cooling.&lt;/p&gt;
&lt;p&gt;H II regions are regions of photoionized gas around hot stars. Thus far, we&amp;rsquo;ve just asserted that the temperatures of these regions were around \(10^4\) K, and introduced nebular diagnostics to allow us to measure those temperatures. In today&amp;rsquo;s lecture, we&amp;rsquo;ll be discussing the heating and cooling mechanisms that regulate the temperature.&lt;/p&gt;
&lt;p&gt;The dominant heating mechanism is photoionization. Photoionizing photons have energies larger that the ionization energy, the resulting photoelectron will have nonzero kinetic energy and add to the thermal energy of the gas.&lt;/p&gt;
&lt;p&gt;Recombination processes, primarily radiative recombination, are removing electrons from the plasma (along with the kinetic energy they had). Thermal energy is also lost when an electron collisionally excites ions from lower to higher energy levels (followed by the emission of photons).&lt;/p&gt;
&lt;p&gt;The balance of these processes sets the temperature of the gas.&lt;/p&gt;




&lt;h2 id="heating-by-photoionization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#heating-by-photoionization"&gt;Heating by photoionization&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Photoelectric absorption creates a photoelectron with some kinetic energy
$$
X^{+r} + h \nu \rightarrow X^{+r+1} + e^- + KE
$$&lt;/p&gt;
&lt;p&gt;Let \(\sigma_{pe}(\nu)\) be the photoionization cross section for \(X^{+r}\) in its ground electronic state. Given some radiation field \(u_\nu\), how can we calculate the probability per unit time for photoionization? i.e., units of 1/s?&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s walk through how we set up an integral, using the cross section, the speed (c), and the energy density
$$
\zeta(X^{+r}) = \int_{\nu_0}^\infty \sigma_\mathrm{pe}(\nu) c \left [ \frac{u_\nu}{h \nu}\right] d \nu.
$$&lt;/p&gt;
&lt;p&gt;How much kinetic energy is injected into the gas? Well, \(h \nu_0\) is the threshold energy for photoionization from the ground state. Therefore, if a photon with \(h \nu\) does the photoionization, then \(h \nu - h \nu_0\) is injected into the plasma. We can calculate the &lt;strong&gt;heating rate&lt;/strong&gt; (ergs/cm^3) by calculating the kinetic energy released per volume per photoionization event as
$$
\Gamma_\mathrm{pe} = n(X^{+r}) \int_{\nu_0}^\infty \sigma_\mathrm{pe}(\nu) c \left [ \frac{u_\nu}{h \nu}\right] (h \nu - h \nu_0) d \nu.
$$&lt;/p&gt;
&lt;p&gt;The mean energy of each photoelectron is
$$
E_\mathrm{pe}(X^{+r}) = \frac{\Gamma_\mathrm{pe}}{n(X^{+r}) \zeta(X^{+r})}
$$
which depends on the spectrum of photons responsible for photoionization. If we assume that the star&amp;rsquo;s radiation field can be approximated as blackbody with some color temperature \(T_c\), then we can define the dimensionless ratio
$$
\varphi \equiv \frac{E_\mathrm{pe}}{k T_c}
$$
For different temperature stars, there will correction factors for photoelectron energy weighted by the blackbody spectrum \(\varphi_0 k T_c\) and the average photoelectron energy \(\langle \varphi \rangle k T_c\), but they are order unity across a wide range of stellar temperatures.&lt;/p&gt;
&lt;p&gt;The heating rate depends on the abundance of the species that is being ionized. If the H II region is in photoionization equilibrium, then
$$
\zeta(X^{+r}) n(X^{+r}) = \alpha n_e n(X^{+r+1}).
$$
If you recall, \(\alpha\) is the rate coefficient for recombination. This can be used to obtain the local heating rate
$$
\Gamma_{pe} = \alpha n_e n(X^{+r + 1}) \varphi k T_c.
$$
The rate per unit volume of injection of photoelectron energy rate is equal to the recombination rate per unit volumen times the mean photoelectron energy.&lt;/p&gt;
&lt;p&gt;In an H II region, the dominant element H will be nearly fully ionized, so \(n(H^+)\approx n_H\), and the rate due to photoionization of H is
$$
\Gamma_\mathrm{pe}( H \rightarrow H^+) \approx \alpha_B n_H n_e \varphi k T_c.
$$&lt;/p&gt;




&lt;h2 id="other-heating-processes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#other-heating-processes"&gt;Other heating processes&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Here are a few other heating processes that can contribute.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Photoelectric emission from dust: dust grains absorb photons, then produce energetic photoelectrons. This effect is important when the radiation field has lots of photons between 10 and 13.6 eV&amp;hellip; too weak to ionize H, but able to ionize a dust grain AND there is enough dust present to absorb photons.&lt;/li&gt;
&lt;li&gt;Cosmic Rays: interaction w/ bound electrons (ejecting energetic secondary electron) and transfer of KE to free electrons by elastic scattering, \(\Gamma_\mathrm{CR}\).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In normal H II regions, heating of the gas is dominated by photoionization of H and He, so we usually just refer to \(\Gamma \approx \Gamma_\mathrm{pe}\). The photoionization heating is dependent on the electron density, the ion density, nd the temperature (UV intensity) of star doing the ionizing.&lt;/p&gt;




&lt;h2 id="cooling-processes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#cooling-processes"&gt;Cooling processes&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider the balance between heating and cooling processes when the plasma is in ionization equilibrium&amp;mdash;every ionization event is balanced by a recombination event.&lt;/p&gt;
&lt;p&gt;The &lt;strong&gt;total cooling rate&lt;/strong&gt; is denoted by \(\Lambda\) and has units of (ergs/cm^3). We&amp;rsquo;ll examine some of the sub-contributions.&lt;/p&gt;




&lt;h3 id="recombination-radiation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#recombination-radiation"&gt;Recombination radiation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Every time an electron recombines with an ion, the plasma loses the kinetic energy of the recombining electron. The rate per unit volume at which thermal energy is lost is
$$
\Lambda_\mathrm{rr} = \alpha_B n_e n(H^+) \langle E_\mathrm{rr} \rangle
$$
where \(\langle E_\mathrm{rr} \rangle\) is the mean KE of the recombining electrons.&lt;/p&gt;




&lt;h3 id="free-free-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#free-free-emission"&gt;Free-Free Emission&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Free electrons scattering off of free ions produce free-free emission. If we&amp;rsquo;re in a pure H plasma at 10^4 K, then the free-free emission per recombining electron is
$$
\frac{\Lambda_\mathrm{ff}}{n_e n(H^+) \alpha_B} = 0.54 T_4^{0.37} k T.
$$
Where \(\Lambda_\mathrm{ff} \) is the power per unit volume given by (Ch 10.12). For T of 10^4, then radiative recombination and free-free emission together cause the plasma to lose about \(\sim k T\) of kinetic energy per Case-B recombination.&lt;/p&gt;




&lt;h3 id="collisionally-excited-line-radiation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#collisionally-excited-line-radiation"&gt;Collisionally-Excited Line Radiation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;In an H II region, most H will be ionized.&lt;/p&gt;
&lt;p&gt;If there is He or He^+, the energy of the first excited state is quite far above the ground state, so it&amp;rsquo;s rare that it will be collisionally excited for any relevant thermal plasma.&lt;/p&gt;
&lt;p&gt;But if there are heavier metals, like oxygen, then there may be ions like O I, O II, or O III that have energy levels that can be collisionally excited by electrons with kinetic energies of just a few eV (as we studied in the nebular diagnostics lecture). And then they will radiate via lines, and cool the gas.&lt;/p&gt;
&lt;p&gt;(If the collisional excitation is immediately followed by a collisional deexcitation, then the kinetic energy of the gas is unchanged.)&lt;/p&gt;
&lt;p&gt;Some of the principal collisionally excited cooling lines in H II regions are N II, O II, O III, Ne II, S II and S III.&lt;/p&gt;




&lt;h2 id="thermal-equilibrium"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/#thermal-equilibrium"&gt;Thermal equilibrium&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The gas will stabalize at a temperature where heating rate balances the total cooling rate
$$
\Gamma(T) = \Lambda(T).
$$&lt;/p&gt;
&lt;p&gt;When you investigate each of these processes in detail for an average H II region, you get a steady-state temperature of 8,000 K.&lt;/p&gt;
&lt;p&gt;If you fiddle with the metallicities, and thus the abundance of cooling species, the steady state temperature changes. In a low metallicity galaxy, for example, the equilibrium temperature rises to 15,000 K. But if you raise the heavy element abundances by a factor of three, such as in the central region of a mature spiral galaxy, then the H II region temperature is 5,000 K.&lt;/p&gt;
&lt;p&gt;&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/fig_27_1a.jpg"
 alt="Photoelectric heating function \(\Gamma_\mathrm{pe}\) and radiative cooling function \(\Lambda\) as function of gas temperature \(T\) in an H II region with Orion-like abundances and density \(n_H = 4000\) cm^-3. Heating and cooling balance at T approx 8050 K. Credit: Draine Fig 27.1a"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photoelectric heating function \(\Gamma_\mathrm{pe}\) and radiative cooling function \(\Lambda\) as function of gas temperature \(T\) in an H II region with Orion-like abundances and density \(n_H = 4000\) cm^-3. Heating and cooling balance at T approx 8050 K. Credit: Draine Fig 27.1a&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/fig_27_1b.jpg"
 alt="Contributions of the individual lines to the loss from collisional excitation. Credit: Draine Fig 27.1b"&gt;&lt;figcaption&gt;
 &lt;p&gt;Contributions of the individual lines to the loss from collisional excitation. Credit: Draine Fig 27.1b&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/fig_27_2a.jpg"
 alt="Photoelectric heating function \(\Gamma_\mathrm{pe}\) and radiative cooling function \(\Lambda\) as function of gas temperature \(T\) in an H II region with abundances that are only 10% that of Orion. Equilibrium occurs at 15,000 K. Credit: Draine Fig 27.2a"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photoelectric heating function \(\Gamma_\mathrm{pe}\) and radiative cooling function \(\Lambda\) as function of gas temperature \(T\) in an H II region with abundances that are only 10% that of Orion. Equilibrium occurs at 15,000 K. Credit: Draine Fig 27.2a&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/fig_27_2b.jpg"
 alt="Photoelectric heating function \(\Gamma_\mathrm{pe}\) and radiative cooling function \(\Lambda\) as function of gas temperature \(T\) in an H II region with abundances that are enhanced 3 times relative to Orion. Equilibrium occurs at 5,000 K. Credit: Draine Fig 27.2b"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photoelectric heating function \(\Gamma_\mathrm{pe}\) and radiative cooling function \(\Lambda\) as function of gas temperature \(T\) in an H II region with abundances that are enhanced 3 times relative to Orion. Equilibrium occurs at 5,000 K. Credit: Draine Fig 27.2b&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/23-hii-heating-cooling/fig_27_3.jpg"
 alt="Cooling function \(\Lambda(T)\) for different densities. The gas is assumed to have Orion-like abundances and ionization conditions. As the gas density is varied from low to high, the equilibrium temperature also varies from low to high because of collisional deexcitation of excited states. Credit: Draine Fig 27.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Cooling function \(\Lambda(T)\) for different densities. The gas is assumed to have Orion-like abundances and ionization conditions. As the gas density is varied from low to high, the equilibrium temperature also varies from low to high because of collisional deexcitation of excited states. Credit: Draine Fig 27.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;</description></item><item><title>Grain Temperatures, Physics, and Dynamics</title><link>https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/</link><pubDate>Mon, 11 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+25/1_8herzg4g"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 24, 25, 26&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We&amp;rsquo;ve defined temperature as an &amp;ldquo;internal energy.&amp;rdquo; For an ensemble of particles, this translated into a statement about their velocity distributions (Maxwellian). For a dust grain, a temperature is a measure of the internal energy present in its vibrational modes and low-lying electronic excitations.&lt;/p&gt;
&lt;p&gt;If a dust grain has an internal energy \(E\), then we can define the temperature as that such that for the grain
$$
\langle E_\mathrm{int} \rangle_T = E
$$
bascially, the temperature defines the average internal energy.&lt;/p&gt;
&lt;p&gt;When the internal energy is very small (similar to the level of the first excited state), the idea of temperature becomes difficult to grapple with. But, most grains will have temperatures larger than this, and we can use temperature as a reasonable concept.&lt;/p&gt;
&lt;p&gt;We can add or remove energy to a grain through absorption and emission of photons, or by inelastic collisions with atoms and molecules from the gas. (Grain-grain collisions can do this, but they are rare).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In a diffuse region, grain heating is dominated by absorption of starlight photons.&lt;/li&gt;
&lt;li&gt;In a dark region, grain heating is dominated by inelastic collisions.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The dust grains responsible for the bulk of observed extinction at optical wavelengths are grains with radii \(a &amp;gt; 0.03\,\mu m\) and are considered &amp;ldquo;classical&amp;rdquo; macroscopic grains. Absorption/emission of a single quantum photon does not appreciably change the total energy in vibrational or electronic excitations.&lt;/p&gt;
&lt;p&gt;There are also ultrasmall particles (down to large molecules) where quantum effects are important.&lt;/p&gt;




&lt;h2 id="radiative-heating"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#radiative-heating"&gt;Radiative heating&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;After an optical or UV photon is absorbed by a grain, an electron is raised into an excited electronic state. Sometimes the grain will luminesce, where the excited state decays radiatively and emits a photon equal to or less than the absorbed photon.&lt;/p&gt;
&lt;p&gt;But, more likely, the electronically excited state will deexcite nonradiatively, such that the energy goes into many vibrational modes as heat. We can calculate the rate of grain heating from absorption of radiation as&lt;/p&gt;
&lt;p&gt;$$
\frac{d E}{d t} = \int \frac{u_\nu d \nu}{h \nu} \times c \times h \nu \times Q_\mathrm{abs}(\nu) \pi a^2
$$&lt;/p&gt;
&lt;p&gt;i.e., the number density of photons in that frequency bin, photons moving at the speed of light with energy \(h \nu\), and the grain&amp;rsquo;s absorption cross section.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/fig_24_1.jpg"
 alt="Absorption efficiency divided by grain radius for spheres of amorphous silicate and graphite. Credit: Draine Figure 24.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Absorption efficiency divided by grain radius for spheres of amorphous silicate and graphite. Credit: Draine Figure 24.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="radiative-cooling"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#radiative-cooling"&gt;Radiative cooling&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Grains loose energy by infrared cooling, at a rate
$$
\frac{d E}{d t} = \int d \nu 4 \pi B_\nu(T_d) C_\mathrm{abs}(\nu) = 4 \pi a^2 \langle Q_\mathrm{abs} \rangle_{T_d} \sigma T_d^4
$$
where \(\sigma\) is the Stefan-Boltzmann constant and \(\langle Q_\mathrm{abs} \rangle\) is the emission efficiency averaged over some spectrum. We&amp;rsquo;ll use \(T_d\) for the Plank function of the dust temperature, and \(\star\) for the spectrum of radiation heating the grain.&lt;/p&gt;




&lt;h2 id="steady-state-grain-temperature"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#steady-state-grain-temperature"&gt;Steady-state grain temperature&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The steady state temperature \(T_{ss}\) can be calculated by the balance of cooling and heating
$$
4 \pi a^2 \langle Q_\mathrm{abs} \rangle_{T_{ss}} \sigma T^4_{ss} = \pi a^2 \langle Q_\mathrm{abs} \rangle_\star u_\star c
$$&lt;/p&gt;
&lt;p&gt;For a typical interstellar radiation field, and the absorption and emission properties of silicate and graphite, you get steady-state grain temperatures of about 15 - 20 Kelvin.&lt;/p&gt;




&lt;h2 id="ultrasmall-dust-grains"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#ultrasmall-dust-grains"&gt;Ultrasmall dust grains&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If the grain is very small, it doesn&amp;rsquo;t really make sense to define a temperature, since each photon absorption event dramatically spikes what we would otherwise define as temperature. Instead, we need to consider a probability distribution function of temperatures. But, small grains can reach high &amp;ldquo;temperatures&amp;rdquo; above 100 K, where they will radiate strongly in the PAH features at 7.7, 8.6, and 11.3 microns.&lt;/p&gt;




&lt;h2 id="infrared-emission-from-grains"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#infrared-emission-from-grains"&gt;Infrared emission from grains&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In a typical spiral galaxy, about 1/3 of the energy radiated by stars is absorbed by dust grains and then reemitted in the infrared. Though infrared emission is a quantum process, you can approximate it as a &amp;ldquo;thermal&amp;rdquo; approach,&lt;/p&gt;
&lt;p&gt;$$
j_\nu = \sum_i \int d a \frac{d n_i}{d a} \int d T \left (\frac{d P}{d T} \right)&lt;em&gt;{i,a} C&lt;/em&gt;\mathrm{abs} (\nu, i, a) B_\nu(T)
$$&lt;/p&gt;
&lt;p&gt;where you&amp;rsquo;re summing over the different types of grains.&lt;/p&gt;
&lt;p&gt;To calculate \(j_\nu\), you need&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;a grain model to provide the grain size distribution for each composition&lt;/li&gt;
&lt;li&gt;the absorption cross sections&lt;/li&gt;
&lt;li&gt;the temperature distribution functions (for large grains, the temperature distribution function is so narrow that you can approximate it via a delta function located at \(T_{ss}\))&lt;/li&gt;
&lt;/ul&gt;




&lt;h1 id="grain-physics-charging-and-sputtering"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#grain-physics-charging-and-sputtering"&gt;Grain Physics: Charging and Sputtering&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;Grains can be given some charge by collisions. If the grain is neutral, an approaching electron on ion will polarize the grain, attracting the projectile to the grain. This can improve the charging rate for very small grains.&lt;/p&gt;




&lt;h2 id="photoelectric-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#photoelectric-emission"&gt;Photoelectric emission&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;When an energetic photon is absorbed by the grain, excites an electron to a sufficiently high energy such that the electron escapes from the grain. When an electron is ejected this way, it&amp;rsquo;s called a &amp;ldquo;photo electron.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;The photoelectron yield \(Y_\mathrm{pe}(h \nu, a , U)\) is the probability that a photon of energy \(h \nu\) will result in a photoelectron. It appears that UV radiation in average starlight background is able to drive grains with radii \(a &amp;gt; 0.01 \mu m\) to positive potentials.&lt;/p&gt;




&lt;h2 id="sputtering"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#sputtering"&gt;Sputtering&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If the gas temperature is sufficiently high, then impinging atoms or ions will erode the grain, one atom at a time. This process is called sputtering. It also happens on the surfaces of moons in our solar system. In the ISM, sputtering rates can dictate grain lifetimes, for \(10^5\) yrs near a supernova remnant. In a galaxy cluster, \(10^7\) yr, which is short compared to the cluster age.&lt;/p&gt;




&lt;h1 id="grain-dynamics"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#grain-dynamics"&gt;Grain dynamics&lt;/a&gt;
&lt;/h1&gt;
&lt;p&gt;We&amp;rsquo;ll cover the list of forces and torques that act upon grains. These are what govern grain dynamics.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;grain motion can decouple the movement of dust from gas&lt;/li&gt;
&lt;li&gt;rapid motion can lead to grain destruction&lt;/li&gt;
&lt;li&gt;moving grains may help couple the neutral gas to the magnetic field where fractional ionization might be low (because grains can be charged)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="gas-drag"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#gas-drag"&gt;Gas drag&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Aerodynamic gas drag. There is a slowing down time, or &amp;ldquo;drag time&amp;rdquo; of
$$
\tau_\mathrm{drag} = \frac{M_\mathrm{gr} v}{F_\mathrm{drag}}.
$$
In diffuse clouds, gas drag can decelerate grains on relatively short timescales of only \(10^5\) yr.&lt;/p&gt;




&lt;h2 id="lorentz-force"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#lorentz-force"&gt;Lorentz force&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If grains have a velocity component transverse to the local magnetic field, then they will experience a Lorentz force.&lt;/p&gt;




&lt;h2 id="radiation-pressure-and-recoil-forces"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#radiation-pressure-and-recoil-forces"&gt;Radiation pressure and recoil forces&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The interstellar radiation field is, in general, quite anisotropic (I mean, look around). The radiation pressure force on a grain is
$$
F_\mathrm{rad} = \langle Q_\mathrm{pr} \rangle \pi a^2 \Delta u_\mathrm{rad}.
$$
Radiation pressure can drive grains through the gas with appreciable velocities.&lt;/p&gt;
&lt;p&gt;If the grain preferentially ejects atoms or electrons from the preferentially illuminated side of the grain, then the recoil can be large and accelerate the grain.&lt;/p&gt;




&lt;h2 id="poynting-robertson-effect"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#poynting-robertson-effect"&gt;Poynting-Robertson effect&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Special Relativistic effect. Consider a dust grain orbiting a star.&lt;/p&gt;
&lt;p&gt;Because of the abberation of starlight, in the instantaneous rest frame of the orbiting particle, the radiative flux from the star has a component in the direction antiparallel to the motion of the grain. The radiation acts to reduce the orbital momentum of the particle, and leads to an orbital decay that depends on the grain size.&lt;/p&gt;
&lt;p&gt;For micron sized particles, PR drag can induce orbital decay of less than a Gyr for particles up to 10 cm in size.&lt;/p&gt;




&lt;h2 id="suprathermal-rotation-of-large-grains"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#suprathermal-rotation-of-large-grains"&gt;Suprathermal rotation of large grains&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We would think classical grains would just be dominated by random collisions with gas atoms and you&amp;rsquo;d have the rotation temperature of the grain approximately equal to the gas temperature.&lt;/p&gt;
&lt;p&gt;However, because the ISM is &lt;em&gt;not&lt;/em&gt; in LTE, interstellar grains act as heat engines, meaning they can attain suprathermal rotation rates where \(T_\mathrm{rot} \gg T_\mathrm{gas}\). This can happen by a variety of processes that impart a systematic torque on the grain.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Formation of molecular hydrogen on the grain surface, followed by impulsive ejection of said \(H_2\).&lt;/li&gt;
&lt;li&gt;Emission of photoelectrons from grains exposed to UV radiation&lt;/li&gt;
&lt;li&gt;Irregular grain surfaces&lt;/li&gt;
&lt;li&gt;Radiative torques due to absorption and scattering of starlight by irregular grains&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="rotation-of-small-grains-microwave-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#rotation-of-small-grains-microwave-emission"&gt;Rotation of small grains: microwave emission&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;For the very smallest grains \(a \lesssim 0.001\,\mu m\), rotational dynamics is complex. Any one of the following processes has an important effect on the angular momentum&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;random collisions with neutral gas atoms&lt;/li&gt;
&lt;li&gt;collisions of neutral or negatively charged grains with positive ions&lt;/li&gt;
&lt;li&gt;fluctuating electric fields due to passing ions acting on the permanent electric dipole moment of the grain&lt;/li&gt;
&lt;li&gt;angular momentum deposition by absorbed starlight photons&lt;/li&gt;
&lt;li&gt;angular momentum loss to radiated IR photons&lt;/li&gt;
&lt;li&gt;angular momentum loss in electric dipole radiation from the spinning grain&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="alignment-of-interstellar-dust"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/22-grain-temp-dynamics/#alignment-of-interstellar-dust"&gt;Alignment of Interstellar dust&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Interstellar grains are observed to be systematically aligned with their short axis tending parallel to the local magnetic field. Still an open line of research, but the problem involves two separate problems of&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;align grain angular momentum with the magnetic field&lt;/li&gt;
&lt;li&gt;align nonspherical body of the grain with the grain angular momentum&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Telescope Proposals and Time Allocation Committees</title><link>https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/</link><pubDate>Thu, 07 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+15/1_l3tinekc"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Goal&lt;/strong&gt;: Discuss how the telescope proposal process works, using ALMA as a guide.&lt;/p&gt;
&lt;p&gt;This includes&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;the technical components to writing and submitting a proposal&lt;/li&gt;
&lt;li&gt;tips on developing and strengthening your scientific idea&lt;/li&gt;
&lt;li&gt;how your proposal is reviewed and graded&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;em&gt;Why write a telescope proposal&lt;/em&gt;? To get data to answer a scientific question. (Sometimes, the data needed to answer that scientific question might already exist in the archive. In that case, save the time writing a proposal and use the archival data to write your paper).&lt;/p&gt;
&lt;p&gt;To start, let&amp;rsquo;s walk through the example proposal provided on the course Canvas page (&lt;code&gt;Czekala_Proposal.pdf&lt;/code&gt;).&lt;/p&gt;




&lt;h2 id="lifecycle-of-a-proposal"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#lifecycle-of-a-proposal"&gt;Lifecycle of a proposal&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;This is my own proposal, and it may be helpful to walk through a timeline of the project. This is a Cycle 6 proposal.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;2018: January and February&lt;/strong&gt;: Start thinking seriously about ALMA proposals for the upcoming cycle based upon anticipated capabilities that have been hinted at throughout the previous year, through information provided at conferences and other venues, and brainstorming scientific questions that could be addressed with new data.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;2018: March ~15&lt;/strong&gt;: Cycle 8 &lt;strong&gt;call for proposals&lt;/strong&gt; announced. This is where the capabilities for the upcoming cycle are definitively announced, along with the official (up-to-date) technical documents, and the proposal deadline is set (usually in mid-late April).&lt;/li&gt;
&lt;li&gt;Speaking for myself, ~1 month is barely enough time to conceptualize, design, make figures for, edit, and submit one proposal even if &lt;em&gt;I am doing nothing else&lt;/em&gt;. So, if you are like me, either plan ahead &lt;em&gt;before&lt;/em&gt; the call for proposals is announced (relying upon documents from previous cycles), or truly clear your schedule of all other activities (most of the time this is impossible, hence the need to start early).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;2018 March - April ~20th&lt;/strong&gt;: Turn brainstormed ideas into concrete proposals, download and examine relevant archival data, make figures, draft text, contact collaborators, iterate on proposal (leaving &amp;gt; 1 week time for comments, preferably 2 weeks), submit proposal&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Summer 2018&lt;/strong&gt;: ALMA TAC &amp;ldquo;meets&amp;rdquo; and ranks proposals&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;August 2018&lt;/strong&gt;: Ranking results released. For the proposal we&amp;rsquo;re about to review, I learned that I had received a &amp;ldquo;B&amp;rdquo; ranking, which means that the proposal would be scheduled for the upcoming cycle, but with priority lower than the &amp;ldquo;A&amp;rdquo; ranked proposals.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;October 2018 - September 2019&lt;/strong&gt;: The observing window for Cycle 6. My scheduled program might be observed anytime in this window, provided it is 1) observable 2) weather is good 3) no highly ranked programs are ahead of me&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;October 3rd, 2018&lt;/strong&gt;: I get a notification that the program was observed, hooray! (For a &amp;ldquo;B&amp;rdquo;-ranked program, this wasn&amp;rsquo;t guaranteed).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;November 2018&lt;/strong&gt;: Calibrated data available for download from NAASC.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;November 2018 - February 2021&lt;/strong&gt;: Work on data analysis, writing, etc (among other things). [Proprietary period expired November 2019]. 25 months&amp;hellip;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;February 2021&lt;/strong&gt;: Submit paper to &lt;em&gt;Astrophysical Journal&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;March 2021&lt;/strong&gt;: (Mercifully quick) referee process and acceptance to &lt;em&gt;Astrophysical Journal&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Let&amp;rsquo;s now take a look at the proposal itself and the &lt;a href="https://ui.adsabs.harvard.edu/abs/2021ApJ...912....6C/abstract"&gt;paper&lt;/a&gt; that resulted, and discuss the components.&lt;/p&gt;




&lt;h3 id="components-of-an-alma-proposal"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#components-of-an-alma-proposal"&gt;Components of an ALMA proposal&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Title + Abstract&lt;/li&gt;
&lt;li&gt;Scientific Justification&lt;/li&gt;
&lt;li&gt;Technical Justification&lt;/li&gt;
&lt;li&gt;&amp;ldquo;Observing Tool&amp;rdquo; + correlator setup&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="discussion-questions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#discussion-questions"&gt;Discussion questions&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;For observationally-oriented projects, what is the &lt;em&gt;minimum&lt;/em&gt; time to go from proposal idea to getting data? What is a reasonable time to go from getting data to publication?&lt;/li&gt;
&lt;li&gt;What about the role of archival data in preparing a new proposal? Unpublished archival data (proprietary period has passed)?&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="what-happens-in-the-review-process"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#what-happens-in-the-review-process"&gt;What happens in the review process?&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://almascience.nrao.edu/news/alma-cycle-8-2021-proposal-review-process"&gt;Overview of the ALMA Cycle 8 proposal review process&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Cycle 8 had 1735 total proposals, of which 253 were selected to be observed as &amp;ldquo;high priority&amp;rdquo; (grade A + B). Over 1000 people participated in the review process.&lt;/p&gt;
&lt;p&gt;What is the observatory looking for? To do the &lt;strong&gt;best science&lt;/strong&gt; (as defined by the scientific community) with &lt;strong&gt;limited resources&lt;/strong&gt; (time).&lt;/p&gt;




&lt;h3 id="dual-anonymous-proposals"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#dual-anonymous-proposals"&gt;Dual anonymous proposals&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;In recent cycles, ALMA has followed a &lt;a href="https://almascience.nrao.edu/proposing/alma-proposal-review/dual-anonymous"&gt;dual-anonymous proposal strategy&lt;/a&gt;. This means that the proposers do not know the reviewers&amp;rsquo; identities, nor do the reviewers know the proposers&amp;rsquo; identities. This strategy is also employed by HST and JWST review panels.&lt;/p&gt;
&lt;p&gt;Dual-anonymous review processes are designed to insulate against many biases (prestige, gender, seniority, &amp;hellip;) that may affect the quality of a reviewer&amp;rsquo;s judgement. Speaking from my own experience from serving as a reviewer on dual-anonymous and non-anonymized panels, I very much prefer the dual-anonymous format. In panel discussions, it reduces &amp;ldquo;background noise&amp;rdquo; from the reputation (or lack thereof) of the proposer, removes many political dimensions, and forces the committee to focus solely on the scientific and technical merits of the proposal. In the end, this improves the ability of the observatory to do the best science with limited resources.&lt;/p&gt;




&lt;h3 id="what-are-you-asked-to-do-as-a-reviewer"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#what-are-you-asked-to-do-as-a-reviewer"&gt;What are you asked to do as a reviewer?&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The following is from the Cycle 8 &lt;a href="https://almascience.nrao.edu/proposing/alma-proposal-review/guidelines-for-reviewers"&gt;Guidelines for Reviewers&lt;/a&gt;:&lt;/p&gt;




&lt;h4 id="assessing-proposals"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#assessing-proposals"&gt;Assessing proposals&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;Reviewers should assess the scientific merit of the proposals to the best of their ability using the following criteria:
&lt;em&gt;The overall scientific merit of the proposed investigation and its potential contribution to the advancement of scientific knowledge.&lt;/em&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Does the proposal clearly indicate which important, outstanding questions will be addressed?&lt;/li&gt;
&lt;li&gt;Will the proposed observations have a high scientific impact on this particular field and address the specific science goals of the proposal? ALMA encourages reviewers to give full consideration to well-designed high-risk/high-impact proposals even if there is no guarantee of a positive outcome or definite detection.&lt;/li&gt;
&lt;li&gt;Does the proposal present a clear and appropriate data analysis plan?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;em&gt;The suitability of the observations to achieve the scientific goals&lt;/em&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Is the choice of target (or targets) clearly described and well justified?&lt;/li&gt;
&lt;li&gt;Are the requested signal-to-noise ratio, angular resolution, spectral setup, and u-v coverage sufficient to achieve the science goals?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In general, the scientific merit should be assessed solely on the content of the proposal, according to the above criteria. Proposals may contain references to published papers (including preprints) as per standard practice in the scientific literature. Consultation of those references should not, however, be required for a general understanding of the proposal.&lt;/p&gt;




&lt;h4 id="writing-reviews-to-the-pis"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#writing-reviews-to-the-pis"&gt;Writing reviews to the PIs&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;Clear and thoughtful reviews from reviewers can help PIs improve their proposed project and write stronger proposals in the future.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Summarize both the strengths and weaknesses of the proposal.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;A summary of both the strengths and weaknesses can help PIs understand what aspects of the project are strong, and which aspects need to be improved in any future proposal.
Reviews should focus on the major strengths and major weaknesses. Avoid giving the impression that a minor weakness was the cause of a poor ranking. Many proposals do not have obvious weaknesses but are just less compelling than others; in such a case, acknowledge that the considered proposal is good but that there were others that were more compelling.&lt;/p&gt;
&lt;p&gt;Take care to ensure that the strengths and weaknesses do not contradict each other.&lt;/p&gt;
&lt;ol start="2"&gt;
&lt;li&gt;Be objective.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Be as specific as possible when commenting on the proposal. Avoid generic statements that could apply to most proposals.
If necessary, provide references to support your critique.
All reviews should be impersonal, critiquing the proposal and not the proposal team. For example, do not write &amp;ldquo;The PI did not adequately describe recent observations of this source.&amp;rdquo;, but instead write &amp;ldquo;The proposal did not adequately describe recent observations of this source.”.&lt;/p&gt;
&lt;p&gt;Reviewers cannot be sure at the time of writing reviews whether the proposed observations will be scheduled for execution. The reviews should be phrased in such a way that they are sensible and meaningful regardless of the final outcome.&lt;/p&gt;
&lt;ol start="3"&gt;
&lt;li&gt;Be concise.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;It is not necessary to write a lengthy review. An informative review can be only a few sentences in length if it is concise and informative. But, please avoid writing only a single sentence that does not address specific strengths and weaknesses.&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;Be professional and constructive.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;It is never appropriate to write inflammatory or inappropriate comments, even if you think a proposal could be greatly improved.
Use complete sentences when writing your reviews. We understand that many reviewers are not native English speakers, but please try to use correct grammar, spelling, and punctuation.&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;li&gt;Be aware of unconscious bias.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;We all have biases and we need to make special efforts to review the proposals objectively. A discussion of unconscious bias is provided here.&lt;/p&gt;
&lt;ol start="6"&gt;
&lt;li&gt;Be anonymous.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Do not identify yourself in the reviews to the PIs. In case of distributed peer review, these reviews will not be checked and edited by the JAO. They will be sent verbatim to the PIs, and they will also be shared with other reviewers during Stage 2.
Do not spend time trying to guess who is the proposal team behind the proposal you are reviewing. Your review should be based solely on the scientific merit of the proposal.&lt;/p&gt;
&lt;ol start="7"&gt;
&lt;li&gt;Other best practices.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Do not summarize the proposal: The purpose of the review is to evaluate the scientific merits of the proposal, not to summarize it. While you may provide a concise overview of the proposal, it should not constitute the bulk of the reviews.
Do not include statements about scheduling feasibility. If there are any scheduling feasibility issues with the proposal, the JAO will address them directly with the PI.
Do not include explicit references to other proposals that you are reviewing, such as project codes.
Do not ask questions. A question is usually an indirect way to indicate there is a weakness in the proposal, but the weakness should be stated explicitly. For example, instead of &amp;ldquo;Why was a sample size of 10 chosen?&amp;rdquo; write &amp;ldquo;The proposal did not provide a strong justification for the sample size of 10.&amp;rdquo;
Do not use sarcasm or any insulting language.&lt;/p&gt;
&lt;ol start="8"&gt;
&lt;li&gt;Re-read your reviews and scientific rankings.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Once you have completed your assessments, re-read your reviews and ask how you would react if you received them. If you feel that the reviews would upset you, revise them.
Check to see if the strengths and weaknesses in the reviews are consistent with the scientific rankings. If not, consider revising the reviews or the rankings.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://almascience.nrao.edu/proposing/alma-proposal-review/guidelines-for-reviewers"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/Example_review.png"
 alt="An example of a decent (albeit short) review. From the ALMA Cycle 8 Reviewers Guidelines."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;An example of a decent (albeit short) review. From the ALMA Cycle 8 Reviewers Guidelines.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;For ALMA distributed TAC, you are asked to review 10 proposals for every proposal that you submit.&lt;/p&gt;
&lt;p&gt;For something like the JWST TAC, you might be asked to read about 70 proposals for which you are not conflicted. 30-40 of them will be as &amp;ldquo;primary&amp;rdquo; or &amp;ldquo;secondary&amp;rdquo; reviewer.&lt;/p&gt;
&lt;p&gt;There is also an ALMA review panel for proposals asking for more than 25 hours of time, which meets and discusses the proposals.&lt;/p&gt;




&lt;h2 id="tips-for-writing-good-proposals"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#tips-for-writing-good-proposals"&gt;Tips for writing good proposals&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Background on the topic, beginning broad and narrowing to the question addressed by the proposal&lt;/li&gt;
&lt;li&gt;Previous work on the proposal’s question (not necessarily your own!)&lt;/li&gt;
&lt;li&gt;How new progress can be made&lt;/li&gt;
&lt;li&gt;Proposed observations to address the proposal’s question&lt;/li&gt;
&lt;li&gt;Describe selection of targets and justify their number&lt;/li&gt;
&lt;li&gt;Describe the observations, the analysis of the data, and how they will be interpreted and used to answer the proposal’s question&lt;/li&gt;
&lt;li&gt;In the technical section, describe all details of the observations: instrument, filters, dispersers, apertures, exposure times, positions, dithering/mapping strategy, explanation of how exposure times selected (e.g., to achieve a certain SNR), total observing time, observing constraints (e.g., instrument orientation)&lt;/li&gt;
&lt;li&gt;Justify the telescope; why can’t observations be performed with smaller/other telescopes?&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="other-miscellaneous-thoughts"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#other-miscellaneous-thoughts"&gt;Other miscellaneous thoughts&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Good review committees will make sure that the strengths and weaknesses of the proposal are adequately discussed.&lt;/p&gt;
&lt;p&gt;Sometimes (especially for highly oversubscribed calls), there are very few weaknesses identified for a proposal. That doesn&amp;rsquo;t necessarily mean that your proposal should have been accepted over ones that were! In very competitive reviews, it&amp;rsquo;s common to just have proposals that were a more compelling use of the observatory&amp;rsquo;s limited resources.&lt;/p&gt;
&lt;p&gt;When writing a review, try to put yourself into the shoes of the proposer receiving the comments, and strive to write accurate and constructive comments. Avoid commentary about the individual (in a dual-anonymous review process, this trap is harder to fall into). Rather, focus on the content of the &lt;em&gt;proposal&lt;/em&gt; and the &lt;em&gt;scientific plan&lt;/em&gt;.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Reviews tend to be concise (remember, serving on a TAC is usually a &lt;em&gt;voluntary&lt;/em&gt; service to the community, and reviewers might need to write and review comments for over 40 proposals).&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="tac-review-assignment"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#tac-review-assignment"&gt;TAC Review Assignment&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;At the end of this lecture, you will be provided with a packed of 5 actual ALMA proposals that I have sourced from my generous colleagues.&lt;/p&gt;
&lt;p&gt;It is your job to review these as if you were serving on the ALMA time allocation committee.&lt;/p&gt;
&lt;p&gt;First, please read the &lt;a href="https://almascience.nrao.edu/proposing/alma-proposal-review/guidelines-for-reviewers"&gt;ALMA Guidelines for reviewers document&lt;/a&gt;. Then, &lt;em&gt;for each proposal&lt;/em&gt;, you will be expected to write a review at least 100 words in length, though it&amp;rsquo;s not a bad idea to aim for 150 - 200 words.&lt;/p&gt;
&lt;p&gt;These reviews are due Monday, November 1st, and are worth 5% of your final grade. In addition, you will be asked to grade each proposal on a numerical scale of 1 to 5 (1 being the best, 5 the worst.)&lt;/p&gt;
&lt;p&gt;I will also designate each student as the &amp;ldquo;primary&amp;rdquo; and &amp;ldquo;secondary&amp;rdquo; reviewer for a proposal. On Wednesday, November 3rd and Friday, November 5th, class will be devoted to a mock TAC discussion. The idea is that, as primary reviewer, you will lead the discussion about the proposal. As secondary reviewer, you will be asked to provide any additional thoughts that the primary reviewer may not have covered. The other TAC members are welcome to comment on any proposal, even if they are not primary or secondary reviewers.&lt;/p&gt;
&lt;p&gt;Then, as a TAC, we will decide on a collective ranking of the proposals. Once this is complete, we will examine how the proposals were actually ranked by the real ALMA TAC, and discuss.&lt;/p&gt;




&lt;h2 id="alma-proposal-final-project"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/20-telescope-proposal-tac/#alma-proposal-final-project"&gt;ALMA Proposal Final Project&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;After we complete the review process on Nov 5th, it might be a good time to start thinking about your final project, which will be the scientific justification component of an ALMA proposal (modeled on the Cycle 8 call for proposals). Information on this can be found in the &lt;a href="https://iancze.github.io/courses/astro542/syllabus/"&gt;course syllabus&lt;/a&gt;.&lt;/p&gt;</description></item><item><title>Scattering and Absorption by Small Particles</title><link>https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/</link><pubDate>Mon, 04 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+11/1_omcf7pzy"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 22&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 6.2&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="cross-sections-and-efficiency-factors"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#cross-sections-and-efficiency-factors"&gt;Cross sections and efficiency factors&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In analogy to our previous setups with scattering cross sections, which we wrote as \(\sigma\), we&amp;rsquo;ll define other cross sections that are useful for talking about grains and radiative transfer. At least in this chapter, Draine labels them by \(C\), and many of them have a dependence on wavelength. They still have units of area, \(\mathrm{cm}^2\).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;absorption cross section&lt;/strong&gt; \(C_\mathrm{abs}(\lambda)\)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;scattering cross section&lt;/strong&gt; \(C_\mathrm{sca}(\lambda)\)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;extinction cross section&lt;/strong&gt;
$$
C_\mathrm{ext}(\lambda) = C_\mathrm{abs}(\lambda) + C_\mathrm{sca}(\lambda)
$$&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="a-brief-note-about-radiative-transfer-with-scattering-included"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#a-brief-note-about-radiative-transfer-with-scattering-included"&gt;A brief note about radiative transfer with scattering included&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;For a population of identical dust grains with number density \(n\), the extinction cross section is related to the attenuation coefficient by the relation
$$
\kappa_\lambda = n C_\mathrm{ext}(\lambda).
$$&lt;/p&gt;
&lt;p&gt;If you remember back to our radiative transfer lecture, we neglected scattering in nearly all applications. Now, we&amp;rsquo;re including scattering for dust, and that means that radiative transfer gets &lt;strong&gt;hard&lt;/strong&gt;. We can still calculate the optical depth
$$
\tau_\nu = \int_0^s \kappa_\lambda d s,
$$
however, to calculate an emergent intensity we can&amp;rsquo;t just plug it into the integral form of the RTE like we did before. \(\boldsymbol{x}\) and \(\boldsymbol{n}\) are vectors. Here is an example including the sink and source terms due to scattering.&lt;/p&gt;
&lt;p&gt;The sink term is added simply by using \(\kappa_\mathrm{ext}\), which includes the attenuation due to scattering. The problem starts to get gnarly when we consider the &lt;em&gt;source&lt;/em&gt; term due to scattering. Basically, light can be scattered &lt;em&gt;into&lt;/em&gt; the ray under consideration (attenuating the ray from which it originated). The modified RTE looks like&lt;/p&gt;
&lt;p&gt;$$
\frac{d I}{d s}(x, n, \lambda) = - \kappa_\mathrm{ext}(x, \lambda) I(x, n, \lambda) + j_*(x, n, \lambda) + \kappa_\mathrm{sca}(x, \lambda) \int_{4 \pi} \Phi(n, n^\prime, x, \lambda) I(x, n^\prime, \lambda) d \Omega^\prime
$$&lt;/p&gt;
&lt;p&gt;where \(\Phi\) term is a scattering phase function, which describes the probability that a photon originally propagating in direction \(\boldsymbol{n}^\prime\) and scattered at position \(\boldsymbol{x}\) will have \(\boldsymbol{n}\) as its new propagation direction.&lt;/p&gt;
&lt;p&gt;This is called an &lt;em&gt;integro-differential&lt;/em&gt; equation. It sounds hard because it is, and all radiation fields in all positions in all directions (and for all wavelengths) are coupled! For more information on the complexities of radiative transfer including dust, see the review article &lt;a href="https://ui.adsabs.harvard.edu/abs/2013ARA%26A..51...63S/abstract"&gt;Steinacker ARA&amp;amp;A&lt;/a&gt;.&lt;/p&gt;




&lt;h3 id="back-to-terms"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#back-to-terms"&gt;Back to terms&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The &lt;strong&gt;albedo&lt;/strong&gt;
$$
\omega \equiv \frac{C_\mathrm{sca}(\lambda)}{C_\mathrm{abs}(\lambda) + C_\mathrm{sca}(\lambda)} = \frac{C_\mathrm{sca}(\lambda)}{C_\mathrm{ext}(\lambda)}.
$$
An albedo of 1 means that scattering dominates the extinction (shiny silver thing), an albedo of 0 means that absorption dominates (pure black).&lt;/p&gt;
&lt;p&gt;For a given direction of incidence to a fixed grain, you need two angles \(\theta, \phi\) to fully specify the scattering direction, since it can be in any direction in 3D space. However, if we are talking about spherical grains, or a large ensemble of randomly oriented grains, then we only need to talk about a single scattering angle \(\theta\). A value of 0 means that the light is scattered in the forward direction, a value of 180 means that the light is scattered backwards (back-scattered), and a value of 90 would have the light be scattered to the right, left, top, or bottom.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;We can write down a &lt;strong&gt;differential scattering cross section&lt;/strong&gt;
$$
\frac{d C_\mathrm{sca}(\theta)}{d \Omega}
$$
for incident unpolarized light to be scattered by an angle \(\theta\).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;We can calculate the &lt;strong&gt;mean value&lt;/strong&gt; of \(\cos \theta\) (scattering angle) by taking the average over all angles
$$
\langle \cos \theta \rangle = \frac{1}{C_\mathrm{sca}} \int_0^\pi \cos \theta \frac{d C_\mathrm{sca}(\theta)}{d \Omega} 2 \pi \sin \theta d \theta
$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;We can calculate the radiation pressure cross section
$$
C_\mathrm{pr}(\lambda) \equiv C_\mathrm{abs}(\lambda) + (1 - \langle \cos \theta \rangle C_\mathrm{sca}(\lambda)).
$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;And we also have the &lt;strong&gt;degree of polarization&lt;/strong&gt; for light scattered through an angle \(P(\theta)\).&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Frequently it&amp;rsquo;s convenient to work with a dimensionless cross section that is normalized to some characteristic area of the grain. For a spherical grain, the geometric grain cross section is a good one: \(\pi a^2\). Draine adopts the following convention for a non-spherical grain: create a sphere having the same volume \(V\) as the grain material (not including voids) and use the effective cross section of that sphere: \(\pi a_\mathrm{eff}^2\), where
$$
a_\mathrm{eff} = \left( \frac{3 V}{4 \pi} \right)^{1/3}.
$$
Then we have
$$
Q_\mathrm{sca} = \frac{C_\mathrm{sca}}{\pi a_\mathrm{eff}^2}
$$
$$
Q_\mathrm{abs} = \frac{C_\mathrm{abs}}{\pi a_\mathrm{eff}^2}
$$
and
$$
Q_\mathrm{ext} \equiv Q_\mathrm{sca} + Q_\mathrm{abs}
$$
and all of the \(Q\) terms are dimensionless.&lt;/p&gt;




&lt;h2 id="index-of-refraction"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#index-of-refraction"&gt;Index of refraction&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The index of refraction will indicate whether a dust grain is better at absorbing or scattering.&lt;/p&gt;
&lt;p&gt;This can be done with the complex dielectric function
$$
\epsilon(\omega) = \epsilon_r + i \epsilon_r
$$
or the refractive index \(m(\omega)\) where \(m = \sqrt{\epsilon}\).&lt;/p&gt;
&lt;p&gt;The index of refraction is a complex number with real and imaginary parts
$$
m = m_r + i m_i.
$$&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The real part \(m_r\) is called the refractive index and determines how much a beam of light is bent (or refracted)&lt;/li&gt;
&lt;li&gt;The imaginary part \(m_i\) is called the absorption index, since it determines how strongly the substance absorbs light.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Both \(m_r\) and \(m_i\) are real and non-negative.&lt;/p&gt;
&lt;p&gt;Consider an electromagnetic wave with field strength
$$
E \propto e^{i(kx - \omega t)}
$$
traveling through a material. The dispersion relationship between wavenumber and frequency will be
$$
k = m \frac{\omega}{c}.
$$&lt;/p&gt;
&lt;p&gt;If the imaginary part of \(m\) is greater than zero, then the power of the electromagnetic wave will decrease as it propagates through the material
$$
|E|^2 \propto e^{-2 \mathrm{Im}(m) \omega x/c}
$$
and thus the attenuation coefficient for that light will be
$$
\kappa(\omega) = 2 m_i \frac{\omega}{c} = \frac{4 \pi m_i}{\lambda_\mathrm{vac}}.
$$&lt;/p&gt;




&lt;h2 id="limit-that-grain-much-smaller-than-wavelength-of-light"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#limit-that-grain-much-smaller-than-wavelength-of-light"&gt;Limit that grain much smaller than wavelength of light&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Now we will consider the situation where the grain is much smaller than the wavelength of incident radiation. In such a situation, it&amp;rsquo;s as if the grain is subject to an incident applied electric field that is uniform in space&lt;/p&gt;
&lt;p&gt;The efficiency factors have the form&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;absorption&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;$$
Q_\mathrm{abs} = 4 \left ( \frac{2 \pi a }{\lambda} \right) \mathrm{Im} \left [ \frac{m^2 - 1}{m^2 + 2} \right].
$$&lt;/p&gt;
&lt;p&gt;which yields an absorption coefficient
$$
\kappa_\mathrm{abs} \propto n_\mathrm{gr} a^3.
$$
So, you can think of the absorption coefficient as being proportional to the fraction of the volume of space that is occupied by dust.&lt;/p&gt;
&lt;p&gt;For scattering, the efficiency factor is
$$
Q_\mathrm{sca} = \frac{8}{3} \left ( \frac{2 \pi a}{\lambda} \right)^4 \left | \frac{m^2 - 1}{m^2 + 2} \right |^2.
$$
or the relationship that
$$
\sigma_\mathrm{sca} \propto a^6 \lambda^{-4}
$$
which is characteristic of Rayleigh scattering.&lt;/p&gt;
&lt;p&gt;As long as there is any absorption at all, absorption will dominate over scattering in the limit that the grain radius goes to zero. E.g., you can see this by the fact that the optical/UV extinction curve has a shape much closer to \(\lambda^{-1}\) than \(\lambda^{-4}\).&lt;/p&gt;




&lt;h2 id="regime-where-sizes-are-comparable"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#regime-where-sizes-are-comparable"&gt;Regime where sizes are comparable&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Use Mie theory to calculate the absorption and scattering coefficients. Basically, the electric field inside and outside the sphere can be decomposed into spherical harmonics with appropriate radial functions, and then determine the coefficients by the boundary conditions that you have a plane wave at infinity and that you have continuity at the surface of the sphere. In modern applications, you would write/use a computer program to calculate these coefficients.&lt;/p&gt;




&lt;h2 id="regime-where-2-pi-a-gg-lambda"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/#regime-where-2-pi-a-gg-lambda"&gt;Regime where \(2 \pi a \gg \lambda\)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Might want to use Mie theory, but there are some useful approximations. As you take the limit of larger grains relative to the wavelength you get the strange result that
$$
Q_\mathrm{ext} \rightarrow 2.
$$
Now, if you remember the definition of \(Q_\mathrm{ext} \), this means that the extinction cross-section is &lt;em&gt;twice&lt;/em&gt; that of its geometric cross section. It&amp;rsquo;s as if you had a large opaque bowling ball, but with twice the cross-sectional area. This result is called the &amp;ldquo;extinction paradox.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;It comes about because of diffraction effects. According to &lt;strong&gt;Babinet&amp;rsquo;s principle&lt;/strong&gt; the scattering produced by an opaque object is the same as that produced by an aperture of the same size cut into an opaque screen. So the diffraction pattern of an opaque sphere of radius \(a\) is the same as the Airy pattern produced by a hole of radius \(a\) cut into a screen, and in the short-wavelength limit you get \(Q_\mathrm{abs} = 1\) and \(Q_\mathrm{sca} = 1\) so \(Q_\mathrm{ext} = 2\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/19-scattering-absorption/fig_22_6.jpg"
 alt="Values of the dimensionless extinction coefficient \(Q_\mathrm{ext}\) for amorphous silicate spheres with particle sizes \(a\) of 0.01, 0.1, 1, and 10 microns, for observational wavelengths ranging from 1 angstrom to 1 mm. At short wavelengths, the smallest grains show discontinuities at the x-ray absorption edges. In the IR, the medium sized grains show prominent silicate absorption features at 9.7 microns and 18 microns (not seen for the largest grains). Credit: Draine Fig 22.6"&gt;&lt;figcaption&gt;
 &lt;p&gt;Values of the dimensionless extinction coefficient \(Q_\mathrm{ext}\) for amorphous silicate spheres with particle sizes \(a\) of 0.01, 0.1, 1, and 10 microns, for observational wavelengths ranging from 1 angstrom to 1 mm. At short wavelengths, the smallest grains show discontinuities at the x-ray absorption edges. In the IR, the medium sized grains show prominent silicate absorption features at 9.7 microns and 18 microns (not seen for the largest grains). Credit: Draine Fig 22.6&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
</description></item><item><title>Interstellar Dust</title><link>https://iancze.github.io/courses/astro542/lectures/18-dust/</link><pubDate>Fri, 08 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/18-dust/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Oct+8/1_m09odw6b"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch 6.1&lt;/li&gt;
&lt;li&gt;Draine Ch. 21&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Modern astronomers are quick to recognize the effects of dust obscuration, such as in the dark lanes across the Milky Way Galaxy&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/18-dust/plate1.png"
 alt="Credit: Draine Plate 1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;And of course we can use infrared observations to directly measure the emission &lt;em&gt;from&lt;/em&gt; dust&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/18-dust/plate2.png"
 alt="Credit: Draine Plate 2"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;But without these tools, how could you be so sure that something like dust must exist, especially just by observing by eye? Astro&lt;em&gt;photographs&lt;/em&gt; by Barnard in the late 1800s and early 1900s were highly suggestive of dark or obscuring structures.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/18-dust/fig6-1.jpg"
 alt="Photograph by E.E. Barnard of the dark structures near the Theta Ophiuchi (bright star w/ diffraction ring) Credit: Ryden and Pogge, Figure 6.1. Originally Barnard 1899"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photograph by E.E. Barnard of the dark structures near the Theta Ophiuchi (bright star w/ diffraction ring) Credit: Ryden and Pogge, Figure 6.1. Originally Barnard 1899&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The widespread existence of dust came about in the work by Robert Trumpler in 1930, when he studied open clusters in our Galaxy. Basically, he treated bright main sequence stars as standard candles, the size of clusters as standard rulers, and found that things didn&amp;rsquo;t correlate the way he might expect just from the inverse square law alone. Instead, he found a better fit if the light was attenuated exponentially
$$
I_\lambda = I_{\lambda, 0} e^{-\kappa_\lambda r}
$$
with a wavelength dependence such that bluer wavelengths suffered larger attenuation.&lt;/p&gt;
&lt;p&gt;Dust&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;dims light (overall)&lt;/li&gt;
&lt;li&gt;dims blue optical wavelengths more than red optical or infrared wavelengths&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Trumpler hypothesized that this could be due to Rayleigh scattering from &amp;ldquo;fine cosmic dust particles.&amp;rdquo;&lt;/p&gt;




&lt;h2 id="effect-of-dust-on-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/18-dust/#effect-of-dust-on-observations"&gt;Effect of dust on observations&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Dust particles &lt;em&gt;scatter&lt;/em&gt; light. We see this in reflection nebulae.&lt;/li&gt;
&lt;li&gt;Dust particles also &lt;em&gt;absorb&lt;/em&gt; light.&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/18-dust/plate11.png"
 alt="The blue nebulosity in the north is primarily scattered light. Credit: Draine Plate 11"&gt;&lt;figcaption&gt;
 &lt;p&gt;The blue nebulosity in the north is primarily scattered light. Credit: Draine Plate 11&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The net effect of scattering and absorption is called &lt;em&gt;extinction&lt;/em&gt;. Extinction can depend on the polarization of the light, and so dust can also polarize light.&lt;/p&gt;
&lt;p&gt;The relative amount of scattering and absorbing depends on the properties of the dust grains.&lt;/p&gt;
&lt;p&gt;We can write the observed flux of a star as
$$
F_\lambda = F_{\lambda, 0} e^{-\tau_\lambda}
$$
basically, the original flux attenuated by a medium with some optical depth.&lt;/p&gt;
&lt;p&gt;There isn&amp;rsquo;t much starlight at energies \(h \nu &amp;gt; 13.6\) eV, so dust extinction studies have generally focused on near UV and optical \(\lambda &amp;gt; 912\) angstroms.&lt;/p&gt;
&lt;p&gt;Because optical astronomers are usually making these calculations, we&amp;rsquo;re usually working in &lt;em&gt;magnitudes&lt;/em&gt;. So, we can write the ratio of the original flux to the attenuated flux relative as a number of magnitudes of extinction. The definition is&lt;/p&gt;
&lt;p&gt;$$
A_\lambda = 2.5 \log_{10} \left ( \frac{F_{\lambda, 0}}{F_\lambda } \right)
$$
making the relationship between magnitudes and optical depth
$$
A_\lambda = 2.5 \log_{10}(e^{\tau_\nu}) \approx 1.086 \tau_\lambda.
$$&lt;/p&gt;
&lt;p&gt;The exctinction curve as a function of wavelength looks like the following&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/18-dust/fig6-2.jpg"
 alt="Dust exctinction as a function of wavelength, normalized relative to \(A_V\), the extinction in the V band. Credit: Ryden and Pogge, Figure 6.2. data from Cardelli&amp;#43;94 and O&amp;rsquo;Donnell 94."&gt;&lt;figcaption&gt;
 &lt;p&gt;Dust exctinction as a function of wavelength, normalized relative to \(A_V\), the extinction in the V band. Credit: Ryden and Pogge, Figure 6.2. data from Cardelli+94 and O&amp;rsquo;Donnell 94.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;and this is determined by comparing the spectrum of a nearby (un-extincted) star to a more distant star of the same spectral type.&lt;/p&gt;
&lt;p&gt;Cardelli+89 showed that the extinction relative to some reference wavelength can be described as a function
$$
A_\lambda / A_{\lambda_\mathrm{ref}} \approx f(\lambda)
$$&lt;/p&gt;
&lt;p&gt;The actual functional form is a non-parametric spline fit with seven free parameters.&lt;/p&gt;
&lt;p&gt;The first thing you should notice is that bluer wavelengths (in particular UV and blue optical) are much more heavily extincted than redder wavelengths (red optical and near-infrared). This means that dust will &lt;em&gt;attenuate&lt;/em&gt; and &lt;em&gt;redden&lt;/em&gt; the light from a star.&lt;/p&gt;
&lt;p&gt;There is also an excess extinction feature at \(\lambda \approx 2175\) angstroms, which is attributed to carbon rich particles. It is metallicity dependent.&lt;/p&gt;
&lt;p&gt;At optical wavelengths, the extinction is a bit simpler. At optical and near-UV wavelengths, the functional form can be approximated by a one-parameter family of curves
$$
A_\lambda / A_{\lambda_\mathrm{ref}} \approx f(\lambda; R_V)
$$
where that one parameter is \(R_V\).&lt;/p&gt;
&lt;p&gt;The redenning is commonly expressed in terms of a color excess between B and V optical bands,
$$
E(B-V) = A_B - A_V
$$
and is usually a positive number.&lt;/p&gt;
&lt;p&gt;If you assume that the extinction curve is the same everywhere, then by knowing \(A_V\) you would be able to calculate \(E(B-V)\) (or vice-versa). Unfortunately it&amp;rsquo;s not guaranteed to be the same everywhere, and in fact, the &lt;em&gt;ratio of total to selective extinction&lt;/em&gt; is
$$
R_V \equiv \frac{A_V}{E(B-V)}
$$
and ranges for values of \(2 \lesssim R_V \lesssim 6\) for interstellar dust, which means that for every magnitude of \(V\)-band extinction, you&amp;rsquo;re going to get 1.2 - 1.5 magnitudes of B band extinction. You can think of \(R_V\) as the inverse of the slope of the extinction curve at optical wavelengths.&lt;/p&gt;
&lt;p&gt;If \(R_V \rightarrow \infty\), then you have a &amp;ldquo;grey&amp;rdquo; absorber where all wavelengths of light are absorbed equally. This would also be what happens if we were in the &amp;ldquo;geometric optics&amp;rdquo; limit, where the dust grains were large compared to the wavelength. The fact that we do see extinction rise with wavelength means that small grains (\(a \lesssim 0.015\,\mu m\)) must be making an appreciable contribution to extinction at observed wavelengths.&lt;/p&gt;
&lt;p&gt;The most commonly adopted value is \(R_V = 3.1\) for general ISM.&lt;/p&gt;
&lt;p&gt;With the Milky Way Galaxy, the amount of dust extinction along a line of sight is found to be correlated with the total column density of hydrogen.&lt;/p&gt;
&lt;p&gt;For sightlines with \(R_V \approx 3.1\),
$$
\frac{N_H}{A_V} \approx 1.9 \times 10^{21}\,\mathrm{cm}^{-2}\;\mathrm{mag}^{-1}.
$$
This is a really useful relationship because if you&amp;rsquo;ve measured the exctinction, then you can estimate the column density of hydrogen, or vice-versa.&lt;/p&gt;
&lt;p&gt;And realistically, you&amp;rsquo;d try to combine as much information that you can in our galaxy, including dust emission, and then you can produce a map of extinction for any extragalactic source. The most famous of these papers is the &amp;ldquo;SFD&amp;rdquo; paper by Schlegel, Finkbeiner, and Davis. Even at high galactic latitudes, there can still be some extinction (~0.05 mags \(A_V\))&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://ui.adsabs.harvard.edu/search/filter_database_fq_database=AND&amp;amp;filter_database_fq_database=database%3A%22astronomy%22&amp;amp;fq=%7B!type%3Daqp%20v%3D%24fq_database%7D&amp;amp;fq_database=%28database%3A%22astronomy%22%29&amp;amp;q=citation_count%3A%5B1%20TO%20100000%5D&amp;amp;sort=citation_count%20desc%2C%20bibcode%20desc&amp;amp;p_=0"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/18-dust/citations.png"
 alt="A ranking of the most highly cited astronomy papers (retrieved 10/2/2021). Note Schlegel, Finkbeiner, and Davis in 3rd place after the Riess&amp;#43; and Perlmutter&amp;#43; papers discovering the accelerating universe. (not sure I would classify the Perdew and Zunger paper as astronomy, despite what ADS says.)"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A ranking of the most highly cited astronomy papers (retrieved 10/2/2021). Note Schlegel, Finkbeiner, and Davis in 3rd place after the Riess+ and Perlmutter+ papers discovering the accelerating universe. (not sure I would classify the Perdew and Zunger paper as astronomy, despite what ADS says.)&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Because some exctinction is caused by absorption instead of scattering, dust grains are heated by (optical) starlight, after which they then cool by the emission of infrared light.&lt;/p&gt;
&lt;p&gt;In our Milky Way Galaxy, the infrared portion of the interstellar radiation field peaks at \(\lambda \approx 140\,\mu m\), which corresponds to dust temperatures of 100 K.&lt;/p&gt;
&lt;p&gt;Using the hydrogen column density on a typical sightline through our galaxy, we can make an order of magnitude calculation as to the infrared luminosity per hydrogen nucleus
$$
\frac{I_\mathrm{IR}}{N_H} = 5 \times 10^{-24}\,\mathrm{erg\,s}^{-1}.
$$&lt;/p&gt;
&lt;p&gt;By comparision, the Sun&amp;rsquo;s luminosity per hydrogen nucleus is
$$
\sim L_\odot m_H / M_\odot \approx 3 \times 10^{-24}\,\mathrm{erg\,s}^{-1}.
$$&lt;/p&gt;
&lt;p&gt;So we took a lot of liberty here in the approximations, but the ISM is &lt;em&gt;about as efficient as converting sunlight into infrared light as the Sun is at creating sunlight to begin with&lt;/em&gt;. This has important implications for the observations of external galaxies&amp;mdash;for some with large dust content, the amount of energy in IR radiation is comparable to that contained in optical starlight.&lt;/p&gt;
&lt;p&gt;Basically, the dust in a galaxy is efficient at absorbing optical starlight and re-radiating it as infrared emission.&lt;/p&gt;</description></item><item><title>Radiative Trapping</title><link>https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/</link><pubDate>Wed, 06 Oct 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+6/1_zra8cex4"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;p&gt;Draine Ch. 19&lt;/p&gt;
&lt;p&gt;In many situations, like dense molecular clouds or stellar outflows, we have sufficient gas densities that a photon emitted by
$$
X_u \rightarrow X_l
$$
will have a high probability of being absorbed by another \(X_l\) nearby, and, thus, a low probability of ever escaping from the region and being seen by an observer. This is called &lt;strong&gt;radiative trapping&lt;/strong&gt;. The consequences are&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;reducing the emission from \(X_u \rightarrow X_l\) emerging from the region&lt;/li&gt;
&lt;li&gt;changes the level populations of \(X\) relative to what would be calculated if the photons escaped&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;This is (one of the reasons) why radiative transfer can be hard in practice: excitation is coupled across non-local scales. One way to simplify things a tiny bit is to use the &lt;strong&gt;escape probability approximation&lt;/strong&gt;.&lt;/p&gt;




&lt;h2 id="escape-probability-approximation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/#escape-probability-approximation"&gt;Escape probability approximation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider some slab of material that has light passing through it. The optical depth through this slab is \(\tau_\nu\). What is the probability that the photon makes it through the slab without being absorbed?
$$
\beta = e^{-\tau_\nu}.
$$
I.e.,&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;if \(\tau_\nu \approx 0\), the photon will pass through with probability of 1 (it will nearly always go through).&lt;/li&gt;
&lt;li&gt;if \(\tau_\nu \gtrsim 1\), the photon will pass through with probability of 0 (it will nearly always be absorbed).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Now, let&amp;rsquo;s consider some location \(\boldsymbol{r}\) within a cloud, and we&amp;rsquo;re looking in a direction \(\hat{\boldsymbol{n}}\).&lt;/p&gt;
&lt;p&gt;The total optical depth (i.e., integrated until we&amp;rsquo;re out of the cloud) in that direction is \(\tau_\nu(\hat{\boldsymbol{n}}, \boldsymbol{r}) \).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q&lt;/strong&gt;: If an atom at this location emitted a photon (in a random direction), what would be the probability that it would escape the cloud?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;A&lt;/strong&gt;: We&amp;rsquo;d need to average over all directions
$$
\bar{\beta}_\nu( \boldsymbol{r}) \equiv \int \frac{d \Omega}{4 \pi} e^{-\tau_nu(\hat{\boldsymbol{n}}, \boldsymbol{r})}.
$$&lt;/p&gt;
&lt;p&gt;So far, we&amp;rsquo;ve been treating the escape probability at a single frequency. But let&amp;rsquo;s say we wanted to treat the escape probability averaged over the line profile
$$
\langle \beta(\boldsymbol{r}) \rangle = \int \phi_\nu \bar{\beta}_\nu( \boldsymbol{r}) d \nu.
$$&lt;/p&gt;
&lt;p&gt;Now, let&amp;rsquo;s re-examine the level population calculations using the escape probability approximation and making a few more assumptions&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;excitation level is uniform throughout the cloud&lt;/li&gt;
&lt;li&gt;&amp;ldquo;on-the-spot&amp;rdquo;: if a radiated photon will be absorbed, it&amp;rsquo;s absorbed close to (at) the point of emission&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Plan&lt;/strong&gt;: start with the rate of change of populations for a two-level system, incorporate the escape probabilities, and see what&amp;rsquo;s changed.&lt;/p&gt;
&lt;p&gt;$$
\frac{d n_u}{d t} = \left ( n_c k_{lu} + n_\gamma \frac{g_u}{g_l} A_{ul} \right) n_l - \left ( n_c k_{ul} + A_{ul} + n_\gamma A_{ul} \right) n_u
$$&lt;/p&gt;
&lt;p&gt;Because the cloud is uniformly excited, we can take the RTE in integral form and rewrite it as
$$
I_\nu = I_\nu(0) e^{-\tau_\nu} + B_\nu(T_\mathrm{exc})(1 - e^{-\tau_\nu})
$$&lt;/p&gt;
&lt;p&gt;and then rewrite it in terms of photon number&lt;/p&gt;
&lt;p&gt;$$
n_\gamma(\nu) = n_\gamma^{(0)} e^{-\tau_\nu} + \frac{1 - e^{-\tau_\nu}}{(n_l g_u)/(n_u g_l) - 1}
$$
where&lt;/p&gt;
&lt;p&gt;$$
n_\gamma^{(0)} \equiv \frac{c^2}{2 h \nu^3} I_\nu(0).
$$&lt;/p&gt;
&lt;p&gt;we can introduce \(\beta_\nu\), do the average over direction and line profile, and obtain&lt;/p&gt;
&lt;p&gt;$$
\langle n_\gamma \rangle = \langle \bar{\beta} \rangle n_\gamma^{(0)} + \frac{1 - \langle \bar{\beta} \rangle}{(n_l g_u/n_u g_l) - 1}.
$$&lt;/p&gt;
&lt;p&gt;Then we can rewrite the rate of change of level populations to be
$$
\frac{d n_u}{d t} = n_c k_{lu} n_l + n_c k_{ul} n_u - \langle \bar{\beta}\rangle A_{ul} n_u + n_l \frac{g_u}{g_l} \langle \bar{\beta}\rangle A_{ul} n_\gamma^{(0)} \left (1 - \frac{n_u g_l}{n_l g_u} \right),
$$
which is called the &lt;strong&gt;escape probability approximation&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;What have we done? We started with an equation that included photoexcitation and stimulated emission, including the effects of photons emitted by the cloud material, which is valid in the optically thick regime as well as optically thin. This was tricky to solve in the optically thick regime, because the terms were &lt;strong&gt;coupled across large distances&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;We have used the escape probability approximation and written down an equation that the level populations must satisfy, taking into account all of the radiative processes (absorption, spontaneous emission, and stimulated emission), and photons both originating externally and emitted with the cloud, assuming we know the value of the escape probability \(\langle \bar{\beta} \rangle\).&lt;/p&gt;
&lt;p&gt;This modified equation is nice because it functionally looks like an equation we&amp;rsquo;d write down with&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;no internally generated radiation field present&lt;/li&gt;
&lt;li&gt;cloud was optically thin to external radiation field \(I_\nu(0)\)&lt;/li&gt;
&lt;li&gt;the Einstein A coefficient replaced by an effective value \(\langle \hat{\beta}\rangle A_{ul}\).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Because this is functionally &amp;ldquo;optically thin,&amp;rdquo; we&amp;rsquo;ve decoupled the terms across distances and can use it to calculate level populations in a much more straightforward manner.&lt;/p&gt;
&lt;p&gt;Can also be applied to the case of scattering.&lt;/p&gt;
&lt;p&gt;&lt;a href="https://ned.ipac.caltech.edu/level5/March02/Netzer/Netzer2_4.html"&gt;See also notes from NED/Caltech&lt;/a&gt;.&lt;/p&gt;




&lt;h2 id="applications"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/#applications"&gt;Applications&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="homogeneous-static-spherical-cloud"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/#homogeneous-static-spherical-cloud"&gt;Homogeneous static spherical cloud&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The angle-averaged escape probability depends on the geometry and velocity structure of the region, e.g., escape probability will be largest at cloud boundary and lowest in cloud center. There are ways to define a mass-averaged escape probability over the cloud volume (see Draine 19.2).&lt;/p&gt;




&lt;h3 id="co-j1-0"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/#co-j1-0"&gt;CO J=1-0&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Frequently used as a tracer of molecular gas. Can follow through from an assumption of Gaussian velocity distribution, and known quantities about partition functions. Given these, we usually find that at least the J=1-0 level of CO is expected to be thermalized in molecular clouds.&lt;/p&gt;




&lt;h2 id="local-velocity-gradient-lvg-approximation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/#local-velocity-gradient-lvg-approximation"&gt;Local Velocity Gradient (LVG) approximation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Also called Sobolev approximation.&lt;/p&gt;
&lt;p&gt;If there is a velocity gradient such that point-to-point differences across the flow field are large compared to the width of the velocity distribution at a given point, then you can get more emission out of the cloud (it has more frequency bandwidth on which to emit).&lt;/p&gt;




&lt;h2 id="escape-probability-for-turbulent-clouds-and-the-x-factor-between-mathrmh_2-mass-and-co"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/17-radiative-trapping/#escape-probability-for-turbulent-clouds-and-the-x-factor-between-mathrmh_2-mass-and-co"&gt;Escape probability for turbulent clouds and the &amp;ldquo;X-Factor&amp;rdquo; between \(\mathrm{H}_2\) mass and CO&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Observers of the cold ISM face a perennial problem in that most of the hydrogen mass in these regions is in molecular form \(\mathrm{H}_2\), but that molecule doesn&amp;rsquo;t have a dipole moment, and therefore does not readily emit, so it&amp;rsquo;s &amp;ldquo;invisible.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;It&amp;rsquo;s commonly assumed that CO 1-0 transition can be used as a tracer of molecular cloud mass.&lt;/p&gt;
&lt;p&gt;Assumptions underpinning this include virialized clouds (setting turbulent velocity dispersions). Depends on the density and temperature of the cloud.&lt;/p&gt;</description></item><item><title>Nebular Diagnostics</title><link>https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/</link><pubDate>Tue, 28 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Oct+4/1_y8cn2wz4"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch. 4.4&lt;/li&gt;
&lt;li&gt;Draine Ch. 18&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Goal&lt;/strong&gt;: use emission features from an H II region or planetary nebulae to estimate its temperature and density&lt;/p&gt;
&lt;p&gt;Ionized nebulae are far from blackbodies (not a terrible approximation for opaque bodies like stars). I.e., a blackbody &lt;em&gt;absorbs&lt;/em&gt; all incident radiation and emits blackbody radiation.&lt;/p&gt;
&lt;p&gt;A parcel of gas (say, containing neutral hydrogen atoms) can be in LTE (thermalized via collisions w/ free electrons) but not be a blackbody (i.e. emits via spectral lines, and continuum emission via free-free, free-bound, and two-photon emission).&lt;/p&gt;
&lt;p&gt;Though the continuum emission does depend on density and temperature, because of the bright, spectrally concentrated nature of spectral line emission, this usually provides a more direct measurement.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/fig4-5.jpg"
 alt="Spectrum of a disk H II region in the Whirlpoos Galaxy. The top panel shows the bright lines and the bottom panel is scaled to show the fainter lines. Credit: Ryden and Pogge Figure 4.5"&gt;&lt;figcaption&gt;
 &lt;p&gt;Spectrum of a disk H II region in the Whirlpoos Galaxy. The top panel shows the bright lines and the bottom panel is scaled to show the fainter lines. Credit: Ryden and Pogge Figure 4.5&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="calculating-temperature"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/#calculating-temperature"&gt;Calculating temperature&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;A good rule of thumb for using emission lines is to find two excited states of the same ion (making sure you&amp;rsquo;re tracing the same conditions) whose energy levels differ by \(\sim k T\), where \(T\) is the temperature of the gas.&lt;/p&gt;
&lt;p&gt;If \(T \sim 10^4\) K, then we&amp;rsquo;re looking for energy differences on the order 1 eV.&lt;/p&gt;
&lt;p&gt;For H II regions, the following are useful&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;doubly ionized oxygen O III&lt;/li&gt;
&lt;li&gt;singly ionized nitrogen N II&lt;/li&gt;
&lt;li&gt;doubly ionized sulfur S III&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;all of these have similar electronic structure, they are 4 electrons short of a filled outer shell and therefore their lowest energy levels have a similar form.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/fig4-6.jpg"
 alt="Energy levels of the ions O III, N II, and S III. Credit: Ryden and Pogge Figure 4.6"&gt;&lt;figcaption&gt;
 &lt;p&gt;Energy levels of the ions O III, N II, and S III. Credit: Ryden and Pogge Figure 4.6&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The ground state is \({}^3 \mathrm{P}_J\), which has fine structure sub-levels corresponding to angular momentum quantum levels \(J=0,1,2\).&lt;/p&gt;
&lt;p&gt;The first excited state of the ion is written with \({}^1 \mathrm{D}\). A transition from this state to the \(J=1,2\) states of the ground state are forbidden magnetic dipole transitions, and the transition to the \(J=0\) is an even more strongly forbidden electric quadrupole transition.&lt;/p&gt;
&lt;p&gt;Each of these transitions produce lines like [O III], [N II] that are in the optical band, and thus easily observable from the ground. These lines include those frequently termed &amp;ldquo;nebular lines&amp;rdquo; \({}^1 \mathrm{D} \rightarrow {}^3 \mathrm{P}_J \) (because they were first seen in planetary nebulae)&lt;/p&gt;
&lt;p&gt;and &amp;ldquo;auroral lines&amp;rdquo; \({}^1 \mathrm{S} \rightarrow {}^1 \mathrm{D} \) because a similar line is seen in Earth&amp;rsquo;s aurorae. We number the different energy levels with numbers 0, 1, 2, 3, 4 for simplicity, so we don&amp;rsquo;t need to keep using the spectroscopic notation. There are higher energy levels to the ions, but at typical temperatures in ISM they are essentially unpopulated.&lt;/p&gt;
&lt;p&gt;Now let&amp;rsquo;s calculate temperature from observed line strength. First, assume we are in the low-density limit such that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(n_e &amp;lt; n_\mathrm{crit} \) for each line under consideration&lt;/li&gt;
&lt;li&gt;meaning collisional excitation is followed by radiative de-excitation (not collisional de-excitation)&lt;/li&gt;
&lt;li&gt;radiative de-excitation will primarily be spontaneous instead of stimulated emission&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The emissivity from spontaneous emission is
$$
j_\nu = n_u \frac{A_{ul}}{4 \pi} h \nu_{ul} \Phi_\nu
$$
so, integrated over the line profile, we have
$$
j(4 \rightarrow 3) = n_4 \frac{A_{43}}{4 \pi} h \nu_{43}.
$$&lt;/p&gt;
&lt;p&gt;When we are in excitational equilibrium, the rate of collisional excitation from the ground state (remember, most things in ground state) to level 4 will be balanced by the radiative de-excitation from level 4.&lt;/p&gt;
&lt;p&gt;For quantum mechanical reasons, transitions from 4 to 2 and 4 to 0 are strongly forbidden. Therefore we only need to take into account the transitions from 4 to 3 and 4 to 1
$$
n_0 n_e k_{04} = n_4 (A_{43} + A_{41}).
$$
We can use this with the previous equation to include the collinional rate coefficient (and thus the temperature dependence) in the emissivity as
$$
j(4 \rightarrow 3) = \frac{n_0 n_e}{4 \pi} k_{04} \frac{A_{43}}{A_{43} + A_{41}} h \nu_{43}
$$
the temperature dependence comes in through the \(k_{04}\) term.&lt;/p&gt;
&lt;p&gt;Now let&amp;rsquo;s consider the \(3\rightarrow 2\) line. Like the level 4 state, the level 3 state can be populated through direct collisional excitation, but it can also be populated through radiative de-excitation from the level 4 state. So, we would write&lt;/p&gt;
&lt;p&gt;$$
j(3 \rightarrow 2) = \frac{n_0 n_e}{4 \pi} \left [ k_{03} + k_{04} \frac{A_{43}}{A_{43} + A_{41}}\right] \frac{A_{32}}{A_{32} + A_{31}} h \nu_{32}.
$$&lt;/p&gt;
&lt;p&gt;We can get rid of the dependence on density by taking the ratio of the line strengths
$$
\frac{j(4 \rightarrow 3)}{j(3 \rightarrow 2)} = \frac{A_{43} \nu_{43}}{A_{32} \nu_{32}} \frac{(A_{32} + A_{31}) k_{04}}{(A_{43} + A_{41})k_{03} + A_{43} k_{04}}.
$$
All of the \(A\) terms are known from quantum mechanics, and the collisional rate coefficients are dependent on temperature.&lt;/p&gt;
&lt;p&gt;From the principle of detailed balance, we can relate the collisional rate coefficients by
$$
\frac{k_{0u}}{k_{u0}} = \frac{g_u}{g_0} \exp \left (- \frac{h \nu_{u0}}{k T} \right).
$$&lt;/p&gt;
&lt;p&gt;In practice, there are some simplifications and factorizations that occur. First, we write the collision de-excitation rate cofficient as
$$
k_{u0} = \frac{\beta}{T^{1/2}} \frac{\Omega_{u0}}{g_u}
$$
where
$$
\beta = \left ( \frac{2 \pi \hbar^4}{k m_e^3} \right)^{1/2}
$$
and \(\Omega_{u0}\) is a dimensionless collision strength. \(\Omega_{u0}\) is nearly independent of temperature (and of order unity) so \(k_{u0} \propto T^{-1/2}\).&lt;/p&gt;
&lt;p&gt;We can refactor the line strength ratios into
$$
\frac{j(4 \rightarrow 3)}{j(3 \rightarrow 2)} = \frac{A_{43} \nu_{43}}{A_{32} \nu_{32}} \frac{(A_{32} + A_{31}) \Omega_{40} \exp(- h \nu_{43}/kT)}{(A_{43} + A_{41} \Omega_{30})+ A_{43} \Omega_{40} \exp(- h \nu_{43} / kT)}.
$$
The temperature dependence is now isolated to the exponent. We&amp;rsquo;re going to get the most variation in the line ratios for temperatures \(kT \approx h \nu_{43}\), which is 1.96 eV for S III and 2.84 eV for O III.&lt;/p&gt;




&lt;h2 id="calculating-density"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/#calculating-density"&gt;Calculating density&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;To calculate density, its helpful to identify an ion with two electronic transitions that are similar in energy but have very different critical densities. One good example is singly ionized oxygen, O II.&lt;/p&gt;
&lt;p&gt;Remember that critical density
$$
n_\mathrm{crit} = \frac{(1 + n_\gamma) A_{ul}}{k_{ul}}.
$$&lt;/p&gt;
&lt;p&gt;The first excited state \({}^2 \mathrm{D}^O\) is split into a pair of fine structure sub-levels, whose energies are finely separated. Thus, the transitions from the first excited state to ground produce a line doublet (two lines closely spaced in wavelength). The critical densities for the different lines of the doublet, however, are substantially different.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/16-nebular-diagnostics/fig4-7.jpg"
 alt="Energy levels of the O II and S II ions. Credit: Ryden and Pogge Figure 4.7"&gt;&lt;figcaption&gt;
 &lt;p&gt;Energy levels of the O II and S II ions. Credit: Ryden and Pogge Figure 4.7&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Following a similar path as before, we&amp;rsquo;ll write down the emissivities for each line of the doublet and then compute their ratios, and do some simplifications using collisional strengths to get&lt;/p&gt;
&lt;p&gt;$$
\frac{j(2\rightarrow 0)}{j(1 \rightarrow 0)} \approx \frac{\Omega_{02}}{\Omega_{01}} \frac{1 + n_e/n_{\mathrm{crit},1}}{1 + n_e/n_{\mathrm{crit},2}}.
$$&lt;/p&gt;
&lt;p&gt;In the low-density limit, we have
$$
\frac{j(2\rightarrow 0)}{j(1 \rightarrow 0)} \approx \frac{\Omega_{02}}{\Omega_{01}},
$$
the line ratios are independent of the free electron density \(n_e\).&lt;/p&gt;
&lt;p&gt;In the high density limit, we have
$$
\frac{j(2\rightarrow 0)}{j(1 \rightarrow 0)} \approx \frac{\Omega_{02}}{\Omega_{01}}\frac{n_{\mathrm{crit},2}}{n_{\mathrm{crit},1}},
$$
which is again independent of the free electron density \(n_e\) (but does depend on the critical densities of each transition).&lt;/p&gt;
&lt;p&gt;To get something that does depend on the free electron density, we need to be in the regime where \(n_e\) is larger than one critical density but smaller than the other. For the O II transition we&amp;rsquo;ve been talking about, you can get line ratios that range from 0.4 to 1.4, which is a decent dynamic range.&lt;/p&gt;
&lt;p&gt;And, of course, in a modern analysis, usually multiple temperature and density diagnostics are combined into a single estimate of temperature and density.&lt;/p&gt;
&lt;p&gt;More details are in Ryden and Pogge, Ch 4.4.&lt;/p&gt;</description></item><item><title>Collisional Excitation</title><link>https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/</link><pubDate>Sat, 25 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+October+1/1_n7gksgev"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 17. also Ch. 3&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Why is collisional excitation important?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;puts atoms, ions, and molecules into excited states from which they can decay radiatively, leading to cooling of the gas&lt;/li&gt;
&lt;li&gt;puts species into excited states that are used as diagnostics of physical conditions in the gas, such as density, temperature, or radiation field&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Collisional rate coefficients between initial and final state
$$
k_{if} \equiv \langle \sigma v \rangle_{i \rightarrow f}
$$
having units \(\mathrm{cm}^3\,\mathrm{s}^{-1}\).&lt;/p&gt;
&lt;p&gt;And radiative transition probability
$$
A_{if} \equiv A_{i \rightarrow f}
$$
having units \(\mathrm{s}^{-1}\).&lt;/p&gt;




&lt;h2 id="two-level-atom"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/#two-level-atom"&gt;Two-Level atom&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider only the ground state and the first excited state of an atom. Usually not a bad assumption, especially for getting down the basics.&lt;/p&gt;
&lt;p&gt;Only processes acting are&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;collisional excitation&lt;/li&gt;
&lt;li&gt;collisional de-excitation&lt;/li&gt;
&lt;li&gt;radiative decay&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="assume-no-background-radiation-present"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/#assume-no-background-radiation-present"&gt;Assume no background radiation present&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let the number densities of the two levels of the atom be given by \(n_0\) and \(n_1\), and the number density of the collisional partner (e.g., electrons) be given by \(n_c\). Then,&lt;/p&gt;
&lt;p&gt;The rate of change of the excited state is
$$
\frac{d n_1}{d t} = n_c n_0 k_{01} - n_c n_1 k_{10} - n_1 A_{10}.
$$
i.e., it&amp;rsquo;s equal to how quickly level 1 is populated via collisions, minus how quickly level 1 is depopulated via collisions, minus how quickly level 1 is depopulated via spontaneous emission. Steady-state solution is \(dn_1/dt = 0\).&lt;/p&gt;
&lt;p&gt;The law of mass action and the principle of detailed balance allow us to relate the collisional rate coefficients by
$$
\frac{k_{01}}{k_{10}} = \frac{g_1}{g_0} e^{-E_{10}/k T_\mathrm{gas}}.
$$
In the limit that \(n_c \rightarrow \infty\) (i.e., collisions thermalize the gas), the steady state solution has us bring about the relationship
$$
\frac{n_1}{n_0} = \frac{g_1}{g_0} e^{-E_{10}/k T_\mathrm{gas}}.
$$&lt;/p&gt;




&lt;h3 id="allow-background-radiation-to-be-present"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/#allow-background-radiation-to-be-present"&gt;Allow background radiation to be present&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Consider background radiation with dimensionless photon occupation number
$$
n_\gamma = \frac{c^3}{8 \pi h \nu^3} u_\nu.
$$&lt;/p&gt;
&lt;p&gt;The rate of change of the excited state is
$$
\frac{d n_1}{d t} = n_0 \left [ n_c k_{01} + n_\gamma \frac{g_1}{g_0} A_{10} \right] - n_1 \left [ n_c k_{10} + (1 + n_\gamma) A_{10} \right]
$$
where we&amp;rsquo;ve written the absorption coefficient \(B_{01}\) in terms of \(A_{10}\) and canceled out some prefactors. We&amp;rsquo;ve also included stimulated emission in the rightmost term.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s rearrange for the steady-state solution
$$
\frac{n_1}{n_0} = \frac{n_c k_{01} + n_\gamma(g_1/g_0) A_{10}}{n_c k_{10} + (1 + n_\gamma) A_{10}}
$$
and examine for a few cases.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;in the limit that \(n_\gamma \rightarrow 0\), we have the previous result&lt;/li&gt;
&lt;li&gt;in the limit \(n_c \rightarrow 0\), and the radiation field is described by a blackbody field with \(T_\mathrm{rad}\), then the level populations are given by Boltzmann w/ that temperature (gas temperature could be different, though, it&amp;rsquo;s just that the number density of the collisional partners isn&amp;rsquo;t high enough to thermalize anything to the gas temperature).&lt;/li&gt;
&lt;li&gt;if the temperature of the radiation field happens to match that of the gas \(T_\mathrm{gas}\), then the system will be brought into equilibrium independent of the collisional density&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="critical-density"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/15-collisional-excitation/#critical-density"&gt;Critical density&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;the density at which collisional deexcitation equals radiative deexcitation, including stimulated emission&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For a collisional partner \(c\), and the excited state, we define this as
$$
n_{\mathrm{crit}, u}(c) \equiv \frac{\sum_{l&amp;lt;u} [1 + (n_\gamma)&lt;em&gt;{ul}] A&lt;/em&gt;{ul}}{\sum_{u &amp;lt;l} k_{ul}(c)}
$$
i.e., we&amp;rsquo;re considering multi-level atoms and summing over all states \(l\) lower than the state under consideration (\(u\)).&lt;/p&gt;
&lt;p&gt;For a two-level atom, this is
$$
n_\mathrm{crit} = \frac{(1 + n_\gamma) A_{ul}}{k_{ul}}
$$&lt;/p&gt;
&lt;p&gt;This definition is appropriate only when the gas is optically thin such that the radiated photons escape. If it&amp;rsquo;s not, then we get &amp;ldquo;radiative trapping,&amp;rdquo; which we&amp;rsquo;ll discuss in a future lecture.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;In the &lt;strong&gt;high density regime&lt;/strong&gt;, \(n \gg n_\mathrm{crit}\), the rate of collisional deexcitation is greater than radiative deexcitation, and the excitation temperature is driven towards the kinetic temperature.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;In the &lt;strong&gt;low density regime&lt;/strong&gt; \(n &amp;lt; n_\mathrm{crit}\), the rate of collisional deexcitation is less than radiative deexcitation, and the excitation temperature is driven to the radiation temperature.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We can expand this discussion a bit by following the example from &lt;a href="http://www.astronomy.ohio-state.edu/~pogge/Ast871/Notes/Molecules.pdf"&gt;Richard Pogge&lt;/a&gt; and discuss the specific example of a medium with two-level atoms (ratio of states given by \(T_\mathrm{exc}\), kinetic temperature given by \(T\)) with some &lt;em&gt;blackbody&lt;/em&gt; radiation field with radiation temperature \(T_\mathrm{rad}\).&lt;/p&gt;
&lt;p&gt;In the high density regime, we&amp;rsquo;ll have the excitation temperature driven towards the kinetic temperature \(T_\mathrm{exc} \rightarrow T\). If \(T &amp;gt; T_\mathrm{rad}\) (common in ISM), we&amp;rsquo;ll see an emission line.&lt;/p&gt;
&lt;p&gt;In the low density regime, we&amp;rsquo;ll have the excitation temperature driven towards the radiation temperature \(T_\mathrm{exc} \rightarrow T_\mathrm{rad}\), in which case we won&amp;rsquo;t see a line.&lt;/p&gt;
&lt;p&gt;This is why it is sometimes said that a medium needs to be above the critical density for a line to &amp;ldquo;turn on.&amp;rdquo; Different species have different critical densities, and so line strengths can be used as density diagnostics.&lt;/p&gt;</description></item><item><title>Ionization in Predominantly Neutral Regions (cool + warm HI regions)</title><link>https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/</link><pubDate>Sat, 25 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Sep+29/1_z72pm38a"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 16&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;What is the degree of ionization expected in predominately neutral interstellar clouds?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(n_e\) determines ionization balance of various species&lt;/li&gt;
&lt;li&gt;collisions with free electrons play a role in determining the charge state for interstellar grains&lt;/li&gt;
&lt;li&gt;electrons play a role in interstellar chemistry and sometimes cool the gas via collisional excitation&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Three main regimes to consider&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;diffuse H I regions&lt;/strong&gt;: metals are photoionized by starlight; cosmic rays create a small amount of \(\mathrm{H}^+\) and \(\mathrm{He}^+\). The gas may be CNM or WNM.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;diffuse molecular clouds&lt;/strong&gt;: (e.g., moderate visual extinction \(0.3 \lesssim A_V \lesssim 2\,\mathrm{mag}\)). Most hydrogen is in \(H_\mathrm{2}\), metals still predominately photoionized by starlight. Cosmic rays can produce \(H_2^+\), which leads to the formation of \(H_3^+\).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;dark molecular clouds&lt;/strong&gt;: (e.g., visual extinction \(A_V \gtrsim 3\,\mathrm{mag}\)). Insufficient UV radiation to photoionize elements like C and S. CRs can maintain only a very small ionization \(10^{-7}\).&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="ionization-of-metals-in-h-i-regions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/#ionization-of-metals-in-h-i-regions"&gt;Ionization of metals in H I regions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Carbon is the fourth most astrophysically abundant element.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Q&lt;/strong&gt;: what&amp;rsquo;s the third?&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;A&lt;/strong&gt;: oxygen&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Nearly 60% of the carbon in the ISM is in solid grains, leaving a gas phase abundance of \(n_C / n_H \approx 1 \times 10^{-4}\).&lt;/p&gt;
&lt;p&gt;As we talked about in previous lectures, stellar photons with energies greater than \(I_H = 13.6\) eV cannot penetrate appreciable quantities of H I gas because the cross-section for photoionization is large. But, carbon has an ionization energy of 11.26 eV, which means that it can be ionized by starlight photons that &lt;em&gt;will&lt;/em&gt; penetrate H I. As a result, under typical ISM conditions, we find that 99% of the carbon is ionized.&lt;/p&gt;




&lt;h2 id="ionization-of-hydrogen-in-cool-h-i-regions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/#ionization-of-hydrogen-in-cool-h-i-regions"&gt;Ionization of hydrogen in cool H I regions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Stellar photons with energies greater than \(I_H = 13.6\) eV cannot penetrate appreciable quantities of H I gas, however x-rays of sufficiently high energies can reach the interior of clouds. Let&amp;rsquo;s revisit the photoionization cross sections from Ch. 13 to see why this is the case:&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/fig_13_1.jpg"
 alt="Photoionization cross sections for hydrogen atoms, molecular hydrogen, helium, carbon, and oxygen. The cross sections are high when the photon energy is close to the ionization energy, zero when the photon is below this, and decline as a power law when photon energies are much higher. The cross sections for C and O have an absorption edge corresponding to the minimum photon energy for photoionization from the 1s shell, thus boosting the cross section considerably compared to hydrogen. Credit: Draine 13.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photoionization cross sections for hydrogen atoms, molecular hydrogen, helium, carbon, and oxygen. The cross sections are high when the photon energy is close to the ionization energy, zero when the photon is below this, and decline as a power law when photon energies are much higher. The cross sections for C and O have an absorption edge corresponding to the minimum photon energy for photoionization from the 1s shell, thus boosting the cross section considerably compared to hydrogen. Credit: Draine 13.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Still, we&amp;rsquo;re talking about ionization fractions of \(n_e \lesssim 0.01\,\mathrm{cm}^{-3}\).&lt;/p&gt;
&lt;p&gt;Also need to consider cosmic ray flux.&lt;/p&gt;




&lt;h2 id="diffuse-molecular-gas"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/#diffuse-molecular-gas"&gt;Diffuse molecular gas&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;\(0.3 \lesssim A_V \lesssim 2\,\mathrm{mag}\)&lt;/p&gt;
&lt;p&gt;In regions where molecular hydrogen can be found, most ionizations produced by cosmic rays or x rays&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;detach an electron from \(H_\mathrm{2}\) to create \(\mathrm{H}^+\)&lt;/li&gt;
&lt;li&gt;if it encounters an electron, the \(\mathrm{H}^+\) will dissociatively recombine
$$
\mathrm{H}_2^+ + e^- \rightarrow \mathrm{H} + \mathrm{H}
$$&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;But, since the electron fraction in molecular gas is low, it may not encounter an electron before it undergoes the fast exothermic ion-molecular reaction
$$
\mathrm{H}_2^+ + \mathrm{H}_2 \rightarrow \mathrm{H}_3^+ + \mathrm{H}
$$
to form
\(\mathrm{H}_3^+\), which will eventually dissociatively recombine.&lt;/p&gt;
&lt;p&gt;Because all of these involve free electron densities, we can use the ratio of
$$
\frac{N(\mathrm{H}_3^+)}{N(\mathrm{H}_2^+)}
$$
to estimate the cosmic ray ionization rate in molecular clouds.&lt;/p&gt;




&lt;h2 id="dense-molecular-clouds-dark-clouds"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/14-ionization-neutral-regions/#dense-molecular-clouds-dark-clouds"&gt;Dense Molecular Clouds (dark clouds)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;When \(A_V \gtrsim 3\,\mathrm{mag}\), even carbon and sulfur remain predominately neutral. Most of the free electrons come from cosmic ray ionization.&lt;/p&gt;
&lt;p&gt;As before, CR ionization produces \(\mathrm{H}_2^+\), which leads to \(\mathrm{H}_3^+\). However, in dense molecular clouds the ionization fraction is so low that most of these \(\mathrm{H}_3^+\) ions react with atoms or molecules (like CO) to form a generic molecule (like \(HCO^+\)), which will then recombine dissociatively, capture an electron from a grain, or charge exchange with a neutral atom like sulfur.&lt;/p&gt;
&lt;p&gt;The end result of these reactions, though, is that deep within a molecular cloud cosmic ray ionization is the only source of free electrons, and even then, the gas has a very low ionization fraction. This has bearing on the magnetohydrodynamics of the gas, because ionized gas interacts with a magnetic field, while neutral gas doesn&amp;rsquo;t.&lt;/p&gt;</description></item><item><title>Photoionized Gas (HII regions)</title><link>https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/</link><pubDate>Sat, 25 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Sep+27/1_7qvclhvh"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;References&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch 4&lt;/li&gt;
&lt;li&gt;Draine Ch 15&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="picture-tour"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/#picture-tour"&gt;Picture tour&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/plate9.png"
 alt="Credit: Draine Plate 9"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 9&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/plate10.png"
 alt="Credit: Draine Plate 10"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 10&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/plate11.png"
 alt="Credit: Draine Plate 11"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 11&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;
&lt;p&gt;Recall ionization state from previous lecture
$$
x_e = \frac{n_e}{n_\mathrm{H}}.
$$
Note that here we are using \(n_\mathrm{H}\) as a marker for total number density of hydrogen, not just neutral hydrogen, i.e., \(n_\mathrm{H} = n_{\mathrm{H}^+} + n_{\mathrm{H}^0}\).&lt;/p&gt;
&lt;p&gt;In the warm ionized medium \(x_e \sim 0.7\) and in the hot ionized medium \(x_e \sim 1\). Recall from the Saha equation that ionization fraction is a function of temperature.&lt;/p&gt;
&lt;p&gt;There is the warm ionized medium (WIM) which is \(T \sim 8000\) K and \( n \approx 0.2\;\mathrm{cm}^{-3}\).
There are also H II regions and planetary nebulae (a denser analog to the WIM) that contain many diagnostic tools (in terms of emission) that allow us to determine densities, temperatures, compositions, and ionization states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;first ionization energy&lt;/strong&gt;: energy required to remove the most loosely bound electron from a neutral atom in its ground state. For hydrogen, this is 13.6 eV, for helium it&amp;rsquo;s 24.6 eV, for oxygen its 13.6 eV, for carbon its 11.3 eV, see R&amp;amp;P Table 1.2.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;question&lt;/em&gt;: why do we usually care about the energy required to hydrogen, but don&amp;rsquo;t talk much about the ionization energy from the \(n=2\) level, which is \(I=3.40\) eV?&lt;/p&gt;
&lt;p&gt;&lt;em&gt;answer&lt;/em&gt;: in realistic interstellar situations, \(n=2\) rare to find a large population of \(n=2\) hydrogen atoms, and the spontaneous emission \(A_{21}^{-1} \sim 2\) nanoseconds is quick, so rare that we&amp;rsquo;d have an encounter while it is populated. It would sooner emit a Lyman \(\alpha\) photon. To gain a 50% population of \(n=2\) atoms relative to \(n=1\) in LTE, we&amp;rsquo;d need to be at about 50,000 K, but at these temperatures the hydrogen will already be pretty much fully ionized.&lt;/p&gt;




&lt;h2 id="photoionization-and-radiative-recombination"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/#photoionization-and-radiative-recombination"&gt;Photoionization and Radiative Recombination&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Photoionization occurs by photoelectric absorption:
$$
\mathrm{H} + h \nu \rightarrow p + e^-
$$
where the photon energy must be \(h \nu &amp;gt; I_\mathrm{H}\) where \(I_\mathrm{H} = 13.60\) eV (otherwise no ionization will occur).&lt;/p&gt;
&lt;p&gt;Above this range, we have&lt;/p&gt;
&lt;p&gt;$$
\sigma_\mathrm{photo}(\nu) \approx \sigma_0 \left (\frac{h\nu}{Z^2 I_\mathrm{H}} \right)^{-3}
$$&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/fig4-1.jpg"
 alt="Photoionization cross sections for hydrogen, neutral helium, and hydrogenic helium. Credit: Ryden and Pogge, Figure 4.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photoionization cross sections for hydrogen, neutral helium, and hydrogenic helium. Credit: Ryden and Pogge, Figure 4.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="photoionization-rate"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/#photoionization-rate"&gt;Photoionization rate&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We can calculate \(\zeta_\mathrm{photo}\) the photoionization rate, or the rate at which a single neutral hydrogen atom undergoes photoionization.
$$
\zeta_\mathrm{photo} = \int_{\nu_0}^\infty \frac{\varepsilon_\nu}{h \nu} c \sigma_\mathrm{photo}(\nu)\,d\nu
$$
where \(\nu_0\) is the threshold ionization frequency and \(\varepsilon_\nu\) is the background energy density.&lt;/p&gt;
&lt;p&gt;Then, the &lt;strong&gt;volumetric photoionization rate&lt;/strong&gt; is
$$
\frac{d n_\mathrm{H\,II}}{d t} = n_\mathrm{H\,I} \zeta_\mathrm{photo}.
$$&lt;/p&gt;




&lt;h3 id="recombination-rate"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/#recombination-rate"&gt;Recombination rate&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Radiative recombination is the inverse process to photoionization
$$
p + e^- \rightarrow H + h \nu.
$$&lt;/p&gt;
&lt;p&gt;The radiative recombination rate is \(\alpha\) (we&amp;rsquo;ve used a more convenient variable than \(\langle \sigma v \rangle\), but its the same idea), and it&amp;rsquo;s
$$
\alpha_{nl}(T) = \left (\frac{8 k T}{\pi m_e} \right)^{1/2} \int_0^\infty \sigma_{rr,nl}(E) \frac{E}{k T} e^{-E/k T} \frac{d E}{k T}
$$
and has units of \(\mathrm{cm}^3\,\mathrm{s}^{-1}\).&lt;/p&gt;
&lt;p&gt;And the &lt;strong&gt;volumetric rate of radiative recombination&lt;/strong&gt; is
$$
\frac{d n_\mathrm{H\,II}}{d t} = - n_e n_\mathrm{H\,II} \alpha.
$$&lt;/p&gt;
&lt;p&gt;To calculate \(\alpha_n\), you need to sum over all values of angular momentum quantum number \(l\). To calculate \(\alpha\), the total recombination rate, need to sum over principle quantum numbers \(n\).&lt;/p&gt;
&lt;p&gt;We can calculate something called the recombination timescale
$$
t_\mathrm{rec} \equiv \frac{1}{\alpha n_e} \sim 0.6\;\mathrm{Myr}
$$
for H II to undergo radiative recombination. The timescale is typical for the warm ionized medium with \(T=8000\) K and \(n_e \sim 0.1\,\mathrm{cm}^{-3}\).&lt;/p&gt;
&lt;p&gt;In ionization equilibrium, we have
$$
n_\mathrm{H\,I} \zeta_\mathrm{photo} = n_e n_\mathrm{H\,II} \alpha.
$$&lt;/p&gt;




&lt;h2 id="stromgren-spheres"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/#stromgren-spheres"&gt;Stromgren spheres&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&amp;ldquo;Toy model&amp;rdquo; of an H II region. Assume we have &lt;em&gt;pure atomic hydrogen gas&lt;/em&gt; with uniform density \(n_\mathrm{H}\) that surrounds a central point source of light. What does the ionization fraction look like as a function of radius, as a function of time?&lt;/p&gt;
&lt;p&gt;Assume the central light source produces ionizing photons with \(h \nu &amp;gt; I_\mathrm{H}\) at a constant rate \(Q_0\) (i.e., photons/s). For a real O3 star with \(T_\mathrm{eff} \approx 45000\)K, \(Q_0 = 7.6\times10^{47}\,\mathrm{s}^{-1}\).&lt;/p&gt;
&lt;p&gt;Say we &amp;ldquo;turn on&amp;rdquo; the star at \(t=0\), the mean free path for ionizing photons traveling through the neutral medium is
$$
\lambda_\mathrm{mfp} = \frac{1}{n_\mathrm{H} \sigma_\mathrm{photo}} = 0.0014\,\mathrm{pc} \left ( \frac{n_\mathrm{H}}{40\,\mathrm{cm}^{-2}} \right)^{-1} \left ( \frac{\sigma_\mathrm{photo}}{6 \times 10^{-18}\,\mathrm{cm}^2} \right )^{-1}
$$
scalings are for cold neutral medium and photon just above ionization energy.&lt;/p&gt;




&lt;h3 id="calculate-the-volume-of-ionized-gas"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/13-photoionized-gas-hii/#calculate-the-volume-of-ionized-gas"&gt;Calculate the volume of ionized gas&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Assume that every ionizing photon will ionize hydrogen (and just pass through ionized hydrogen),
$$
Q_0 = n_\mathrm{H} \frac{d V}{d t} = n_\mathrm{H} 4 \pi R^2 \frac{d R}{d t}
$$
which we can integrate and find that the radius increases like
$$
R(t) = \left( \frac{3 Q_0}{4 \pi n_\mathrm{H}} t \right)^{1/3}.
$$&lt;/p&gt;
&lt;p&gt;We have neglected the facts that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;protons and electrons recombine at a rate proportional to \(\alpha n_e n_\mathrm{H\,II}\)&lt;/li&gt;
&lt;li&gt;some UV photons from central source reionize atoms that recombined&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;which modifies the equation to
$$
Q_0 = n_\mathrm{H} 4 \pi R^2 \frac{d R}{d t} + \alpha_{B,H} n_e n_H \frac{4 \pi}{3} R^3
$$
or
$$
Q_0 = n_\mathrm{H} 4 \pi R^2 \frac{d R}{d t} + \alpha_{B,H} n_H^2 \frac{4 \pi}{3} R^3
$$
we have assume the ionized sphere to be optically thick to ionizing radiation (case B) an that the fractional ionization is \(x = 1\).&lt;/p&gt;
&lt;p&gt;We can integrate this and obtain
$$
R(t) = R_S \left (1 - e^{-t/t_\mathrm{rec}} \right)^{1/3}
$$
where the Stromgren radius \(R_S\) is
$$
R_S \equiv \left (\frac{3 Q_0}{4 \pi \alpha_{B,H} n^2_H} \right)^{1/3} \approx 5.5\,\mathrm{pc}.
$$&lt;/p&gt;
&lt;p&gt;We can recalculate the recombination time assuming Case B and in a setup where we have a pure hydrogen H II region with fractional ionization \(x \approx 1\).
$$
t_\mathrm{rec} \approx \frac{1}{\alpha_{B, H} n_H} \approx 2600\,\mathrm{yr}
$$&lt;/p&gt;
&lt;p&gt;So it takes about 3000 years to create a Stromgren sphere.&lt;/p&gt;
&lt;p&gt;At times \(t \gg t_\mathrm{rec}\), the Stromgren sphere will consist of a sphere of almost fully ionized gas with radius \(R \rightarrow R_S \sim 6\,\mathrm{pc}\). Outside of this, there will be a partially ionized boundary of thickness \(\lambda_\mathrm{mfp} \sim 0.001\,\mathrm{pc} \ll R_S\), and outside of this, neutral hydrogen.&lt;/p&gt;
&lt;p&gt;Because radiative recombination is the reversal of photoionization, \(t_\mathrm{rec}\) is also the time it takes for the sphere to go back to neutral H after the central UV source has been turned off.&lt;/p&gt;
&lt;p&gt;The Stromgren sphere approaches ionization temperature, but it&amp;rsquo;s still far from pressure equilibrium, and as it&amp;rsquo;s heated, it will begin to expand.&lt;/p&gt;</description></item><item><title>Interstellar Radiation Fields, Ionization Processes, and Recombination of Ions with Electrons</title><link>https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/</link><pubDate>Mon, 20 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Sep+24/1_v7exn76l"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 12, 13, &amp;amp; 14&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch 4&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="interstellar-radiation-fields"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/#interstellar-radiation-fields"&gt;Interstellar Radiation Fields&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 12&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Pick a point in our galaxy, somewhere in the ISM. Look around. Unless you are deep in a molecular cloud, it isn&amp;rsquo;t completely dark.&lt;/p&gt;
&lt;p&gt;Of course, the intensity of the incoming radiation very much depends on your &lt;em&gt;location&lt;/em&gt; in the Galaxy and &lt;em&gt;where&lt;/em&gt; you&amp;rsquo;re looking. Remember, \(I_\nu\) is a function of &lt;em&gt;position&lt;/em&gt; \(\vec{x}\) and &lt;em&gt;direction&lt;/em&gt; \(\Omega\).&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;re very near an H II region, or partially shielded by a molecular cloud, etc&amp;hellip; you will see a different spectrum.&lt;/p&gt;
&lt;p&gt;The first part of this lecture is looking at the spectral energy distribution of the interstellar radiation field in a roughly &amp;ldquo;average&amp;rdquo; sense.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/fig_12_1.jpg"
 alt="Interstellar continuum radiation field \(u_\nu\) (spectral energy distribution) in an H I cloud in the solar neighborhood (not including spectral lines). Solid line is the sum of all individual components. Squares are additional all-sky measurements from COBE-DIRBE and ROSAT. Credit: Draine Fig 12.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Interstellar continuum radiation field \(u_\nu\) (spectral energy distribution) in an H I cloud in the solar neighborhood (not including spectral lines). Solid line is the sum of all individual components. Squares are additional all-sky measurements from COBE-DIRBE and ROSAT. Credit: Draine Fig 12.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In our solar neighborhood, from lowest frequency to highest, we are dominated by&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Galactic synchrotron&lt;/strong&gt; from relativistic electrons in the galactic magnetic field. Dominates at \(\nu &amp;lt; 1\) GHz. Spatially variable, enhanced near SNe remnants. You can measure the sky-averaged antenna temperature from synchrotron and you&amp;rsquo;ll find
$$
\langle T_A \rangle \approx 700\,\mathrm{K}
$$
at \(\nu = 140\) MHz, meaning that the radio sky (at low frequencies) is &lt;em&gt;very&lt;/em&gt; bright. But, the total energy density (integrated over relevant frequencies) is comparably small.&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/plate4.png"
 alt="Credit: Draine Plate 4"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 4&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;CMB&lt;/strong&gt; at \(\nu=1\) GHz, the CMB exceeds the synchrotron. \(T_\mathrm{CMB} = 2.7255\), essentially isotropic, with the primary departure from the dipole perturbation of the motion of the Sun relative to CMB rest-frame.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Far-infrared (FIR) and infrared (IR)&lt;/strong&gt; emission from dust grains heated by starlight. Dominates between wavelengths of 600 microns up to 5 microns. Most of the spectrum can be approximated as thermal emission from dust grains at a temperature of 17 K. The remaining part is (wavelengths shorter than 50 microns) is power radiated by vibrational emission bands, likely from polycyclic aromatic hydrocarbon (PAH) particles&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/plate2.png"
 alt="Credit: Draine Plate 2"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;Emission from \(10^4\) K &lt;strong&gt;plasma&lt;/strong&gt;, which includes free-free, free-bound, and bound-bound transitions.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Starlight&lt;/strong&gt;, i.e., photons from stellar photospheres. At optical wavelengths, most photons are starlight. Local starlight background is the sum of diffuse blackbodies. Within an H I region, very little radiation at UV and FUV wavelengths (see super shop dropoff), because this radiation is strongly absorbed by neutral H and He. This type of radiation is very important for photoexciting \(\mathrm{H}_2\) and other molecules, photoionizing heavy elements, and ejecting photons from dust grains. The UV radiation is very spatially variable, because the O and B stars that are the primary emitters of this radiation are sporadic and clumped together in star forming regions.&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/plate1.png"
 alt="Credit: Draine Plate 1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;X-ray emission&lt;/strong&gt; from very hot (\(T = 10^5 - 10^8\)) K plasma. Most of this from SNe, injecting energy into the ISM, energy goes into thermal plasma, and then re-radiated as x-rays and extreme UV. Lower energies easily absorbed by neutral H I, so at the boundary between this and the starlight regions, the radiation density is highly variable. In external galaxies, AGN can make this part of energy density much higher.&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/plate5.png"
 alt="Credit: Draine Plate 5"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine Plate 5&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="spectrum-in-a-photodissociation-region-pdr"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/#spectrum-in-a-photodissociation-region-pdr"&gt;Spectrum in a photodissociation region (PDR)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let&amp;rsquo;s look at a massive O star illuminating an H II region adjacent to a molecular cloud, called a photodissociation region or PDR.&lt;/p&gt;
&lt;p&gt;Compared to the previous, this radiation field additionally includes&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;radiation from the O star (at \( h \nu &amp;lt; 13.6 )) eV), everything higher absorbed within H II&lt;/li&gt;
&lt;li&gt;free-free emission from H II region&lt;/li&gt;
&lt;li&gt;line emission from H II&lt;/li&gt;
&lt;li&gt;emission from warm dust in PDR&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/fig_12_3.jpg"
 alt="Radiation field in neutral gas adjacent to an H II: region. Starlight from an O star. Spectral lines not shown. Credit: Draine, Figure 12.3"&gt;&lt;figcaption&gt;
 &lt;p&gt;Radiation field in neutral gas adjacent to an H II: region. Starlight from an O star. Spectral lines not shown. Credit: Draine, Figure 12.3&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="ionization-processes"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/#ionization-processes"&gt;Ionization Processes&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 13&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Ionization state varies widely from place to place. Define ionization fraction
$$
x_e = n_e/n_\mathrm{H}.
$$&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;in &lt;strong&gt;dense molecular clouds&lt;/strong&gt;, material is almost entirely neutral \(x_e \lesssim 10^{-6}\)&lt;/li&gt;
&lt;li&gt;in &lt;strong&gt;&amp;ldquo;H I gas,&amp;rdquo;&lt;/strong&gt; the hydrogen in predominately neutral but is partially ionized in the range \(10^{-3} &amp;lt; x_e &amp;lt; 10^{-1}\), mostly by cosmic rays. Elements like carbon are photoionized by starlight.&lt;/li&gt;
&lt;li&gt;in an &lt;strong&gt;&amp;ldquo;H II&amp;rdquo; region&lt;/strong&gt;, hydrogen is mostly ionized, helium mostly singly ionized, and oxygen or neon doubly ionized&lt;/li&gt;
&lt;li&gt;in a &lt;strong&gt;supernova remnant&lt;/strong&gt;, elements up through carbon may be fully ionized (i.e., no electrons!) and oxygen and neon may contain electrons only in inner shell&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="processes-that-alter-ionization-state"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/#processes-that-alter-ionization-state"&gt;Processes that alter ionization state&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;photoelectric absorption&lt;/strong&gt;
$$
X + h \nu \rightarrow X^+ + e^-
$$&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/fig_13_1.jpg"
 alt="Photoionization cross sections for hydrogen atoms, molecular hydrogen, helium, carbon, and oxygen. The cross sections are high when the photon energy is close to the ionization energy, zero when the photon is below this, and decline as a power law when photon energies are much higher. The cross sections for C and O have an absorption edge corresponding to the minimum photon energy for photoionization from the 1s shell, thus boosting the cross section considerably compared to hydrogen. Credit: Draine 13.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Photoionization cross sections for hydrogen atoms, molecular hydrogen, helium, carbon, and oxygen. The cross sections are high when the photon energy is close to the ionization energy, zero when the photon is below this, and decline as a power law when photon energies are much higher. The cross sections for C and O have an absorption edge corresponding to the minimum photon energy for photoionization from the 1s shell, thus boosting the cross section considerably compared to hydrogen. Credit: Draine 13.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

At high energies, heavy elements can dominate the total photoionization cross section, even though their total abundance is much less compared to hydrogen.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;photoelectric absorption followed by Auger effect&lt;/strong&gt;: photoionization ejects an electron from inner shell, one electron from outer shell drops, second electron promoted to an excited level, possibly unbound&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;collisional ionization&lt;/strong&gt;
$$
X + e^- \rightarrow X^+ + 2 e^-
$$
Electron hits an atom, removes another electron, leaving ionized atom.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;cosmic ray ionization&lt;/strong&gt;
$$
X + CR \rightarrow X^+ + e^- + CR
$$
What are cosmic rays? They can be thought of as &amp;ldquo;non-thermal&amp;rdquo; electrons and ions, i.e., they&amp;rsquo;re moving much faster than would be expected given the local temperature of the plasma.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;charge exchange&lt;/strong&gt;
$$
X + Y^+ \rightarrow X^+ + Y
$$
(see next section)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="recombination-of-ions-with-electrons"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/12-ionization-processes/#recombination-of-ions-with-electrons"&gt;Recombination of Ions with Electrons&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 14&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This is the inverse of the processes we just discussed.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;radiative recombination&lt;/strong&gt;
$$
X^+ + e^- \rightarrow X + h \nu
$$
If recombination takes place such that the electron goes straight to the ground state, then the photon that is emitted can ionize hydrogen. If the region has a significant population of neutral hydrogen, then the emitted photon will have a high probability of being absorbed close to where it was emitted. This sets up two &amp;ldquo;regimes&amp;rdquo; of recombination,
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Case A&lt;/strong&gt;: the region is &lt;em&gt;optically thin&lt;/em&gt; to ionizing radiation, such that every ionizing photon emitted during the process escapes. Happens in collisionally ionized regions that are very hot (and very low density)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Case B&lt;/strong&gt;: the region is &lt;em&gt;optically thick&lt;/em&gt; to ionizing radiation, such that ionizing photons emitted are immediately absorbed. The recombinations directly to the ground state do not change the ionization of the gas, only recombinations to states \(n \geq 2\) do. Happens in photoionized nebulae around O and B stars, aka H II regions&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The hydrogen recombination emission spectrum can be calculated from the radiative decays from one level to one or more lower levels. Radiative recombination creates neutral atoms with excited levels, which will then (quickly) decay to other lower levels, producing a characteristic line-emission spectrum.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;the ratios of these lines can be measured and used to calculate the temperature of the region&lt;/li&gt;
&lt;li&gt;the total power from the lines can be used to determine total rate of hydrogen recombination&lt;/li&gt;
&lt;li&gt;relative intensities of recombination lines can be used to estimate dust reddening&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;dielectronic recombination&lt;/strong&gt;
$$
X^+ + e^- \rightarrow X^{**} \rightarrow X + h \nu
$$
important for high temperature plasmas. Ion captures electron and briefly has two electrons in excited states.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;three-body recombination&lt;/strong&gt;
$$
X^+ + e^- + e^- \rightarrow X + e^-
$$&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;charge exchange&lt;/strong&gt;
$$
X^+ + Y \rightarrow X + Y^+
$$
Any electron is seized during a collision.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;neutralization by grain&lt;/strong&gt;
$$
X^+ + \mathrm{grain} \rightarrow X + \mathrm{grain}^+
$$
Collision with a grain transfers charge, if the ionization potential exceeds the &amp;ldquo;work function&amp;rdquo; of the grain.&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Introduction to Radio Astronomy and ALMA</title><link>https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/</link><pubDate>Mon, 20 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Sep+20/1_fjzi6u2t"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Suggested reading&lt;/strong&gt;: &lt;a href="https://almascience.eso.org/documents-and-tools/cycle8/alma-science-primer"&gt;ALMA Primer&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Resources:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/xxx.html"&gt;Essential Radio Astronomy&lt;/a&gt; by James Condon and Scott Ransom (HTML book)&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2017isra.book.....T/abstract"&gt;Interferometry and Synthesis in Radio Astronomy, 3rd Edition&lt;/a&gt; by Thompson, Moran, and Swenson. Electronically available from PSU library.&lt;/li&gt;
&lt;li&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2013tra..book.....W/abstract"&gt;Tools of Radio Astronomy&lt;/a&gt; by Wilson and Hüttemeister. Also electronically available from PSU library.&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.cvent.com/events/virtual-17th-synthesis-imaging-workshop/agenda-0d59eb6cd1474978bce811194b2ff961.aspx"&gt;NRAO Summer School Lectures and Slides&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;The Fourier Transform and its Applications&lt;/em&gt; by R. Bracewell&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;em&gt;Disclaimer&lt;/em&gt;: Radio astronomy (including radio interferometry) is a &lt;em&gt;vast&lt;/em&gt; subject area worthy of several full length courses. I encourage you to check out the resources above to survey the many topical areas that we won&amp;rsquo;t be able to cover in this lecture.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Goal&lt;/em&gt;: provide enough of an introduction to radio astronomy, radio interferometry, and the concepts relevant to ALMA, such that you see some of the connections possible with our mock TAC and scientific justification later in the course.&lt;/p&gt;




&lt;h2 id="atmospheric-windows"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#atmospheric-windows"&gt;Atmospheric Windows&lt;/a&gt;
&lt;/h2&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch1.html#S1"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/atmospheric_windows.jpg"
 alt="Atmospheric windows for astronomy. Credit: ESA/Hubble (F. Granato) and Essential Radio Astronomy."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Atmospheric windows for astronomy. Credit: ESA/Hubble (F. Granato) and Essential Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;By comparison to optical windows, the radio region of the spectrum is downright transparent. Only towards very high radio/microwave frequencies (wavelengths about 1 mm), does the atmospheric transmission start to decline.&lt;/p&gt;
&lt;p&gt;The same
$$
\theta \approx \frac{\lambda}{D}
$$
applies for radio antennas. Take a \(\lambda = 1\;\mathrm{cm}\) observation, for example. Compared to an optical \(\lambda = 500\;\mathrm{nm}\) telescope the same size, the resolution will be a factor of
$$
\frac{1\;\mathrm{cm}}{500\;\mathrm{nm}} = 20,000
$$
worse. Yikes!&lt;/p&gt;
&lt;p&gt;Radio astronomers are constantly trying to find ways to increase angular (spatial) resolution.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;One way is to build bigger telescopes, such as the Green Bank Telescope (100m diameter)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://public.nrao.edu/gallery/green-bank-telescope/"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/gbt.jpg"
 alt="The Green Bank Telescope (100m diameter) operates at radio wavelegths. Credit: NRAO/AUI/NSF"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Green Bank Telescope (100m diameter) operates at radio wavelegths. Credit: NRAO/AUI/NSF&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;Another way is to work at higher frequencies (shorter wavelengths), e.g. sub-mm radio astronomy (IRAM 30m telescope)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.iram-institute.org/EN/30-meter-telescope.php"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/iram.jpg"
 alt="The IRAM 30m diameter telescope, which operates at sub-mm wavelengths. Credit: Wikipedia/IRAM-gre"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The IRAM 30m diameter telescope, which operates at sub-mm wavelengths. Credit: Wikipedia/IRAM-gre&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;ul&gt;
&lt;li&gt;A final way is to use &lt;em&gt;interferometry&lt;/em&gt;, sometimes also at sub-mm wavelengths (ALMA)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/alma.jpg"
 alt="The Atacama Large (Sub)millimeter Array, an interferometric array of 66 antennas operating at sub-millimeter wavelengths. The largest antennas in the array are only 12m in diameter, yet through interferometry, the array is able to obtain far higher spatial resolution than the largest single-dish antennas. Credit: NRAO/ESO/NAOJ/JAO"&gt;&lt;figcaption&gt;
 &lt;p&gt;The Atacama Large (Sub)millimeter Array, an interferometric array of 66 antennas operating at sub-millimeter wavelengths. The largest antennas in the array are only 12m in diameter, yet through interferometry, the array is able to obtain far higher spatial resolution than the largest single-dish antennas. Credit: NRAO/ESO/NAOJ/JAO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;It&amp;rsquo;s easier to build larger telescopes at longer wavelengths because the tolerances required for the reflecting surface are less strict than at optical wavelengths. Though typically one must keep surface tolerances to within
$$
\sigma = \frac{\lambda_\mathrm{min}}{16},
$$
otherwise the efficiency of the antenna starts to decline substantially. For the 100 m diameter GBT operating at it&amp;rsquo;s highest frequency (100 GHz) or 3 mm, this translates to \(200\;\mu\mathrm{m}\), which is the thickness of two sheets of paper! That&amp;rsquo;s quite an engineering challenge, and is the reason why large, steerable dishes are difficult to build.&lt;/p&gt;
&lt;p&gt;Keeping telescopes fixed is one way to build a little bit bigger, such as &lt;a href="https://en.wikipedia.org/wiki/Five-hundred-meter_Aperture_Spherical_Telescope"&gt;FAST&lt;/a&gt;, which is a five hundred meter diameter fixed telescope in China. See also Arecibo, which unfortunately collapsed in December 2020. Eventually, though, the materials/engineering cost to building large single dish telescopes becomes prohibitive.&lt;/p&gt;




&lt;h2 id="single-dish-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#single-dish-observations"&gt;Single dish observations&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The &amp;ldquo;beam&amp;rdquo; or power pattern as a function of direction, of a receiving antenna can be calculated using the reciprocity theorems for transmitting and receiving antennas. We don&amp;rsquo;t have time to cover it in this lecture, but the end result is that the far field electric field pattern \(f(l)\) is the Fourier transform of the electric field illuminating the aperture of the telescope \(g(u)\).&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S1"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/uniform-dish.png"
 alt="A schematic illustration of (top): Uniformly illuminated aperture (middle): The electric field pattern of the antenna, as a function of direction (bottom): The power pattern of the antenna, as a function of direction. Credit: Essential Radio Astronomy"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A schematic illustration of (top): Uniformly illuminated aperture (middle): The electric field pattern of the antenna, as a function of direction (bottom): The power pattern of the antenna, as a function of direction. Credit: Essential Radio Astronomy&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;For large apertures, the nulls at \( l = \pm 1, 2, \ldots\) appear at the angles \(\theta \approx \lambda/D, 2 \lambda/D, \ldots\). In two dimensions, for a circular aperture, this is an Airy pattern.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/beam-pattern.png"
 alt="A beam power pattern plotten in polar coordinates, demonstrating that the antenna can pick up power from sidelobes at range of angles. Credit: Tools of Radio Astronomy."&gt;&lt;figcaption&gt;
 &lt;p&gt;A beam power pattern plotten in polar coordinates, demonstrating that the antenna can pick up power from sidelobes at range of angles. Credit: Tools of Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;This is an idealized representation, but is still helpful. The beam can pick up power through sidelobes at a range of angles. Directional antennas help concentrate power in the main beam, but antennas with secondary stages (and thus supports, like most telescopes) create opportunities for ground radiation to reflect into the receiver. The relationship between the aperture and the electric&lt;/p&gt;
&lt;p&gt;You can think of single-dish telescopes (unless they have a sophisticated, multi-pixel receiver) essentially as single-pixel devices. So to make a map of the sky, you would need to raster scan the telescope across the region of interest, reading out antenna temperature as a function of RA, Dec. To make a good (i.e., scientifically accurate) map, you should focus on Nyquist sampling the sky to a uniform sensitivity, usually through a hexagonal pattern of dithering. More advanced instruments may have an array of &amp;ldquo;feeds&amp;rdquo; in a focal plane (mirroring a set of &amp;ldquo;pixels&amp;rdquo;), but this is still a small number of pixels compared to a typical CCD (e.g., 25 or 36).&lt;/p&gt;




&lt;h2 id="temperatures-redux"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#temperatures-redux"&gt;Temperatures, redux&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In an earlier lecture, we gave definitions of &amp;ldquo;brightness temperature&amp;rdquo; and &amp;ldquo;antenna temperature&amp;rdquo; following Draine&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;brightness temperature&lt;/strong&gt;: the temperature corresponding to the specific intensity if we used the full form of the Planck formula&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;antenna temperature&lt;/strong&gt;: the temperature corresponding to the specific intensity if we were in the Rayleigh-Jeans domain. Added benefit that specific intensity is &lt;em&gt;linearly related&lt;/em&gt; to antenna temperature and makes it easy to substitute one for the other.
$$
T_A(\nu) = \frac{c^2}{2 k \nu^2} I_\nu
$$&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;These definitions are still true, but also somewhat idealized, as we&amp;rsquo;ll cover in a second.&lt;/p&gt;
&lt;p&gt;In the field of radio astronomy, be aware that one frequently combines temperatures in other interesting ways. One can express random noise power in terms of an effective temperature
$$
P = k T \Delta \nu
$$
where \(\Delta \nu\) is the bandwidth of the observation. Here the power is equal to the noise power delivered to a &lt;strong&gt;matched load&lt;/strong&gt; by a resistor at physical temperature \(T\). By matched load, we mean we connect a resistor to the input terminals of a linear amplifier. The fact that this resistor has some temperature (i.e., we haven&amp;rsquo;t cooled it to absolute zero&amp;hellip;) means that the thermal motion of the electrons will produce a random, variable current \(i(t)\) input to the amplifier. The mean value of this current is zero, but the root mean squared value is non-zero, and this represents a non-zero power. I.e., you can draw (some) power from a resistor at room temperature, purely from thermal motions. The situation is not dissimilar to the random walk of a particle in Brownian motion. For more details, see &lt;em&gt;Tools of Radio Astronomy&lt;/em&gt;, Chapter 1.8.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Antenna temperature&lt;/strong&gt; \(T_A\) the component of the power received by the antenna from &lt;em&gt;cosmic sources&lt;/em&gt;. It has the same interpretation as before (though we&amp;rsquo;ll talk about beam dilution in a second).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Receiver temperature&lt;/strong&gt; \(T_R\) the component of the power from internal noise of the receiver components themselves, ground radiation, atmospheric emission, etc&amp;hellip;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;System temperature&lt;/strong&gt; \(T_S = T_A + T_R\) is the sum of receiver temperature and antenna temperature. It&amp;rsquo;s the one power number coming out of the backend of your telescope. It&amp;rsquo;s up to you to calibrate \(T_R\) accurately enough to measure \(T_A\).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In any observation, you will have your cosmic signal of interest and several contributions of noise (see &lt;em&gt;Essential Radio Astronomy&lt;/em&gt;, Ch. 3.6.1)
$$
T_S=T_\mathrm{cmb}+T_\mathrm{rsb}+T_A + [1−\exp(−\tau_A)] T_\mathrm{atm}+T_\mathrm{spill}+T_r+ \ldots
$$
such as the CMB, other galactic background sources, the atmosphere, spillover radiation from the ground, the temperature of the radiometer itself (hopefully cryogenically cooled), etc.&lt;/p&gt;
&lt;p&gt;In the limit that \(T_A \ll T_S\) (most astronomy situations, unfortunately!), we have
$$
S/N \approx C \frac{T_A}{T_S} \sqrt{\Delta \nu \Delta t}
$$
where \(C\) is a constant of proportionality greater than or equal to 1, and \(\Delta t\) is the integration time. If we let \(\Delta \nu \approx 1\;\mathrm{GHz}\) and \(\Delta t \approx 1\;\mathrm{h}\), then we can get \(\sqrt{\Delta \nu \Delta t} \approx 10^6\), allowing us to detect a signal which is less than \(10^{-6}\) the system noise. A great illustration of this capability is the COBE satellite that studied CMB anisotropies with brightness temperatures \(&amp;lt; 10^{-7}\) that of the system temperature. To achieve these contrasts, however, it&amp;rsquo;s important to keep systematics under control, otherwise the S/N scaling won&amp;rsquo;t hold!&lt;/p&gt;




&lt;h2 id="beam-dilution"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#beam-dilution"&gt;Beam dilution&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In previous lectures, we&amp;rsquo;ve been talking about specific intensity \(I_\nu(\Omega)\) as a &amp;ldquo;known&amp;rdquo; quantity of direction (e.g., R.A., Dec.) and used antenna temperature \(T_A\) as a linear proxy for its specification. For the following discussion, we&amp;rsquo;re going to move into the realm of observations, and discuss the ways \(T_A\) can be an unfaithful proxy for the &amp;ldquo;true&amp;rdquo; specific intensity distribution or brightness temperature. we&amp;rsquo;ll use the symbol \(T_b(\Omega)\) to denote the &amp;ldquo;true&amp;rdquo; brightness/antenna temperature, assuming we&amp;rsquo;re in the Rayleigh-Jeans limit, and redefine \(T_A\) to mean the response of the telescope to the cosmic radiation.&lt;/p&gt;
&lt;p&gt;When we&amp;rsquo;re doing observations, we don&amp;rsquo;t always have access to the highest resolution version of \(T_b(\Omega)\), but rather we have access to a quantity which is the true \(T_b(\Omega)\) convolved with the beam of the telescope, which is the implication of \(T_A\) for this discussion.&lt;/p&gt;




&lt;h4 id="large-fully-resolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#large-fully-resolved-source"&gt;large (fully resolved) source&lt;/a&gt;
&lt;/h4&gt;
&lt;p&gt;[Draw a smooth cloud with the beam overlaid as a small circle].&lt;/p&gt;
&lt;p&gt;In the limit that we are observing a source that subtends a solid angle much larger than the beam of the antenna,
$$
\Omega_S &amp;gt; \Omega_A
$$
the convolution of the beam doesn&amp;rsquo;t matter, we&amp;rsquo;re still sensing approximately the same \(T_b(\Omega)\) such that
$$
T_A(\Omega) \approx T_b(\Omega).
$$
If the source is in LTE, then we also have that \(T_A \approx T\).&lt;/p&gt;




&lt;h3 id="small-unresolved-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#small-unresolved-source"&gt;small (unresolved) source&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;[Draw a beam with a smaller source in the center]&lt;/p&gt;
&lt;p&gt;If the source is more compact than the antenna beam,
$$
\Omega_S &amp;lt; \Omega_A,
$$
then the measured antenna temperature is basically the &amp;ldquo;true&amp;rdquo; intensity averaged over the area of the main beam. This lowers the measured antenna temperature by a factor
$$
\frac{T_A}{T_b} = \frac{\Omega_S}{\Omega_A}
$$
where the ratio \(\frac{\Omega_S}{\Omega_A}\) is called the &lt;strong&gt;beam filling factor&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;For example, you could have a compact source with \(T_b = 10^4\) K, but if it only fills 1% of the beam solid angle then you would measure an antenna temperature of 100 K. If you took your observations at face-value (and assumed LTE), then you would incorrectly conclude that the source is 100x cooler than it actually is.&lt;/p&gt;
&lt;p&gt;Beam dilution also applies to observations of sources that have &lt;em&gt;structure&lt;/em&gt; on spatial scales below the observable limit, which, to be honest, is going to be most astrophysical sources of interest. For example, consider a gas filament in a star-forming region.&lt;/p&gt;
&lt;p&gt;[Draw an approximation of a gas filament]&lt;/p&gt;
&lt;p&gt;Radio-bright, spatially concentrated regions will be &amp;ldquo;smeared out&amp;rdquo; by the beam. If you wanted to use antenna temperature (and assume LTE) to estimate the physical conditions of the gas filament, you do so at the peril of measuring incorrect temperatures. The unfortunate reality here is that, without higher resolution images to guide you (which sometimes exist at optical or infrared wavelengths), it&amp;rsquo;s quite difficult to estimate how badly your measurements are affected by beam dilution.&lt;/p&gt;




&lt;h3 id="point-source"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#point-source"&gt;point source&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;In the case of an unresolved &lt;em&gt;point source&lt;/em&gt; with known flux \(F_\nu\), it&amp;rsquo;s not that helpful to talk about brightness temperatures if we can&amp;rsquo;t also estimate the solid angle that the source subtends \(\Omega_S\), since it is required to relate flux to specific intensity (or brightness temperature). [Sometimes we &lt;em&gt;can&lt;/em&gt; estimate \(\Omega_S\) from theoretical arguments, even if the source is so small as to still be unresolvable, in which case we&amp;rsquo;d still use the above calculations.]&lt;/p&gt;
&lt;p&gt;For point sources, it&amp;rsquo;s easier to talk about antenna temperature using
$$
T_A = \frac{P_\nu}{k} = \frac{A_e F_\nu}{2 k}
$$
where \(A_e\) is the effective collecting area of the telescope. This gives us the strange but sometimes convenient relationship of talking about a telescope&amp;rsquo;s sensitivity to a point source using &amp;ldquo;kelvins per jansky,&amp;rdquo; which is proportional to its collecting area.&lt;/p&gt;
&lt;p&gt;For more useful single-dish guidance, see &lt;a href="https://www.atnf.csiro.au/research/radio-school/2011/talks/Parkes-school-Fundamental-II.pdf"&gt;these notes&lt;/a&gt; by James Jackson, or &lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S1.SS6"&gt;Ch 3.1.6&lt;/a&gt; of &lt;em&gt;Essential Radio Astronomy&lt;/em&gt;.&lt;/p&gt;




&lt;h2 id="interferometers"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#interferometers"&gt;Interferometers&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="two-element-interferometer"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#two-element-interferometer"&gt;Two-element interferometer&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We&amp;rsquo;ll focus our discussion around the two-element interferometer, since even arrays of antennas (like ALMA) can be understood by breaking them down (conceptually) into this fundamental unit.&lt;/p&gt;
&lt;p&gt;Consider two antennas separated by some distance \(b\) simultaneously observing a source. Unless the source is directly overhead, the radiation from the source will arrive at the antennas at slightly different times, corresponding to a geometric time delay \(\tau_g\). The voltage streams from these two antennas \(V_1\) and \(V_2\) are then &lt;strong&gt;correlated&lt;/strong&gt; (multiplied together and then averaged) to form the output stream \(R\).&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S7"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/interferometer.png"
 alt="A schematic of a two-element interferometer. Credit: Essential Radio Astronomy"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A schematic of a two-element interferometer. Credit: Essential Radio Astronomy&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;For now, we&amp;rsquo;ll call this a &lt;em&gt;cosine&lt;/em&gt; correlator because of the cosine dependence of the correlation stream.&lt;/p&gt;
&lt;p&gt;The fringe pattern in the lower right of this figure is what you would get if you kept the antennas stationary and let a point source drift across the field of view. The broad envelope is the attenuation from the primary beam response of the antennas as the source enters and then leaves the field of view. The fringe frequency depends on the observing frequency and the baseline separating the antennas.&lt;/p&gt;
&lt;p&gt;Interestingly, the time averaged response of a multiplying interferometer has a mean of 0. This has the consequence that uncorrelated noise power from very extended sources (such as the CMB) will average to 0, therefore, the interferometer is not sensitive to radiation on those large spatial scales. This is called &lt;em&gt;spatial filtering&lt;/em&gt;.&lt;/p&gt;




&lt;h3 id="multiple-antennas"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#multiple-antennas"&gt;Multiple antennas&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;With multiple pairs of antennas, we can build up an improved point source response, because the fringe patterns of the individual two-element interferometers add up.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.cv.nrao.edu/~sransom/web/Ch3.html#S7"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/building_a_beam.png"
 alt="How the fringe patterns of the individual two-element interferometers in an array build up to create a beam. Each shaded circle represents an antenna spaced out along a 1D line. Credit: Essential Radio Astronomy."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;How the fringe patterns of the individual two-element interferometers in an array build up to create a beam. Each shaded circle represents an antenna spaced out along a 1D line. Credit: Essential Radio Astronomy.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In the limit of many antennas, the beam usually looks like something centralized, approximately Gaussian, but usually containing many sidelobes (hence, why radio astronomers call this a &amp;ldquo;dirty beam&amp;rdquo;).&lt;/p&gt;
&lt;p&gt;The resolution of the beam is on the order of \(\lambda/b\), where \(b\) is the projected baseline of the longest antenna pair. Note that this resolution strongly depends on the relative number of baselines at long/short separations. It won&amp;rsquo;t suffice to just have a single long-baseline pair of antennas if most other antenna pairs are at much shorter baselines.&lt;/p&gt;




&lt;h3 id="complex-correlators"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#complex-correlators"&gt;Complex correlators&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Now let&amp;rsquo;s move beyond a discussion of point sources and consider what we&amp;rsquo;ll call a &amp;ldquo;slightly extended source,&amp;rdquo; something that is larger than the dirty beam but smaller than the primary beam of each telescope.&lt;/p&gt;
&lt;p&gt;Instantaneous field of view of an interferometer is the same as the primary beam of each telescope, treated as a single dish (see previous section). Each single dish antenna is still seeing the same thing as before, it&amp;rsquo;s just that we have a correlator backend that&amp;rsquo;s doing things with the signals, allowing us to place many &lt;em&gt;synthesized beams&lt;/em&gt; within the area of the primary beam.&lt;/p&gt;
&lt;p&gt;The cosine correlator we discussed has some limitations, namely that the quantity
$$
R_c = \int I(\hat{s}) \cos(2 \pi \vec{b} \cdot \hat{s}/\lambda) d\Omega
$$
term is only sensitive to the &lt;em&gt;even&lt;/em&gt; (symmetric) component of a source.&lt;/p&gt;
&lt;p&gt;[Draw cosine curve centered around \(\vec{b} \cdot \hat{s} = 0 \), show that there is ambiguity in the response.]&lt;/p&gt;
&lt;p&gt;We can decompose any function into a sum of its even and odd (antisymmetric) parts. To capture the &lt;em&gt;odd&lt;/em&gt; parts of the source, we need a different kind of correlation,
$$
R_s = \int I(\hat{s}) \sin(2 \pi \vec{b} \cdot \hat{s}/\lambda) d\Omega
$$
which can be obtained by inserting a \(\pi/2 = 90^\circ\) phase delay in the output of one antenna.&lt;/p&gt;
&lt;p&gt;When we combine cosine and sine correlators (in hardware, this is all the same system), we have what is called a &lt;em&gt;complex correlator&lt;/em&gt;. It is usually more convenient to treat the cosine and sine components simultaneously using Euler&amp;rsquo;s formula and we have
$$
\mathcal{V} = R_c - i R_s
$$
or
$$
\mathcal{V} = \int I(\hat{s}) \exp(- i 2 \pi \vec{b} \cdot \hat{s}/\lambda) d\Omega.
$$
The quantity \(\mathcal{V}\) is called the &lt;em&gt;visibility&lt;/em&gt; and it is a complex number. It is the fundamental data product from an interferometer, and, like all data, it is usually measured with some noise. For more on this, see the &lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL notes on likelihood functions&lt;/a&gt;.&lt;/p&gt;




&lt;h3 id="coordinates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#coordinates"&gt;Coordinates&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;The equation we&amp;rsquo;ve just derived is fairly general, but let&amp;rsquo;s expand this a bit to the astronomical coordinate systems we&amp;rsquo;re more used to.&lt;/p&gt;
&lt;p&gt;Rather than referring to the source brightness distribution with \(I_\nu(\hat{s})\), usually we&amp;rsquo;ll refer to some &lt;em&gt;phase center&lt;/em&gt;. [Typically this is the direction the antennas are pointed (i.e., center of primary beam), but doesn&amp;rsquo;t need to be and can be changed after the fact in software.]&lt;/p&gt;
&lt;p&gt;We can define the direction cosines in a tangent plane, such that we have \(l = \sin(\Delta \alpha \cos \delta)\) and \(m = \sin(\Delta \delta)\), where \(\alpha\) and \(\delta\) are R.A. and Dec., respectively.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/thompson_uv.png"
 alt="The relationship between baseline orientation, source position, and direction cosines \(x = l\) and \(y = m\). Credit: Tools of Radio Astronomy from Thompson 1982."&gt;&lt;figcaption&gt;
 &lt;p&gt;The relationship between baseline orientation, source position, and direction cosines \(x = l\) and \(y = m\). Credit: Tools of Radio Astronomy from Thompson 1982.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;We can rewrite the projected baseline in terms of its east-west \(u\) and north-south \(v\) components, where&lt;/p&gt;
&lt;p&gt;$$
u = \left [ \frac{\vec{b} \cdot \hat{s}}{\lambda} \right ]_\mathrm{east-west}
$$&lt;/p&gt;
&lt;p&gt;and&lt;/p&gt;
&lt;p&gt;$$
v = \left [ \frac{\vec{b} \cdot \hat{s}}{\lambda} \right ]_\mathrm{north-south}.
$$&lt;/p&gt;
&lt;p&gt;So we have that the image domain coordinates \(l, m\) have corresponding Fourier plane coordinates \(u, v\), called spatial frequencies.&lt;/p&gt;
&lt;p&gt;And the relationship between source brightness distribution and the visibility function is given by
$$
\mathcal{V}(u,v) = \int \int I(l,m) \exp \{ - 2 \pi i (ul + vm)\} dl dm.
$$
This is a two-dimensional Fourier transform.&lt;/p&gt;
&lt;p&gt;It turns out that the measured spatial frequencies correspond directly to the projected baseline vector lengths, if the baseline is measured in multiples of the observing frequency.&lt;/p&gt;
&lt;p&gt;Longer baselines measure higher spatial frequencies, which correspond to finer-scale image plane features.&lt;/p&gt;




&lt;h3 id="fourier-transform-pairs"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#fourier-transform-pairs"&gt;Fourier Transform pairs&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Here are some examples of Fourier transform pairs, where the left column represents the sky-plane intensity distribution \(I(l,m)\) and the right column represents the amplitude of the visibility function, \(|\mathcal{V}|^2\). (Remember, the visibility is a complex quantity, and also has a phase. Alternatively, it can be expressed in terms of its real and imaginary components.)&lt;/p&gt;
&lt;figure&gt;&lt;a href="http://www.aoc.nrao.edu/events/synthesis/2010/lectures/wilner_synthesis10.pdf"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/delta-func.png"
 alt="Narrower features in one domain will result in broader features in the other. Credit: David Wilner"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Narrower features in one domain will result in broader features in the other. Credit: David Wilner&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="http://www.aoc.nrao.edu/events/synthesis/2010/lectures/wilner_synthesis10.pdf"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/ellipsoid.png"
 alt="Sharp edges will result in power at high spatial frequencies. Credit: David Wilner"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Sharp edges will result in power at high spatial frequencies. Credit: David Wilner&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="http://www.aoc.nrao.edu/events/synthesis/2010/lectures/wilner_synthesis10.pdf"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/lo.png"
 alt="A real image (such as this picture of former NRAO director Fred Lo) has complicated structure on many scales. In general, a &amp;rsquo;natural&amp;rsquo; image will typically have more power at lower spatial frequencies than higher spatial frequencies. An example of a non-natural image is one containing text or artifical borders (lots of sharp lines), like the Galileoscope logo. Credit: David Wilner"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A real image (such as this picture of former NRAO director Fred Lo) has complicated structure on many scales. In general, a &amp;rsquo;natural&amp;rsquo; image will typically have more power at lower spatial frequencies than higher spatial frequencies. An example of a non-natural image is one containing text or artifical borders (lots of sharp lines), like the Galileoscope logo. Credit: David Wilner&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="http://www.aoc.nrao.edu/events/synthesis/2010/lectures/wilner_synthesis10.pdf"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/spatial-filtering.png"
 alt="A demonstration of how an interferometer with an insufficient number of short baselines (right column) is insensitive to large-scale emission that is actually there (left column). This effect is especially important for studies of the ISM, where there can be diffuse emission from molecular clouds next to protoplanetary disks. Credit: David Wilner"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A demonstration of how an interferometer with an insufficient number of short baselines (right column) is insensitive to large-scale emission that is actually there (left column). This effect is especially important for studies of the ISM, where there can be diffuse emission from molecular clouds next to protoplanetary disks. Credit: David Wilner&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="aperture-synthesis"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#aperture-synthesis"&gt;Aperture synthesis&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Because the earth rotates, the projected baseline to a source also changes throughout an observation. This can be used to great advantage to sample different spatial frequencies than were otherwise possible in a &amp;ldquo;snapshot&amp;rdquo; observation. The following is a video showing how UV coverage for the MAPS large program is built up over a series of eight 1-hour long observations, spaced out over a single year, and the corresponding &amp;ldquo;dirty beam&amp;rdquo; that results.&lt;/p&gt;
&lt;video controls preload="auto" width="100%" playsinline class="html-video"&gt;
 &lt;source src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/baselines-beams.mp4" type="video/mp4"&gt;
 &lt;p&gt;&lt;/p&gt;
&lt;/video&gt;




&lt;h3 id="making-images-from-visibilities"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#making-images-from-visibilities"&gt;Making images from visibilities&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;How do go from the interferometric data samples of the visibility function to images? This is a complicated question.&lt;/p&gt;
&lt;p&gt;We sampled the visibility function
$$
\mathcal{V}(u,v) = \int \int I(l,m) \exp \{ - 2 \pi i (ul + vm)\} dl dm.
$$
at a bunch of \(u,v\) points corresponding to the projected baselines of the array. There are many \(u,v\) points &lt;em&gt;that we didn&amp;rsquo;t sample&lt;/em&gt;, and so it&amp;rsquo;s not possible to carry out the inverse Fourier transform completely accurately, with data alone.&lt;/p&gt;
&lt;p&gt;What can we do?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;One approach is to not make images, and instead forward-model your data using a parametric model, like you might fit a line to a bunch of points. These are complex numbers, but the same principle applies. This has the benefit that you can do Bayesian inference on the parameters of the model (e.g., a Gaussian with some width at some location, etc&amp;hellip;) but the drawback that you need to specify said model. For more information, see the discussion of model-fitting section of the &lt;a href="https://mpol-dev.github.io/MPoL/rml_intro.html"&gt;MPoL documentation&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Another is to fearlessly charge on and just take the direct inverse Fourier transform of the samples. This generates an image called the &amp;ldquo;dirty image,&amp;rdquo; because it retains the features of the dirty beam. The assumption here is that the unsampled spatial frequencies carry zero power (not a great assumption, in general), and their visibility values are set to zero. This means that the dirty image is highly sensitive to the array configuration, not something we want! The following movie shows the dirty image that results from choosing a different 75% of the dataset in a round robin fashion for each frame. There is no noise&amp;mdash;each visibility is sampled perfectly.&lt;/li&gt;
&lt;/ul&gt;
&lt;video controls preload="auto" width="100%" playsinline class="html-video"&gt;
 &lt;source src="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/alma_noiseless.mp4" type="video/mp4"&gt;
 &lt;p&gt;&lt;/p&gt;
&lt;/video&gt;
&lt;p&gt;There are several algorithms one can employ to try to recover the &amp;ldquo;true&amp;rdquo; sky brightness from the dirty image.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;CLEAN: iterative image-plane deconvolution. Build up a &amp;ldquo;CLEAN Model&amp;rdquo; using point sources, restore it with a &amp;ldquo;CLEAN beam,&amp;rdquo; which is typically a Gaussian.&lt;/li&gt;
&lt;li&gt;Regularized Maximum Likelihood (RML) imaging: forward-modeling of the visibilities (e.g., &lt;a href="https://mpol-dev.github.io/MPoL/"&gt;MPoL&lt;/a&gt;). Need to specify prior distributions in addition to the aforementioned visibility likelihood.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="science-capacities-of-alma"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/11-intro-radio-astronomy/#science-capacities-of-alma"&gt;Science capacities of ALMA&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In the remaining time, we&amp;rsquo;ll discuss the science capabilities of ALMA. A great resource for these materials is the&lt;/p&gt;
&lt;p&gt;&lt;a href="https://almascience.eso.org/documents-and-tools/cycle8/alma-science-primer"&gt;ALMA Primer&lt;/a&gt;&lt;/p&gt;</description></item><item><title>Propagation of Radio Waves through the ISM</title><link>https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/</link><pubDate>Sat, 11 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Sep+15/1_4xb4s6ls"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch 11&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="propagation-of-radio-waves-through-ism"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/#propagation-of-radio-waves-through-ism"&gt;Propagation of Radio Waves through ISM&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Propagating radio waves interact with the plasma particles, and therefore, the velocity of the wave propagation is &lt;em&gt;slower&lt;/em&gt; than would be the case in a vacuum. I.e., the speed of light is slower when it has to move through stuff!&lt;/p&gt;
&lt;p&gt;In this lecture, we&amp;rsquo;ll discuss how to calculate the &lt;strong&gt;dispersion measure&lt;/strong&gt; and the &lt;strong&gt;rotation measure&lt;/strong&gt; of electromagnetic radiation propagating through the ISM, and then how to use them to learn about the properties of the intervening ISM.&lt;/p&gt;




&lt;h3 id="dispersion-relation-for-cold-plasmas"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/#dispersion-relation-for-cold-plasmas"&gt;Dispersion relation for cold plasmas&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let wavenumber be
$$
k = \frac{2 \pi}{\lambda},
$$
and angular frequency be related to linear frequency by
$$
\nu = \frac{\omega}{2\pi}.
$$&lt;/p&gt;
&lt;p&gt;Electromagnetic waves propagating through a cold plasma with electron density \(n_e\) must satisfy the dispersion relation
$$
k^2 c^2 = \omega^2 - \omega_p
$$
where
$$
\omega_p = \left ( \frac{4 \pi n_e e^2}{m_e} \right)^{1/2} = 5.641 \times 10^4 \sqrt{\frac{n_e}{\mathrm{cm}^{-3}}} \; \mathrm{s}^{-1}
$$
is the plasma frequency. If we&amp;rsquo;re in a vacuum, no dispersion!&lt;/p&gt;
&lt;p&gt;There&amp;rsquo;s something called the &lt;strong&gt;phase velocity&lt;/strong&gt;, which is the propagation of the surface of constant phase:
$$
v_\mathrm{phase} = \frac{\omega}{k} = \frac{c}{\sqrt{1 - (\omega_p/\omega)^2)}} &amp;gt; c
$$
Confusingly, this velocity &lt;em&gt;can&lt;/em&gt; be faster than the speed of light! &lt;em&gt;But&lt;/em&gt; this is just a &amp;ldquo;mirage&amp;rdquo;&amp;ndash;these don&amp;rsquo;t carry any energy. &lt;a href="https://en.wikipedia.org/wiki/Phase_velocity"&gt;Wikipedia&lt;/a&gt; helps with the visualizations here a lot. They are important for considering polarization, as we&amp;rsquo;ll see with Faraday rotation.&lt;/p&gt;
&lt;p&gt;We are more concerned with the &lt;strong&gt;group velocity&lt;/strong&gt;
$$
v_g(\omega) = \frac{d \omega}{d k}
$$
which, in a plasma, is
$$
v_g = c \left ( 1 - \frac{\omega_p^2}{\omega^2} \right)^{1/2}.
$$
The group velocity is the speed at which information (energy) can be transmitted, and indeed, \(v_g &amp;lt; c\).&lt;/p&gt;




&lt;h3 id="dispersion-measure"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/#dispersion-measure"&gt;Dispersion Measure&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Time delay of radiation as a function of frequency depends on intervening electron density&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Now let&amp;rsquo;s consider some astronomical object that can emit a well-defined pulse of radiation, like a pulsar. Let \(L\) be the distance to the pulsar, and assume that \(t=0\) corresponds to the time of emission. &lt;em&gt;As a function of frequency, what is the arrival time of the emission?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s break the path up into small segments, each with length \(dL\). Using the group velocity for a given frequency, how long does it take radiation of that frequency to traverse \(dL\)? Since velocity is distance over time, we can just divide \(dL\) by the group velocity to get \(dt\)
$$
dt = \frac{dL}{v_g(\omega)}
$$
and
$$
t_\mathrm{arrival} = \int_0^L \frac{dL}{v_g(\omega)}
$$
We can plug in \(v_g(\omega)\), but then we get a tricky term denominator. Let&amp;rsquo;s do a Taylor expansion of \(f(\omega) = 1/v_g(\omega)\) around \(\omega\), and see if we can arrive at an expression that is friendlier to integrate.&lt;/p&gt;
&lt;p&gt;To first order, the &lt;a href="https://en.wikipedia.org/wiki/Taylor_series"&gt;Taylor series&lt;/a&gt; expanded around \(x=a\) will look like
$$
f(x) \approx f(a) + f^\prime(a)(x - a).
$$
This is just a constant plus a linear term (draw a graph). One of the assumptions behind an accurate Taylor expansion is that the perturbation away from the reference point \(a\) is small. To ensure this, we can rewrite \(f(\omega)\) as
$$
f(x) = \frac{1}{c}(1 - x^2)^{-1/2}
$$
where \(x = \omega_p / \omega\). From the previous discussion around plasma frequency, we already pointed out that \(0 \lesssim x \ll 1\), so we will Taylor expand in terms of \(x\) about the point \(a = 0 \), and then resubstitute in for \(x\). This yields
$$
f(\omega) \approx \frac{1}{c} \left ( 1 + \frac{1}{2} \frac{\omega_p^2}{\omega^2}\right)
$$
and allows us to rephrase the integral as
$$
t_\mathrm{arrival} = \int_0^L \frac{dL}{c} \left ( 1 + \frac{1}{2} \frac{\omega_p^2}{\omega^2}\right)
$$&lt;/p&gt;
&lt;p&gt;$$
t_\mathrm{arrival} = \frac{L}{c} + \frac{1}{2 c \omega^2} \int_0^L \omega_p^2 d L.
$$&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s rewrite the plasma frequency \(\omega_p\) in terms of the electron density \(n_e\), and define the &lt;strong&gt;dispersion measure&lt;/strong&gt; to be
$$
DM = \int_0^L n_e d L
$$
such that we have
$$
t_\mathrm{arrival} = \frac{L}{c} + \frac{e^2}{2 \pi m_e c} \frac{1}{\nu^2} DM
$$
or, evaluating the constants,
$$
t_\mathrm{arrival} = \frac{L}{c} + 4.146\times10^{-3} \left ( \frac{\nu}{\mathrm{GHz}} \right)^{-2} \frac{DM}{\mathrm{cm}^{-3}\;\mathrm{pc}}\;\mathrm{s}
$$&lt;/p&gt;
&lt;p&gt;A typical DM for a pulsar 3 kpc away might be \(DM \approx 10^2\;\mathrm{cm}^{-3}\;\mathrm{pc} \), which, at an observing frequency of 1 GHz, would yield a delay of 0.4 s after traveling \(10^4\) years.&lt;/p&gt;
&lt;p&gt;On it&amp;rsquo;s own, this isn&amp;rsquo;t very useful to us, because we don&amp;rsquo;t know when \(t=0\) was. But what we can exploit is the &lt;em&gt;frequency dependence&lt;/em&gt; of the arrival time, \(t_\mathrm{arrival} \propto \mathrm{const} + \nu^{-2}\). This means that &lt;em&gt;lower frequencies arrive later&lt;/em&gt;, such that the pulse is &amp;ldquo;dispersed.&amp;rdquo; We can compare the relative arrival times and compare this to
$$
\frac{d t_\mathrm{arrival}}{d \nu} = - \frac{e^2}{\pi m_e c\nu^3} DM
$$
calculate the DM and thus the integrated electron density.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://casper.astro.berkeley.edu/astrobaki/index.php/Dispersion_measure"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/pulsar_dm.png"
 alt="Observations of (at least three pulses) of a pulsar, showing that lower frequency emission takes longer to arrive. The change in arrival time as a function of frequency can be used to calculate the dispersion measure, and thus the intervening electron density. Credit: caspar.astro.berkeley.edu"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Observations of (at least three pulses) of a pulsar, showing that lower frequency emission takes longer to arrive. The change in arrival time as a function of frequency can be used to calculate the dispersion measure, and thus the intervening electron density. Credit: caspar.astro.berkeley.edu&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Some helpful commentary on dispersion measures in this &lt;a href="https://arxiv.org/abs/2007.02886"&gt;pedagogical note&lt;/a&gt; by S. Kulkarni.&lt;/p&gt;
&lt;p&gt;If you do this for many hundreds of pulsars, and you have reasonable estimates of their \(L\) from other data, then you can build up a 3D model of the electron density of the galaxy, probed by these pencil beam measurements. For example, take a look at these plots of dispersion measure vs. pulsar position complied by &lt;a href="https://ui.adsabs.harvard.edu/abs/2003astro.ph..1598C/abstract"&gt;Cordes and Lazio 2003&lt;/a&gt;.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2003astro.ph..1598C/abstract"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/cordes_lazio_2.png"
 alt="Credit: Cordes and Lazio 2003, arXiv:astro-ph/0301598"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Cordes and Lazio 2003, arXiv:astro-ph/0301598&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/fig_11_2.jpg"
 alt="The electron density \(n_e\) at the Galactic midplane on a line passing through the Galactic Center (GC) and the Sun, as estimated using the NE2001 model of Cordes and Lazio. Credit: Draine Figure 11.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;The electron density \(n_e\) at the Galactic midplane on a line passing through the Galactic Center (GC) and the Sun, as estimated using the NE2001 model of Cordes and Lazio. Credit: Draine Figure 11.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Put together into an estimate of the Galaxy&amp;rsquo;s spiral arm structure:
&lt;figure&gt;&lt;a href="https://ui.adsabs.harvard.edu/abs/2003astro.ph..1598C/abstract"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/cordes_lazio_3.png"
 alt="Credit: Cordes and Lazio 2003, arXiv:astro-ph/0301598"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Cordes and Lazio 2003, arXiv:astro-ph/0301598&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;
&lt;p&gt;Traces things like&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;enhanced electron densities from material that has been shock-compressed by SNe explosions&lt;/li&gt;
&lt;li&gt;underdensities of electrons in &amp;ldquo;local hot bubble,&amp;rdquo; also caused by SNe&lt;/li&gt;
&lt;li&gt;other SNe remnants, too&lt;/li&gt;
&lt;li&gt;GC region has enhanced electron density&lt;/li&gt;
&lt;li&gt;electron density tapering off with distance from midplane, with scale height of ~ 1 or 2 kpc.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="faraday-rotation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/#faraday-rotation"&gt;Faraday rotation&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;polarization of light depends on the the presence of a magnetic field along intervening line of sight&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Define the cyclotron frequency
$$
\omega_B = \frac{e B_{||}}{m_e c}
$$
where \( B_{||}\) is the component of the magnetic field &lt;em&gt;parallel&lt;/em&gt; to the direction of propagation. The dispersion relation for circularly polarized waves in cold plasma is
$$
k^2 c^2 = \omega^2 - \frac{\omega_p^2}{1 \pm \frac{\omega_B}{\omega}}
$$
where the \(\pm\) applies to right/left circular polarization. When we calculate the phase velocity (using similar Taylor expansion tricks for \(\omega_B / \omega \ll 1\) and \(\omega_p / \omega \ll 1\)), we see
$$
v_\mathrm{phase}(\omega) = \frac{\omega}{k(\omega)} \approx c \left [ 1 + \frac{1}{2}\frac{\omega_p^2}{\omega^2} \mp \frac{1}{2}\frac{\omega_p^2 \omega_B}{\omega^3} \right].
$$
What&amp;rsquo;s interesting about this is that the right/left circular polarization modes differ in phase velocity and wavenumber
$$
v_{\mathrm{phase},L} - v_{\mathrm{phase},R} = \frac{4 \pi e^3}{m_e^2} \frac{n_e B_{||}}{\omega^3}.
$$
and
$$
\Delta k = k_R - k_L = \frac{4 \pi e^3}{m_e^2 c^2} \frac{n_e B_{||}}{\omega^2}
$$
You can decompose even pure &lt;a href="https://en.wikipedia.org/wiki/Linear_polarization"&gt;linear polarization&lt;/a&gt; into equal amounts of left and right &lt;a href="https://en.wikipedia.org/wiki/Circular_polarization"&gt;circular polarized light&lt;/a&gt;. If the light propagates through a magnetized plasma, it will have its modes propagate at different phase velocities, which means after propagating some distance \(L\), the two waves will differ in phase by \(L \Delta k\).&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/Faraday_effect"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/faraday.svg"
 alt="Faraday rotation, demonstrating how the angle of linear polarization is rotated when propagating through a magnetized plasma. Credit: Wikipedia, DrBob."&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Faraday rotation, demonstrating how the angle of linear polarization is rotated when propagating through a magnetized plasma. Credit: Wikipedia, DrBob.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;When you recompose this back into a linear polarization, you see that the angle of polarization has rotated clockwise relative to its original angle by
$$
\Phi = \frac{1}{2} \int_0^L \Delta k \; d L = \int_0^L \frac{\omega_p^2 \omega_B}{2 c \omega^2} d L
$$
$$
\Phi = \frac{e^3}{2 \pi m_e^2 c^2}\frac{1}{\nu^2} \int_0^L n_e B_{||} \; dL
$$
$$
\Phi = RM \; \lambda^2
$$
where we define the &lt;strong&gt;rotation measure&lt;/strong&gt; as
$$
RM = \frac{1}{2 \pi} \frac{e^3}{m_e^2 c^4} \int_0^L n_e B_{||} \; dL
$$
$$
RM = 8.120 \times 10^{-5} \int_0^L n_e B_{||} \; dL \frac{\mathrm{rad}\;\mathrm{cm}^{-2}}{\mathrm{cm}^{-3}\; \mu G \; \mathrm{pc}}.
$$
I.e., the rotation measure is the integral of the electron density &lt;em&gt;times&lt;/em&gt; the parallel component of the magnetic field along the line of sight. In analogy to the dispersion measure, we don&amp;rsquo;t know the starting point of the linear polarization angle, but, because the change in rotation angle is wavelength-dependent, by measureing the linear polarization angles at two different wavelengths \(\lambda_1\) and \(\lambda_2\), we can calculate the rotation measure
$$
RM = \frac{\Phi_2 - \Phi_1}{\lambda_2^2 - \lambda_1^2}.
$$&lt;/p&gt;




&lt;h2 id="putting-it-all-together"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/#putting-it-all-together"&gt;Putting it all together&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If you measure &lt;em&gt;both&lt;/em&gt; the RM and DM, you can put them together to calculate the electron density-weighted mean value of the line of sight magnetic field
$$
\langle B_{||} \rangle = \frac{2 \pi m_e^2 c^4}{e^3} \frac{RM}{DM}.
$$
Background sources like AGNs and pulsars are good for this type of calculation because they are in general strongly linearly polarized (from synchrotron radiation). Putting this together for many sightlines, Han et al. 2006 concludes magnetic field in Galactic disk follows spiral structure, but reverses direction from spiral arms to interarm regions.&lt;/p&gt;




&lt;h2 id="other-propagation-effects"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/10-wave-propagation/#other-propagation-effects"&gt;Other propagation effects&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;We don&amp;rsquo;t have time to cover these, unfortunately, but be aware that we can also study&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;refraction: &amp;ldquo;clumpiness&amp;rdquo; of ISM from electron density variations&lt;/li&gt;
&lt;li&gt;scintillation: &amp;ldquo;twinkling&amp;rdquo; helps track turbulence in the ISM&lt;/li&gt;
&lt;li&gt;extreme brightening events: chance refraction through blobs in ISM, possibly from shells of SNe&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;More information provided in Draine Chs. 11.4 - 11.7&lt;/p&gt;</description></item><item><title>Emission and Absorption by a Thermal Plasma</title><link>https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/</link><pubDate>Sat, 11 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+Sep+13/1_392jaruc"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 10&lt;/li&gt;
&lt;li&gt;Rybicki and Lightman Ch 5.&lt;/li&gt;
&lt;li&gt;Ryden and Pogge, Chs. 4.1, 4.3&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;A &lt;strong&gt;thermal plasma&lt;/strong&gt; is a partially ionized gas, whose particles have a velocity distribution that are very close to Maxwellian. We see these in interstellar space, with temperatures \(T = 10^3 - 10^8\) K. Generally, we can treat the plasma as in LTE.&lt;/p&gt;
&lt;p&gt;Note, though, that in the following discussion &amp;ldquo;continuum&amp;rdquo; does not necessarily mean blackbody, it just means that the opacity of the transition is much broader in frequency than a spectral line transition.&lt;/p&gt;
&lt;p&gt;Three important emission mechanisms in plasmas&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Free-free&lt;/strong&gt; transitions: an electron is inelastically scattered from one free state to another, emitting a continuum photon&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Free-bound&lt;/strong&gt; transitions: an electron is initally free, but is captured into a bound state, emitting a continuum photon&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Bound-bound&lt;/strong&gt; transitions: an electron makes a transition from one bound state to another, which the change in energy carried away by one or two photons&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Today we are discussing 1 and 2. 3 will be discussed the following week in lectures on &amp;ldquo;Recombination of Ions with Electrons&amp;rdquo; and &amp;ldquo;Collisional Excitation.&amp;rdquo;&lt;/p&gt;




&lt;h2 id="free-free-emission-bremsstrahlung"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/#free-free-emission-bremsstrahlung"&gt;Free-Free emission (Bremsstrahlung)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&amp;ldquo;Braking radiation&amp;rdquo;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Classical electromagnetism&lt;/em&gt;: an accelerating charge radiates electromagnetic energy.&lt;/li&gt;
&lt;li&gt;in a thermal plasma (remember, hot &amp;amp; partially ionized), electrons and ions are scattering off of one another&lt;/li&gt;
&lt;li&gt;because the electrons are much lighter than the ions, they undergo larger accelerations and dominate the radiation&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Provides continuum emission from radio frequencies up to the point where the photon energies are comparable to the thermal energy \(k T\).&lt;/p&gt;
&lt;p&gt;See Rybicki and Lightman Ch 5: Bremsstrahlung for more details on the derivation of the emissivity of this mechanism. Here, we&amp;rsquo;ll just write down the result
$$
j_{\mathrm{ff}, \nu} = \frac{8}{3} \left ( \frac{2 \pi}{3} \right)^{1/2} g_{\mathrm{ff}, i} \frac{e^6}{m_e^2 c^3} \left ( \frac{m_e}{kT} \right)^{1/2} e^{- h \nu / k T} n_e Z_i^2 n_i
$$
$$
j_{\mathrm{ff}, \nu} = 5.444 \times 10^{-41} g_\mathrm{ff} T_4^{-1/2} e^{- h \nu / k T} Z_i^2 n_i n_e \;\mathrm{erg}\;\mathrm{cm}^3\;\mathrm{s}^{-1}\;\mathrm{sr}^{-1}\;\mathrm{Hz}^{-1}
$$
here \(n_i\) represents the ions, and \(n_e\) represents the electrons.&lt;/p&gt;
&lt;p&gt;\(g_{\mathrm{ff},i}(\nu, T)\) is the &lt;strong&gt;Gaunt factor for free-free transitions&lt;/strong&gt;. When doing classical physics, Gaunt factor = 1. It&amp;rsquo;s different from 1 when we need to treat quantum mechanical effects. Even then, the correction factor is only 1 order of magnitude different from 1, across a wide range of frequencies.&lt;/p&gt;
&lt;p&gt;If we assume that \(g_{\mathrm{ff},i}(\nu, T)\) is constant (w \(\nu\)), then the emissivity is flat (independent) with frequency at low frequencies.&lt;/p&gt;
&lt;p&gt;We can calculate the power radiated per volume as
$$
\Lambda_\mathrm{ff} = 4 \pi \int_0^\infty j_{\mathrm{ff},\nu} d \nu
$$
and we get that
$$
\Lambda_\mathrm{ff} \propto n_i n_e \sqrt{T}
$$&lt;/p&gt;
&lt;p&gt;See Draine 10.2 and 10.3 for how to properly calculate and average over the Gaunt factor for various regimes.&lt;/p&gt;




&lt;h2 id="free-free-absorption"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/#free-free-absorption"&gt;Free-free absorption&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If we have a thermal distribution, then Kirchoff&amp;rsquo;s law must be satisfied
$$
B_\nu = \frac{j_\nu}{\kappa_\nu}
$$
where \(B_\nu\) is the Planck function. Kirchoff&amp;rsquo;s law is a neat trick, because we can go calculate the attenuation coefficient \(\kappa_\nu\) from the emissivity \(j_\nu\).&lt;/p&gt;
&lt;p&gt;$$
\kappa_{\mathrm{ff},\nu} = \frac{4}{3} \left ( \frac{2 \pi}{3} \right)^{1/2} \frac{e^6}{m_e{3/2} (k T)^{1/2} h c \nu^3} \left [ 1 - e^{- h \nu / k T} \right] Z_i^2 n_i n_e g_\mathrm{ff}
$$&lt;/p&gt;
&lt;p&gt;from the frequency dependence, we see that free-free absorption becomes strong at low (radio) frequencies.&lt;/p&gt;




&lt;h2 id="emission-measure"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/#emission-measure"&gt;Emission Measure&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Calculate the intensity \(I_\nu\) from free-free emission from an ionized region
$$
I_\nu(s) = I_\nu(0) e^{-\tau_\nu} + \int_0^s d s^\prime j_{\mathrm{ff},\nu} e^{-[\tau_\nu(s) - \tau_nu(s^\prime)]}
$$
$$
I_\nu(s) = I_\nu(0) e^{-\tau_\nu} + \int_0^{\tau_\nu} d \tau^\prime \left [ \frac{j_{\mathrm{ff}, \nu}}{\kappa_{\mathrm{ff}, \nu}} \right] e^{-(\tau - \tau^\prime)}
$$
If the region has a uniform temperature, then the integral becomes
$$
I_\nu = I_\nu(0) e^{-\tau_\nu} + \left [ \frac{j_{\mathrm{ff}, \nu}}{\kappa_{\mathrm{ff}, \nu}} \right]&lt;em&gt;T (1 - e^{-\tau&lt;/em&gt;\nu})
$$&lt;/p&gt;
&lt;p&gt;Now let&amp;rsquo;s define something called the &lt;em&gt;emission measure&lt;/em&gt;
$$
EM = \int n_e n_p ds = \left [ \frac{n_e n_p}{\kappa_{\mathrm{ff},\nu}} \right ]&lt;em&gt;T \tau&lt;/em&gt;\nu.
$$
Since most clouds are net neutral, we can write
$$
EM \approx \int n_e^2 ds.
$$&lt;/p&gt;
&lt;p&gt;We can rewrite the RTE,
$$
I_\nu = I_\nu(0) e^{-\tau_\nu} + \frac{(1 - e^{-\tau_\nu})}{\tau_\nu} \left [ \frac{j_{\mathrm{ff},\nu}}{n_e n_p} \right ]&lt;em&gt;T EM
$$
when we are very optically thin (\(\tau \ll 1 \)), we have
$$
\frac{(1 - e^{-\tau&lt;/em&gt;\nu})}{\tau_\nu} \approx 1.
$$
We can plug back in for the Brehmsstralung emissivity and obtain the result that intensity &lt;em&gt;increases linearly with EM&lt;/em&gt;. See Draine Ch 10.5 for prefactors.&lt;/p&gt;
&lt;p&gt;For low frequencies \(\nu \leq 1 \) GHz, self-absorption can become important in dense H II regions. Generally negligible in other ISM settings.&lt;/p&gt;




&lt;h2 id="free-bound-transitions-recombination-continuum"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/#free-bound-transitions-recombination-continuum"&gt;Free-bound transitions: Recombination continuum&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Electron captured by an ion, making the transition from a &amp;ldquo;free&amp;rdquo; state to a &amp;ldquo;bound&amp;rdquo; state, and emitting a photon.&lt;/p&gt;
&lt;p&gt;How might we calculate the emissivity \(j_{\mathrm{fb},\nu}\) for this interaction, assuming LTE (i.e., thermal plasma)?&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s think about the tools that we need: \(\kappa_\nu\), \(B_\nu\), \(\sigma\), \(n_b\)&lt;/p&gt;
&lt;p&gt;Here&amp;rsquo;s one way to go about it. In our second lecture on collisional processes, we used the impact approximation to approximately calculate the cross section for photoionization, \(\sigma_b\). We assumed there was an electron with some kinetic energy, it engaged in some momentum transfer with the atom, and this extra change in momentum unbound it&amp;rsquo;s own electron, ionizing the atom.&lt;/p&gt;
&lt;p&gt;Then, we know that we can use our result from the radiative process lecture for net absorption
$$
\kappa_\nu = n_l \sigma_{l \rightarrow u}(\nu) - n_u \sigma_{u \rightarrow l}(\nu)
$$
to write
$$
\kappa_\nu = n_l \sigma_b(\nu) \left [ 1 - e^{-h \nu/k T_\mathrm{exc}} \right].
$$&lt;/p&gt;
&lt;p&gt;Next, let&amp;rsquo;s reuse Kirchoff&amp;rsquo;s law to relate
$$
j_{\mathrm{fb}, \nu} = \kappa_{\mathrm{bf}, \nu} B_\nu(T)
$$
and, since we are in LTE, \(T_\mathrm{exc} = T\), so
$$
j_{\mathrm{fb}, \nu} = n_b \sigma_b(\nu) \left [ 1 - e^{-h \nu/k T} \right ] B_\nu(T)
$$
where \(n_b\) is the number density of atoms in bound state b, if in LTE with electron density \(n_e\), and ion density \(n_i\). For example, \(n_b\) could be the number of hydrogen atoms with principle quantum number \(n=3\).&lt;/p&gt;
&lt;p&gt;Finally, we would like to rewrite \(n_b\) in terms of the overall number densities of electrons and ions and the temperature of the plasma, because in a moment we&amp;rsquo;re going to want to consider multiple different bound states. To do this, we need to refer back to the statistical mechanics lecture and the &lt;em&gt;law of mass action&lt;/em&gt;. One result from that lecture was the ability to take the reaction for photoionization
$$
H(n) + e^- \leftrightarrow H^+ + 2e^-
$$
and write down the relative number densities, obtaining&lt;/p&gt;
&lt;p&gt;$$
\left [ \frac{n(H(n))}{n(H^+) n_e} \right]_\mathrm{LTE} = \left [ \frac{h^3}{(2 \pi k T)^{3/2}} \right] \left [ \frac{m_H}{m_p m_e} \right]^{3/2} \frac{g[H(n)]}{g(e^-) g(H^+)} e^{-I_n/kT}
$$&lt;/p&gt;
&lt;p&gt;where&lt;/p&gt;
&lt;p&gt;$$
I_n = \frac{I_H}{n^2}
$$&lt;/p&gt;
&lt;p&gt;is the energy required to ionize \(H(n)\). Finally, putting it all together, we have&lt;/p&gt;
&lt;p&gt;$$
j_{\mathrm{fb},\nu} = \frac{g_b}{2 g_i} \frac{h^4 \nu^3}{(2 \pi m_e k T)^{3/2} c^2} e^{(I_b - h \nu)/k T} \sigma_b(\nu) n_e n_i
$$
where \(g_b\) is the degeneracy of the bound state, \(g_i\) is the degeneracy of the ion, and \(I_b\) is the energy required to ionize from bound state.&lt;/p&gt;
&lt;p&gt;So, what did we do? We started with a photoionization cross section, related that to an absorption coefficient, used Kirchoff&amp;rsquo;s law and LTE assumptions to relate &lt;em&gt;that&lt;/em&gt; to an emissivity, and then rewrote the number densities of the bound state \(n_b\) in terms of more-easily calculable quantities, \(n_e\) and \(n_i\).&lt;/p&gt;
&lt;p&gt;Some take aways:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Generally, the free-bound emissivity is proportional to
$$
j_{\mathrm{fb},\nu} \propto n_e n_i.
$$&lt;/li&gt;
&lt;li&gt;Each bound state contributes its own bit of recombination continuum starting at \(h \nu = I_b\), which is the minimum energy photon that can be emitted when an electron transitions from free to bound state b&lt;/li&gt;
&lt;li&gt;The recombination continuum tapers off at higher frequncies by the factor \(e^{(I_b - h \nu)/k T} \)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Now let&amp;rsquo;s examine this emission spectrum for a hydrogen plasma&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/fig_10_2.jpg"
 alt="Solid line is the continuous emission spectrum of a \(T=8000)) K hydrogen plasma, including free-free emission, recombination continuum emission, and two-photon emission. (Emission lines not shown). Notice the &amp;lsquo;semi-circle&amp;rsquo; shape from the recombination emissivity. Credit: Draine Figure 10.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;Solid line is the continuous emission spectrum of a \(T=8000)) K hydrogen plasma, including free-free emission, recombination continuum emission, and two-photon emission. (Emission lines not shown). Notice the &amp;lsquo;semi-circle&amp;rsquo; shape from the recombination emissivity. Credit: Draine Figure 10.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="radio-recombination-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/09-emission-absorption-plasma/#radio-recombination-lines"&gt;Radio Recombination lines&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Three-body collisional processes can populate hydrogen atoms with very high quantum states \(n \gtrsim 100\), which are also called &lt;strong&gt;Rydberg&lt;/strong&gt; states. These high states can also undergo spontaneous decay to lower levels, such as the
$$
n + 1 \rightarrow n
$$
transitions we called the \(n\alpha\) transitions.&lt;/p&gt;</description></item><item><title>Absorption Lines and Curve of Growth</title><link>https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/</link><pubDate>Mon, 06 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+8+Sep+10/1_zz3zpyg4"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ryden and Pogge Ch. 2.3: Building Absorption Lines&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 2.4: Curve of Growth&lt;/li&gt;
&lt;li&gt;Draine Ch. 9: Absorption Lines: The Curve of Growth&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/fig_2_4.jpg"
 alt="Lorentzian and Gaussian line profiles. Credit: Fig 2.4 Ryden and Pogge"&gt;&lt;figcaption&gt;
 &lt;p&gt;Lorentzian and Gaussian line profiles. Credit: Fig 2.4 Ryden and Pogge&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;So far we&amp;rsquo;ve talked about the&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;intrinsic line profile, whose width comes about from quantum mechanical uncertainty, and takes on a Lorentzian form&lt;/li&gt;
&lt;li&gt;Gaussian line profile, which comes about from thermal motions&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In truth, we have a convolution of these two profiles, called a Voigt profile.&lt;/p&gt;
&lt;p&gt;$$
\phi_\nu^\mathrm{Voigt} = \frac{1}{\sqrt{\pi}}\frac{1}{\nu_{ul}}\frac{c}{b}H(a,u)
$$
where \(H(a,u)\) is a dimensionless function describing the shape of the Voigt profile. It was first tabulated by Frode Hjerting in 1938, and is why we call this combined thing the Voigt-Hjerting function. You can use a special functions library to calculate this profile exactly; for our purposes, we are mainly concerned with observing the astrophysical behaviors that manifest from the fact that this profile has &lt;em&gt;both&lt;/em&gt; a Gaussian core and Lorentzian wings. You can think of the \(a\) parameter (what R&amp;amp;P term the ratio of intrinsic broadening to thermal broadening) as a tuning between its &amp;ldquo;Gaussian-ness&amp;rdquo; (low \(a\)) and its &amp;ldquo;Lorentzian-ness&amp;rdquo; (high \(a\)).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/fig_2_5.jpg"
 alt="The Voight-Hjerting function \(H(a,u)\) with various intrinsic-to-broadening ratios \(a\). Credit: Fig 2.5 Ryden and Pogge"&gt;&lt;figcaption&gt;
 &lt;p&gt;The Voight-Hjerting function \(H(a,u)\) with various intrinsic-to-broadening ratios \(a\). Credit: Fig 2.5 Ryden and Pogge&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="building-absorption-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#building-absorption-lines"&gt;Building absorption lines&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s assume that we have a known background source illuminating some neutral hydrogen cloud with known&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;number density \(n_l\)&lt;/li&gt;
&lt;li&gt;temperature&lt;/li&gt;
&lt;li&gt;Einstein coefficients for the transition&lt;/li&gt;
&lt;li&gt;assume the cloud is cold enough so that we can ignore spontaneous emission &lt;em&gt;and&lt;/em&gt; \(j_\nu \approx 0 \)&lt;/li&gt;
&lt;li&gt;\(\phi_\nu\) is constant everywhere&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;What does the absorption line look like?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is just another way of asking what is the emergent \(I_\nu\), where in this case we are additionally interested in the \(\nu\) dependence.&lt;/p&gt;
&lt;p&gt;So let&amp;rsquo;s write down the RTE and get cracking.
$$
I_\nu(\tau_\nu) = I_\nu(0)e^{-\tau_\nu} + \int_0^{\tau_\nu} e^{-(\tau_\nu - \tau^\prime)}S_\nu d \tau^\prime
$$
Since
$$
S_\nu = \frac{j_\nu}{\kappa_\nu} \approx 0
$$
we have
$$
I_\nu(\tau_\nu) = I_\nu(0)e^{-\tau_\nu}
$$
For this problem, we basically have some initial (known) intensity that is attenuated by the intervening absorbers. The thing we want to calculate is the optical depth \(\tau_\nu\), which we do by
$$
\tau_\nu = \int \kappa_\nu d s = \int n_l \sigma_{lu}(\nu) ds
$$
$$
\tau_\nu = \frac{g_u}{g_l}\frac{c^2}{8 \pi \nu^2_{ul}}A_{ul} \int n_l \phi_\nu ds
$$
and from our assumptions we have \(\phi_\nu\) constant along line of sight so we can take
$$
\int n_l ds = N_l
$$
to a column density, and result in
$$
\tau_\nu = \frac{g_u}{g_l}\frac{c^2}{8 \pi \nu^2_{ul}}A_{ul} N_l \phi_\nu.
$$
Remember that \(\phi_\nu\) is our Voigt profile.&lt;/p&gt;
&lt;p&gt;We can plug this back in and solve for the emergent intensity we were after
$$
I_\nu(\tau_\nu) = I_\nu(0)e^{-\tau_\nu}.
$$&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;When \(\tau_\nu &amp;lt; 1\), we can use the small exponent approximation to show that
$$
\frac{I_\nu(0) - I_\nu(\tau_\nu)}{I_\nu(0)} \approx \tau_\nu
$$
meaning that the absorption line profile \(I_\nu\) follows the optical depth \(\tau_\nu\) as a function of frequency.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;When \(\tau_\nu \gg 1\), though, we get \(I_\nu(0)e^{-\tau_\nu} \approx 0\). Cold, dense, optically thick things (that are not emitting themselves) &lt;em&gt;block light&lt;/em&gt;!&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/fig_2_6.jpg"
 alt="Lyman \(\alpha\) absorption lines with central optical depths of \(\tau\) from 1 through 1,000. Figure 2.6 from Ryden and Pogge."&gt;&lt;figcaption&gt;
 &lt;p&gt;Lyman \(\alpha\) absorption lines with central optical depths of \(\tau\) from 1 through 1,000. Figure 2.6 from Ryden and Pogge.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;What&amp;rsquo;s the relationship between \(I_\nu\) and \(F_\nu\) here? We&amp;rsquo;re just considering a single small patch with differential solid angle \(\Delta \Omega\), assuming that the properties of the source are uniform over that area, so then the integral is over a constant area and we have
$$
I_\nu \propto F_\nu.
$$
So in this discussion about absorption lines and curve and growth, you may see \(F_\nu\) and \(I_\nu\) used interchangeably.&lt;/p&gt;




&lt;h2 id="equivalent-width"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#equivalent-width"&gt;Equivalent width&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Equivalent width is a convenient and quick &lt;em&gt;but imperfect&lt;/em&gt; way to boil down a rich measurement of \(I_\nu\) into a single number for tabulation.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/equivalent_width.jpg"
 alt="The equivalent width corresponding to the absorption line. Credit: Wikipedia/Szdori"&gt;&lt;figcaption&gt;
 &lt;p&gt;The equivalent width corresponding to the absorption line. Credit: Wikipedia/Szdori&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;How to calculate equivalent width?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Take high resolution spectrum of a source&lt;/li&gt;
&lt;li&gt;Estimate the baseline &amp;ldquo;continuum&amp;rdquo; level from spectral regions that do not show absorption lines. Call this \(F_\nu(0)\)&lt;/li&gt;
&lt;li&gt;Integrate over (sum up) the absorption line region \(F_\nu\)&lt;/li&gt;
&lt;li&gt;Compare how much of the continuum region you&amp;rsquo;d need (absorbed at 100%) to match the integrated absorption in the actual line&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The equivalent width is \(W\), and, confusingly, there are many different definitions out there in the wild.&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;re working at optical wavelengths, I always found the &amp;ldquo;wavelength equivalent width&amp;rdquo; to be the easiest to comprehend, which is calculated using \(F_\lambda\) common to optical spectra
$$
W_\lambda = \int \frac{\mathrm{continuum} - \mathrm{line}(\lambda)}{\mathrm{continuum}} d \lambda
$$
and has units of wavelength. Phrased with mathematical symbols
$$
W_\lambda = \int \frac{F_\lambda(0) - F_\lambda }{F_\lambda(0)} d \lambda.
$$&lt;/p&gt;
&lt;p&gt;&lt;em&gt;There are other definitions&lt;/em&gt;! For example, R&amp;amp;P and Draine prefer a dimensionless equivalent width
$$
W = \int \frac{d \nu}{\nu_0} \left [ 1 - \frac{F_\nu}{F_\nu(0)} \right] = \int \frac{d \nu}{\nu_0} ( 1 - e^{-\tau_\nu}).
$$
We&amp;rsquo;ll use the dimensionless equivalent width for any problem sets and tests (it has no units). They are related via
$$
W_\lambda \approx \lambda_0 W.
$$
The problem with the wavelength width is that if you measure \(W_\lambda = 1\)Å, it corresponds to a very different set of physical conditions if you&amp;rsquo;re looking at a line with line center \(\lambda = 4000\)Å vs \(\lambda = 8000\)Å. This is because we would expect
$$
\frac{\Delta \lambda}{\lambda}
$$
to be approximately constant across the spectrum, and this is captured by the dimensionless equivalent width. There is also a velocity-equivalent width (\(W_v = c W\)). So, if you&amp;rsquo;re going to compare equivalent widths, make sure you &lt;em&gt;understand the definition that is being used&lt;/em&gt;. Some authors define \(W &amp;gt; 0\) for emission lines, others for absorption lines.&lt;/p&gt;
&lt;p&gt;Much of the historical literature focuses on equivalent width measurements, and comparing these.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;My recommendation&lt;/strong&gt;: If you are lucky to have a high resolution spectrum of an object, realize that &lt;em&gt;you have a lot of information&lt;/em&gt; in how \(F_\nu\) varies with \(\nu\). Why go destroy some of it just to represent it as a single number? If you have a detailed physical model, you can fit \(F_\nu\) directly. Don&amp;rsquo;t bother thinking too hard about equivalent widths.&lt;/p&gt;
&lt;p&gt;For example, lets look at some real absorption lines and see why we&amp;rsquo;d actually want to take a look at the spectra themselves rather than just reporting a single number, \(W\).
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/fig_2_2.jpg"
 alt="UV interstellar absorption lines. Figure 2.2 Ryden and Pogge"&gt;&lt;figcaption&gt;
 &lt;p&gt;UV interstellar absorption lines. Figure 2.2 Ryden and Pogge&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;
&lt;/p&gt;
&lt;p&gt;That said, it&amp;rsquo;s worthwhile thinking about how a line saturates with increasing optical depth, and the measure of equivalent width is useful for this exercise. Our goal is to&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Measure some line profile&lt;/li&gt;
&lt;li&gt;Make some assessment of the optical depth&lt;/li&gt;
&lt;li&gt;Use the line depth/shape and our understanding of radiative transfer to arrive at some conclusion about the column density of absorbers \(N_l\).&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="curve-of-growth"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#curve-of-growth"&gt;Curve of growth&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s examine how the line profile changes with increasing optical depth at line center, \(\tau_0\). &lt;em&gt;Note&lt;/em&gt;: optical depth is not the same for all \(\lambda\), i.e., we have \(\tau_\lambda\) or \(\tau_\nu\)! That&amp;rsquo;s the point of the whole upcoming discussion!&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/fig_2_6.jpg"
 alt="Lyman \(\alpha\) absorption lines with central optical depths of \(\tau_0\) from 1 through 1,000. Figure 2.6 from Ryden and Pogge."&gt;&lt;figcaption&gt;
 &lt;p&gt;Lyman \(\alpha\) absorption lines with central optical depths of \(\tau_0\) from 1 through 1,000. Figure 2.6 from Ryden and Pogge.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The &lt;strong&gt;curve of growth&lt;/strong&gt; is a relationship between the observed equivalent width \(W\) and the optical depth at line center \(\tau_0\).&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/fig_2_7.jpg"
 alt="Curve of growth for the same Lyman \(\alpha\) line. Figure 2.7 from Ryden and Pogge."&gt;&lt;figcaption&gt;
 &lt;p&gt;Curve of growth for the same Lyman \(\alpha\) line. Figure 2.7 from Ryden and Pogge.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Note that the curve of growth is a &lt;em&gt;log-log&lt;/em&gt; plot! It is divided up into three regimes&lt;/p&gt;




&lt;h3 id="linear"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#linear"&gt;Linear&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;optically thin&lt;/li&gt;
&lt;li&gt;only the Gaussian core of a line contributes significant EW&lt;/li&gt;
&lt;li&gt;\(W\) is linear with \(\tau_0\) (and thus also with \(N_l\))&lt;/li&gt;
&lt;li&gt;\(N_l\) can be directly calculated from a measurement of \(W\)&lt;/li&gt;
&lt;li&gt;thermal broadening of the line cancels out in determination of \(N_l\) (good, we don&amp;rsquo;t need to know temperature!)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="damping-optical-depth"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#damping-optical-depth"&gt;Damping Optical Depth&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;There is a transition whether the Lorentzian wings provide much contribution to the equivalent width calculation. This is called the damping optical depth \(\tau_\mathrm{damp}\), and depends on temperature as well as the properties of the absorber.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;For \(1 \leq \tau_0 \leq \tau_\mathrm{damp} \), this is the flat portion.&lt;/li&gt;
&lt;li&gt;For \(\tau_0 \geq \tau_\mathrm{damp}\), this is the &amp;ldquo;square-root&amp;rdquo; or damped portion&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For the line in this plot, \(\tau_\mathrm{damp} \approx 10^{3.5}\).&lt;/p&gt;




&lt;h3 id="flat"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#flat"&gt;Flat&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;optically thick but not so much that the Lorentzian wings provide much contribution to equivalent width calculation (they remain shallow): \(0 \leq \tau_0 \leq \tau_\mathrm{damp}\)&lt;/li&gt;
&lt;li&gt;optical depth &lt;em&gt;does depend&lt;/em&gt; on the thermal broadening parameter of the line, therefore we need to know temperature&lt;/li&gt;
&lt;li&gt;\(W\) grows grows excruciatingly slowly with optical depth, therefore trying to turn \(W\) into a measurement on \(N_l\) is very risky, since your error in \(W\) will &lt;em&gt;propagate exponentially&lt;/em&gt; into an error on \(N_l\).&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="square-root-or-damped-portion"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#square-root-or-damped-portion"&gt;Square-root or &amp;ldquo;damped&amp;rdquo; portion&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;so&lt;/em&gt; optically thick that even the Lorentzian wings &amp;ldquo;damping wings&amp;rdquo; are getting deep and contributing significantly to the equivalent width calculation \(\tau_0 \geq \tau_\mathrm{damp}\)&lt;/li&gt;
&lt;li&gt;after we&amp;rsquo;ve exceeded the damping optical depth column density \(N_l\) is again independent of thermal broadening parameter&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For reference, you can walk through R&amp;amp;P Ch 2.4 or Draine Chs. 9.2, 9.3, and 9.4 for the calculations and approximations underlying these behaviors of the curve of growth.&lt;/p&gt;




&lt;h2 id="doublet-ratios"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#doublet-ratios"&gt;Doublet ratios&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If an absorbing level can transition to two different excited states \(u_1\) or \(u_2\), we can calculate the relative equivalent widths of those two transitions. This gives us confidence in assessing whether the transition is actually in the optically thin limit, and we can derive a more accurate result. E.g., you may see this for the Ca II K and Ca II H doublet transitions.&lt;/p&gt;




&lt;h2 id="lyman-alpha"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#lyman-alpha"&gt;Lyman \(\alpha\)&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;In the ISM, atomic (neutral) hydrogen is almost always found in the ground state, and therefore the Lyman transitions are the most important. A &lt;strong&gt;resonance line&lt;/strong&gt; is one that has permitted absorption out of the electronic ground state.&lt;/p&gt;
&lt;p&gt;The Lyman \(\alpha\) transition \(1s \rightarrow 2p\) is very important because it allows us to directly measure the column density of atomic hydrogen.&lt;/p&gt;
&lt;p&gt;Deuterium has a slightly frequency shifted Lyman series, which, in some cases, allows us to measure absorption lines for both hydrogen and deuterium, and therefore determine a D\H ratio for the interstellar/galactic gas.&lt;/p&gt;




&lt;h2 id="lyman-limit"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#lyman-limit"&gt;Lyman limit&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Happens as \(n \rightarrow \infty\) and the wavelength differences between adjacent transitions from ground state smush together. You get a bit of a &amp;ldquo;continuum&amp;rdquo; opacity that has a cross-section that is very similar to that of the photoionization we covered in previous lectures. This is because a transition from ground state to very high \(n\) is very similar to that of an ionization cross section.&lt;/p&gt;




&lt;h2 id="metal-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#metal-lines"&gt;Metal lines&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Many resonance lines for metals are in the ultraviolet, so you can only observe them from space, or using optical means for highly redshifted systems. Notable exceptions are Na I doublet, the K I doublet, and the Ca II doublet.&lt;/p&gt;




&lt;h2 id="gas-phase-abundances"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/08-absorption-lines/#gas-phase-abundances"&gt;Gas phase abundances&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Observed abundances vary from sightline to sightline, which is presumably evidence that elements end up trapped in dust grains. This removal of various elements is called &lt;em&gt;interstellar depletion&lt;/em&gt;.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Warm neutral medium: has high abundances&lt;/li&gt;
&lt;li&gt;Cold neutral medium: has lower abundances&lt;/li&gt;
&lt;li&gt;Warm ionized medium: very high abundances&lt;/li&gt;
&lt;li&gt;Diffuse molecular clouds: very low abundances&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>H I 21-cm Emission and Absorption</title><link>https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/</link><pubDate>Mon, 06 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+7+Sep+8/1_evjnpdt2"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 8&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 3.1: 21cm Emission and Absorption&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 3.3: Exciting Hyperfine Energy Levels&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This is the first lecture where we&amp;rsquo;re moving a bit from the core processes and moving back towards a discussion of the many &amp;ldquo;phases&amp;rdquo; of the ISM.&lt;/p&gt;
&lt;p&gt;Just as a review, we had (according to Draine):&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Coronal gas, &amp;ldquo;hot ionized medium&amp;rdquo; or &amp;ldquo;HIM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;H II gas, dense H II regions, diffuse H II regions, &amp;ldquo;warm ionized medium&amp;rdquo; or &amp;ldquo;WIM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;Warm H I, &amp;ldquo;warm neutral medium&amp;rdquo; or &amp;ldquo;WNM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;Cool H I, &amp;ldquo;cold neutral medium&amp;rdquo; or &amp;ldquo;CNM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;Diffuse molecular gas&lt;/li&gt;
&lt;li&gt;Dense molecular gas&lt;/li&gt;
&lt;li&gt;Stellar outflows&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So, in talking about H I emission and absorption, we&amp;rsquo;re primarily talking about the warm neutral medium (R&amp;amp;P Ch 3) and the cold neutral medium (R&amp;amp;P Ch 2). We&amp;rsquo;re following Draine in a linear manner; these concepts are swapped slightly in R&amp;amp;P.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;ISM phases are ionization state, density, and temperature.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cold neutral medium&lt;/strong&gt;: absorption lines (from previous lectures, will also be covered in Friday&amp;rsquo;s lecture). T approx 100 K, \(n_H = 30 \;\mathrm{cm}^{-3}\) and neutral. Volume fraction of 1%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Warm Neutral Medium&lt;/strong&gt;: T approx 5000 K, \(n_H = 0.6\;\mathrm{cm}^{-3}\) and neutral. Volume fraction of 40%.&lt;/p&gt;
&lt;p&gt;Taken together, cold neutral medium and warm neutral medium contain over half of the mass of the Milky Way Galaxy&amp;rsquo;s ISM. Observationally, we can study this neutral atomic hydrogen directly via the hyperfine transition.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;mapping distribution of H I in MW and other galaxies&lt;/li&gt;
&lt;li&gt;galactic rotation curve&lt;/li&gt;
&lt;li&gt;gas temperatures in interstellar clouds&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="h-i-emissivity-and-absorption-coefficient"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#h-i-emissivity-and-absorption-coefficient"&gt;H I emissivity and absorption coefficient&lt;/a&gt;
&lt;/h2&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/fig_8_1.jpg"
 alt="Hyperfine splitting of the 1s ground state of atomic H Credit: Figure 8.1: Draine."&gt;&lt;figcaption&gt;
 &lt;p&gt;Hyperfine splitting of the 1s ground state of atomic H Credit: Figure 8.1: Draine.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Magnetic moment of electron coupled to magnetic moment of proton creates a &amp;ldquo;hyperfine&amp;rdquo; split between the parallel and antiparallel spin states.&lt;/p&gt;




&lt;h3 id="anti-parallel-state"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#anti-parallel-state"&gt;Anti-parallel state&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;has lower energy \(E_l = 0\)&lt;/li&gt;
&lt;li&gt;degeneracy \(g_l = 1\)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="parallel-state"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#parallel-state"&gt;Parallel state&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;total spin \(S = 1\), degeneracy \( g_u = 2 S + 1 = 3\)&lt;/li&gt;
&lt;li&gt;\(E_u - E_l = 5.87\times10^{-6}\) eV&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;When the electron drops from parallel to anti-parallel states, the photon emitted has wavelength
$$
\lambda = \frac{h c}{E_u - E_l} = 21.11\,\mathrm{cm}
$$&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s think a little bit about how this transition gets populated. Using the same concept of excitation temperature, but here we&amp;rsquo;re going to call it &lt;em&gt;spin temperature&lt;/em&gt;
$$
\frac{n_u}{n_l} = \frac{g_u}{g_l}e^{(E_l - E_u)/kT_\mathrm{spin}}.
$$
We can plug in the values for the spin flip transition and simplify the equation and
$$
\frac{E_l - E_u}{k} = 0.0682\;\mathrm{K}.
$$
is the temperature corresponding to the energy of the spin flip transition. Then the relative population levels are
$$
\frac{n_u}{n_l} = 3 e^{-0.0682\;\mathrm{K}/T_\mathrm{spin}},
$$&lt;/p&gt;
&lt;p&gt;How does \( e^{-0.0682\;\mathrm{K}/T_\mathrm{spin}}\) behave? Unless \(T_\mathrm{spin} \lesssim 0.0682\;\mathrm{K}\), the exponent is approximately zero, and \(e^0 = 1\). We can explore this relationship for a range of spin temperatures, and we see that \(n_u/n_l \approx 3\) for all but the &lt;em&gt;very coldest&lt;/em&gt; spin temperatures.&lt;/p&gt;
&lt;p&gt;Thinking about material out there in space, can someone tell me the coldest something is likely to be if it&amp;rsquo;s just sitting out there in interstellar space, even close to LTE? The CMB puts a pretty good lower bound on \(T\) so long as nothing funny is happening, since \(T_\mathrm{CMB} = 2.725\;\mathrm{K}\).&lt;/p&gt;
&lt;p&gt;Even the average CMB photon is much more energetic than this, \(T_\mathrm{CMB} = 2.725\;\mathrm{K}\), so these upper levels can be and are easily populated!&lt;/p&gt;




&lt;h3 id="emissivity"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#emissivity"&gt;Emissivity&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;So we reach the interesting conclusion that the level populations of H I 21cm transitions are
$$
n_u \approx \frac{3}{4} n(\mathrm{H\;I})
$$
$$
n_l \approx \frac{1}{4} n(\mathrm{H\;I})
$$
&lt;em&gt;independent&lt;/em&gt; of any astrophysically practical spin temperatures. Because the upper level always contains approximatel 75% of the H I, this means that the H I 21cm emissivity is also &lt;em&gt;independent&lt;/em&gt; of the spin temperature, and we have
$$
j_\nu = n_u \frac{A_{ul}}{4 \pi} h \nu_{ul} \phi_\nu = \frac{3}{16\pi} A_{ul} h \nu_{ul} n(\mathrm{H\;I}) \phi_\nu.
$$&lt;/p&gt;




&lt;h3 id="absorption-coefficient"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#absorption-coefficient"&gt;Absorption coefficient&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Following similar manipulations we covered in Draine Chs. 6 and 7, we can write the absorption coefficient
$$
\kappa_\nu = n_l \sigma_{lu} - n_u |\sigma_{ul}|
$$
$$
\kappa_\nu = n_l \frac{g_u}{g_l} \frac{A_{ul}}{8 \pi} \lambda_{ul}^2 \phi_\nu \left [ 1 - \frac{n_u}{n_l} \frac{g_l}{g_u} \right]
$$
$$
\kappa_\nu = n_l \frac{g_u}{g_l} \frac{A_{ul}}{8 \pi} \lambda_{ul}^2 \phi_\nu \left [ 1 - e^{-h \nu_{ul} / k T_\mathrm{spin}} \right]
$$
as we just discussed, \( e^{-h \nu_{ul} / k T_\mathrm{spin}} \approx 1\).&lt;/p&gt;
&lt;p&gt;What does \(\kappa_\nu\) represent? It&amp;rsquo;s the &lt;em&gt;net&lt;/em&gt; absorption coefficient, meaning it&amp;rsquo;s the true (total) absorption minus the contribution from stimulated emission.&lt;/p&gt;
&lt;p&gt;If we have \( e^{-h \nu_{ul} / k T_\mathrm{spin}} \approx 1\), the upper state is going to fairly populated, so what does that say about stimulated emission? It means that it is significant and we need to include it in our calculations.&lt;/p&gt;
&lt;p&gt;Using the small x approximation \(e^{-x} \approx 1 - x\) for \(x \ll 1\), we can finally simplify to
$$
\kappa_\nu \approx \frac{3}{32 \pi} A_{ul} \frac{h c \lambda_{ul}}{k T_\mathrm{spin}} n(\mathrm{H\;I}) \phi_\nu
$$
and we find that the absorption coefficient depends on spin temperature like
$$
\kappa_\nu \propto \frac{1}{T_\mathrm{spin}}
$$
i.e., it gets large when the spin temperature is low and the lower level is populated (subject to our assumption that \(h \nu_{ul} / k T_\mathrm{spin} \ll 1\) for the conditions of interest).&lt;/p&gt;
&lt;p&gt;If the spin temperature gets very high, we aren&amp;rsquo;t going to have much &lt;em&gt;net&lt;/em&gt; absorption (\(\kappa_\nu \approx 0 \)), because every truly absorbed photon will be balanced out by one contributed from stimulated emission.&lt;/p&gt;




&lt;h3 id="example-calculation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#example-calculation"&gt;Example calculation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Assume a Gaussian internal velocity distribution for the H I cloud with dispersion \(\sigma_V\). Assume we are looking at a frequency \(\nu\), which corresponds to a Doppler shift of \(u\), where
$$
\nu = \nu_{ul}(1 - u/c)
$$
and yields a normalized line profile of
$$
\phi_\nu = \frac{1}{\sqrt{2 \pi}} \frac{c}{\nu_{ul}} \frac{1}{\sigma_V}e^{-u^2/2 \sigma_V^2}
$$
The absorption coefficient is
$$
\kappa_\nu = 2.190\times10^{-19}\;\mathrm{cm}^2\;n(\mathrm{H\;I}) \frac{K}{T_\mathrm{spin}} \frac{\mathrm{km\;s}^{-1}}{\sigma_V} e^{-u^2/2 \sigma^2_V}.
$$
We can find optical depth
$$
d\tau = \kappa_\nu ds.
$$
Let&amp;rsquo;s assume the properties of the cloud (spin temp, velocity dispersion) are constant along the line of sight, and write down column density as
$$
N(\mathrm{H\;I}) = \int n(\mathrm{H\;I}) ds
$$
then we have,
$$
\tau_\nu = 2.190 \frac{N(\mathrm{H\;I})}{10^{21}\;\mathrm{cm}^{-2}} \frac{100\;\mathrm{K}}{T_\mathrm{spin}} \frac{\mathrm{km\;s}^{-1}}{\sigma_V} e^{-u^2/2 \sigma_V^2}.
$$
What column density do we need for the line of sight to be marginally optically thick (\( \tau_\nu \gtrsim 1\))?&lt;/p&gt;
&lt;p&gt;Many sightlines have \(N(\mathrm{H\;I}) \gtrsim 10^{21}\;\mathrm{cm}^{-2}\), spin temperatures of \(T_\mathrm{spin} \approx 10^2\;\mathrm{K}\), and velocity dispersions of a few km/s, which means self-absorption of the 21-cm line can be important.&lt;/p&gt;
&lt;p&gt;&lt;a href="https://www.naic.edu/~gibson/hisa/hisa_expl.html"&gt;Self-absorption&lt;/a&gt;. A foreground cloud will appear darker than a background cloud. Think of this in the line profiles, where you can have small dips on top of a larger line &lt;a href="https://www.atnf.csiro.au/pasa/18_1/mccluregriffiths/paper/node3.html"&gt;plots&lt;/a&gt;.&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://iopscience-iop-org.ezaccess.libraries.psu.edu/article/10.1086/346227/meta#"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/li_and_goldsmith_1.png"
 alt="Figure 1. Credit: Li and Goldsmith 2003"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Figure 1. Credit: Li and Goldsmith 2003&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;&lt;a href="https://iopscience-iop-org.ezaccess.libraries.psu.edu/article/10.1086/346227/meta#"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/li_and_goldsmith_2.png"
 alt="Figure 2. Credit: Li and Goldsmith 2003"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Figure 2. Credit: Li and Goldsmith 2003&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The &lt;a href="https://iopscience-iop-org.ezaccess.libraries.psu.edu/article/10.1086/346227/meta#"&gt;Li and Goldsmith&lt;/a&gt; paper.&lt;/p&gt;




&lt;h2 id="optically-thin-cloud"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#optically-thin-cloud"&gt;Optically Thin Cloud&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Assume a cloud with
$$
N(\mathrm{H\;I}) \lesssim 10^{20}\,\mathrm{cm}^{-2}\frac{T_\mathrm{spin}}{100\,\mathrm{K}} \frac{\sigma_V}{\mathrm{km\;s}^{-1}}
$$
such that \(\tau_\nu \lesssim 0.2\) even at line center (i.e., it&amp;rsquo;s optically thin).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Assume that we can neglect absorption. How would we calculate the emergent intensity \(I_\nu\) from this cloud, using the equation of radiative transfer?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Using the path length formulation,
$$
I_\nu = I_\nu(0) + \int j_\nu ds.
$$
Let&amp;rsquo;s take our equation for \(j_\nu\) from before, assume constant properties along the ray, and use the idea that
$$
N(\mathrm{H\;I}) = \int n(\mathrm{H\;I}) ds
$$
we have
$$
I_\nu = I_\nu(0) + \frac{3}{16\pi}A_{ul} h \nu_{ul} \phi_\nu N(\mathrm{H\;I}).
$$
Assume we know \(I_\nu(0)\) from somewhere else. Rearrange and calculate the intensity contribution from the cloud, over all frequencies
$$
\int \left [ I_\nu - I_\nu(0) \right] d \nu = \frac{3}{16\pi}A_{ul} h \nu_{ul} N(\mathrm{H\;I})
$$
because we know \(\int \phi_\nu d\nu = 1\).&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s rewrite this using antenna temperature and relative velocity \(u\)
$$
\int \left [ T_A - T_A(0) \right] du = \int \frac{c^2}{2 k \nu^2} \left [I_\nu - I_\nu(0) \right] \frac{c}{\nu} d\nu
$$
$$
\int \left [ T_A - T_A(0) \right] du = 54.89\;\mathrm{K\;km\;s}^{-1} \frac{N(\mathrm{H\;I})}{10^{20}\;\mathrm{cm}^{-2}}
$$
This relationship allows us to&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;make an integrated line intensity measurement of 21-cm emission&lt;/li&gt;
&lt;li&gt;calculate the total H I column density \(N(\mathrm{H\;I})\)&lt;/li&gt;
&lt;li&gt;without needing to know \(T_\mathrm{spin}\)! (as long as self-absorption not important)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If we can assume optically thin emission, we can even estimate the total H I mass in another galaxy.&lt;/p&gt;




&lt;h2 id="spin-temperature-determination-using-background-radio-sources"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/#spin-temperature-determination-using-background-radio-sources"&gt;Spin temperature determination using background radio sources&lt;/a&gt;
&lt;/h2&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/07-hi-21cm/fig_8_2.png"
 alt="Credit: Draine"&gt;&lt;figcaption&gt;
 &lt;p&gt;Credit: Draine&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Assumptions&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;we have a bright background radio source with a &amp;ldquo;known&amp;rdquo; continuum spectrum \(T_\mathrm{RS}\), or at least known that it should be intrinsically flat within some frequency range&lt;/li&gt;
&lt;li&gt;cloud (ISM) properties are uniform across the two sightlines: \(N(\mathrm{H\;I})\) and \(T_\mathrm{spin}\)&lt;/li&gt;
&lt;li&gt;nothing behind the &amp;ldquo;off&amp;rdquo; sightline, just blank sky whose antenna temperature \(T_\mathrm{sky}\) we can measure/approximate from nearby measurements in space and frequency.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Using the fact that specific intensity and antenna temperature have a linear relationship, take the RTE in integral form and write down the relationships between all of the temperatures in the Figure, assuming a radial velocity \(v\).
$$
T_A^\mathrm{off}(v) = T_\mathrm{sky}e^{-\tau_\nu} + T_\mathrm{spin}(1 - e^{-\tau_\nu})
$$
and
$$
T_A^\mathrm{on}(v) = T_\mathrm{RS} e^{-\tau_\nu} + T_\mathrm{spin}(1 - e^{-\tau_\nu}).
$$
Then, solve for \(\tau_\nu\) and \(T_\mathrm{spin}\)
$$
\tau(v) = \ln \left [ \frac{T_\mathrm{RS} - T_\mathrm{sky}}{T_A^\mathrm{on}(v) - T_A^\mathrm{off}} \right]
$$
and
$$
T_\mathrm{spin} = \frac{T_A^\mathrm{off}(v) T_\mathrm{RS} - T_A^\mathrm{on}(v) T_\mathrm{sky}}{(T_\mathrm{RS} - T_\mathrm{sky}) - [T_A^\mathrm{on}(v) - T_A^\mathrm{off}(v)]}.
$$
When absorption is strong,&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(T_\mathrm{RS} - T_\mathrm{sky}\) is measurably larger than \(T_A^\mathrm{on}(v) - T_A^\mathrm{off}\)&lt;/li&gt;
&lt;li&gt;can calculate \(\tau_\nu\) and \(T_\mathrm{spin}\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If absorption is weak,&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;can&amp;rsquo;t tell the difference between \(T_\mathrm{RS} - T_\mathrm{sky}\) vs. \(T_A^\mathrm{on}(v) - T_A^\mathrm{off}\)&lt;/li&gt;
&lt;li&gt;upper limit on \(\tau (v)\) and lower limit on \(T_\mathrm{spin}(v)\).&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Radiative Transfer</title><link>https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/</link><pubDate>Wed, 01 Sep 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+6+Sep+3/1_seg7c0y0"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 7&lt;/li&gt;
&lt;li&gt;Ryden and Pogge: Ch 2.1: The Equation of Radiative Transfer&lt;/li&gt;
&lt;li&gt;Ryden and Pogge: Ch 3.2: Radiative Transfer of Line Emission&lt;/li&gt;
&lt;li&gt;Lecture notes on &lt;a href="https://www.ita.uni-heidelberg.de/~dullemond/lectures/radtrans_2012/index.shtml"&gt;Radiative Transfer in Astrophysics&lt;/a&gt; by C.P. Dullemond.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Radiative Processes&lt;/em&gt; by Rybicki and Lightman, particularly Ch. 1&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Radiative transfer describes the propagation of radiation through absorbing and emitting media.&lt;/p&gt;
&lt;p&gt;In Wednesday&amp;rsquo;s lecture, we introduced specific intensity \(I_\nu\) and photon occupation number \(n_\gamma(\nu)\).&lt;/p&gt;
&lt;p&gt;Photon occupation number
$$
n_\gamma(\nu) = \frac{c^2}{2 h \nu^3} I_\nu
$$
&amp;ldquo;dimensionless&amp;rdquo; number of photons per mode per polarization (per solid angle per frequency bin).&lt;/p&gt;
&lt;p&gt;In LTE (blackbody),
$$
I_\nu = B_\nu(T) = \frac{2 h\nu^3}{\exp(h \nu / kT) - 1}
$$
and
$$
n_\gamma = \frac{1}{\exp(h \nu / k T) - 1}.
$$&lt;/p&gt;




&lt;h2 id="gallery-of-temperatures"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/#gallery-of-temperatures"&gt;Gallery of temperatures!&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Now don&amp;rsquo;t assume LTE&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Brightness temperature \(T_B(\nu)\) (units K) is a way to characterize a &lt;em&gt;specific intensity&lt;/em&gt; (or surface brightness). Given some specific intensity \(I_\nu\), it is defined as the temperature such that a blackbody at that temperature has that specific intensity, i.e.,
$$
T_B(\nu) = \frac{h \nu / k}{\ln[1 + 2 h \nu^3/c^2 I_\nu]}
$$&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;On it&amp;rsquo;s own, quoting a brightness temperature &lt;em&gt;says nothing about whether the astrophysical source is in LTE&lt;/em&gt;. In fact, brightness temperature is commonly used to describe sources that are far from LTE or isotropic (masers, GRBs, etc.) You could alternatively describe the specific intensity (or surface brightness) in units of Jy/ster, for example, since that is what the Planck function provides (given some temperature).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Antenna temperature \(T_A\) (units K) is just like brightness temperature, only we make the assumption that we&amp;rsquo;re in the Rayleigh-Jeans limit of \(k T_A \gg h \nu\) or \(n_\gamma \gg 1\) so we have
$$
T_A(\nu) = \frac{c^2}{2 k \nu^2} I_\nu
$$&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Again, an antenna temperature says &lt;em&gt;nothing about whether the astrophysical source is in LTE&lt;/em&gt;, it is a convenient way to describe a specific intensity or surface brightness. If you&amp;rsquo;re going to use temperature as a shorthand for specific intensity (without needing to appeal to LTE temperature), antenna temperature is a good one to use because it is &lt;em&gt;linear&lt;/em&gt; in intensity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;BUT&lt;/strong&gt;, if a source &lt;em&gt;is&lt;/em&gt; in LTE with temperature \(T\), then we will have \(T_B = T\), and if we&amp;rsquo;re observing at radio wavelengths, \(T_A \approx T\) as well.&lt;/p&gt;
&lt;p&gt;Remember from last lecture we said that &lt;em&gt;in LTE&lt;/em&gt; we could calculate the relative level populations between two states using the Boltzmann distribution
$$
\frac{n_u}{n_l} = \frac{g_u}{g_l}e^{(E_l - E_u)/kT}
$$
&lt;em&gt;Given some relative population&lt;/em&gt; of two levels, we can define a quantity called excitation temperature
$$
\frac{n_u}{n_l} = \frac{g_u}{g_l}e^{(E_l - E_u)/kT_\mathrm{exc}}.
$$
So an excitation temperature (in K) is a way to communicate the relative population of two levels. In LTE, \(T_\mathrm{exc} = T\).&lt;/p&gt;




&lt;h2 id="radiative-transfer"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/#radiative-transfer"&gt;Radiative transfer&lt;/a&gt;
&lt;/h2&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/fig_7_1.jpg"
 alt="Draine Figure 7.1. Geometry for radiative transfer."&gt;&lt;figcaption&gt;
 &lt;p&gt;Draine Figure 7.1. Geometry for radiative transfer.&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Equation of radiative transfer. Consider a ray or beam of radiation. Neglect scattering.
$$
d I_\nu = - I_\nu \kappa_\nu ds + j_\nu ds
$$&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The first term is the net change in \(I_\nu\) due to absorption and stimulated emission.&lt;/li&gt;
&lt;li&gt;The second term is the change in \(I_\nu\) due to spontaneous emission by the material in the path of the beam&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="emission-and-absorption-coefficients"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/#emission-and-absorption-coefficients"&gt;Emission and absorption coefficients&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;\(\kappa_\nu\) is attenuation coefficient, and has dimensions of [1/cm]. Usually positive, except for masers/lasers, where it&amp;rsquo;s negative. Rybicki and Lightman call this \(\alpha_\nu\).&lt;/li&gt;
&lt;li&gt;\(j_\nu\) is emissivity, with units power per unit volume per frequency per unit solid angle&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For something with energy levels (and randomly oriented in space)
$$
j_\nu = \frac{1}{4 \pi} n_u A_{ul} h \nu \phi_\nu
$$
where \(\phi_\nu\) is the normalized line profile from the previous lecture.&lt;/p&gt;
&lt;p&gt;Attenuation coefficient is proportional to the &lt;em&gt;net absorption&lt;/em&gt;, which is true absorption minus stimulated emission
$$
\kappa_\nu = n_l \sigma_{l \rightarrow u}(\nu) - n_u \sigma_{u \rightarrow l}(\nu)
$$
$$
\kappa_\nu = n_l \sigma_{l \rightarrow u}(\nu) \left [ 1 - \frac{n_u/n_l}{g_u/g_l} \right]
$$
$$
\kappa_\nu = n_l \sigma_{l \rightarrow u}(\nu) \left [ 1 - e^{-h \nu/k T_\mathrm{exc}} \right]
$$&lt;/p&gt;




&lt;h2 id="integrating-the-radiative-transfer-equation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/#integrating-the-radiative-transfer-equation"&gt;Integrating the Radiative Transfer Equation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;You can do this as a function of pathlength, \(s\). E.g., carry out this numerically using an ODE solver.&lt;/p&gt;
&lt;p&gt;But, given the way absorption and emission works, it&amp;rsquo;s often very helpful to do a change of variables to an optical depth, \(\tau\). The differential optical depth is defined as
$$
d \tau_\nu = \kappa_\nu ds
$$
this definition assumes radiation propagates in the direction of increasing \(\tau\), and the radiative transfer equation becomes
$$
d I_\nu = - I_\nu d \tau_\nu + S_\nu d \tau_\nu
$$
where the source function is
$$
S_\nu = \frac{j_\nu}{\kappa_\nu}
$$&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Remember, no scattering included&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Again, you can solve radiative transfer problems using this ODE as is.&lt;/p&gt;
&lt;p&gt;We can also get another useful form by using integrating factors to formally integrate the equation&lt;/p&gt;
&lt;p&gt;$$
e^{\tau_\nu} (d I_\nu + I_\nu d \tau_\nu) = e^{\tau_\nu} S_\nu d \tau_\nu
$$&lt;/p&gt;
&lt;p&gt;$$
d(e^{\tau_\nu} I_\nu) = e^{\tau_\nu} S_\nu d \tau_\nu
$$&lt;/p&gt;
&lt;p&gt;Choose some boundary condition \(\tau_\nu = 0\) with initial value \(I_\nu(0)\)
$$
e^{\tau}I_\nu - I_\nu(0) = \int_0^{\tau_\nu} e^{\tau^\prime} S_\nu d \tau^\prime
$$
multiply through by \(e^{-\tau_\nu}\) and we have the &lt;strong&gt;equation of radiative transfer in integral form&lt;/strong&gt;
$$
I_\nu(\tau_\nu) = I_\nu(0)e^{-\tau_\nu} + \int_0^{\tau_\nu} e^{-(\tau_\nu - \tau^\prime)}S_\nu d \tau^\prime
$$
\(S_\nu\) can be a function of position.&lt;/p&gt;
&lt;p&gt;Interpretation: the intensity at some optical depth \(\tau_\nu\) is&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;the initial intensity \(I_\nu(0)\) attenuated by a factor \(e^{-\tau}\)&lt;/li&gt;
&lt;li&gt;plus the integral over the emission \(S_\nu d \tau^\prime\) attenuated by the factor \(e^{-(\tau_\nu - \tau^\prime)}\) due to the effective emission from the point of emission.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;When scattering is present, \(S_\nu\) contains a contribution from \(I_\nu\), so you cannot specify \(S_\nu\) a priori. Iterative schemes may be needed to solve to convergence (see modern RT packages).&lt;/p&gt;




&lt;h2 id="uniform-medium"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/#uniform-medium"&gt;Uniform medium&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Slab of uniform medium, energy levels populated according to a single excitation temperature \(T_\mathrm{exc}\), and the system is in LTE, so
$$
I_\nu = B_\nu(T_\mathrm{exc}) = S_\nu.
$$
If the system is in LTE, why must \(S_\nu = B_\nu\)? Otherwise the presence of the material would alter the radiation, and the system would not be in thermal equilibrium (but, by definition, it already is).&lt;/p&gt;
&lt;p&gt;Then the RTE becomes
$$
0 = d I_\nu = - B_\nu d \tau_\nu + S_\nu d \tau_\nu
$$&lt;/p&gt;
&lt;p&gt;In LTE, \(j_\nu\) and \(\kappa_\nu\) only depend on the local properties of the matter. Then we can solve
$$
I_\nu = I_\nu(0) e^{-\tau_\nu} + B_\nu(T_\mathrm{exc})(1 - e^{-\tau_\nu})
$$&lt;/p&gt;
&lt;p&gt;Think about what happens as we grow \(\tau_\nu\).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;at \(\tau_\nu \approx 0\), we retain \(I_\nu(0)\)&lt;/li&gt;
&lt;li&gt;at \(\tau \gg 1\), we obtain \(S_\nu = B_\nu\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You can also reverse the coordinates.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s talk about which pathlengths in this image (from the camera to the ground) are &amp;ldquo;optically thick&amp;rdquo; or &amp;ldquo;optically thin.&amp;rdquo;&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/optical_depth.png"&gt;
&lt;/figure&gt;

&lt;p&gt;There&amp;rsquo;s scattering by water droplets here, so it&amp;rsquo;s not a perfect analogy. But the idea about optical depth is still relevant.&lt;/p&gt;




&lt;h2 id="maser-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/06-radiative-transfer/#maser-lines"&gt;Maser lines&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Radiative pumping (discussed later) can set up relative level populations
$$
n_u &amp;gt; \frac{g_u}{g_l} n_l
$$&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;the excitation temperature goes negative(!)&lt;/li&gt;
&lt;li&gt;we have a population inversion&lt;/li&gt;
&lt;li&gt;stimulated emission is stronger than absorption&lt;/li&gt;
&lt;li&gt;radiation is amplified as it propagates&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Masing happens for microwave transitions of H I, OH, H2O, and SiO.
Some masers have brightness/antenna temperatures of \(T_A &amp;gt; 10^{11}\)K.&lt;/p&gt;
&lt;p&gt;This means they would have a HUGE flux &lt;em&gt;if&lt;/em&gt; they were as spatially extended as other astronomical sources. But the \(d\Omega\) is typically small.&lt;/p&gt;</description></item><item><title>Spontaneous Emission, Stimulated Emission, and Absorption</title><link>https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/</link><pubDate>Mon, 30 Aug 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+5A+Sep+1/1_o7bbam0v"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 6: Spontaneous Emission, Stimulated Emission, and Absorption&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 2.1: Radiative Transfer&lt;/li&gt;
&lt;li&gt;Ryden and Pogge Ch. 2.2: Absorbers and Emitters&lt;/li&gt;
&lt;li&gt;Lecture notes on &lt;a href="https://www.ita.uni-heidelberg.de/~dullemond/lectures/radtrans_2012/index.shtml"&gt;Radiative Transfer in Astrophysics&lt;/a&gt; by C.P. Dullemond.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Radiative Processes&lt;/em&gt; by Rybicki and Lightman&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;General rules for absorption and emission of radiation by absorbers with &lt;em&gt;quantized&lt;/em&gt; energy levels. Can be atoms, ions, molecules, dust grains, or anything that has (quantized) energy levels.&lt;/p&gt;




&lt;h2 id="absorption-of-photons"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#absorption-of-photons"&gt;Absorption of photons&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Some absorber is in a lower level, there is radiation present that has
$$
h \nu = E_u - E_l
$$
the absorber can absorb a photon and undergo an upward transition
$$
X_l + h \nu \rightarrow X_u.
$$&lt;/p&gt;




&lt;h3 id="rate-of-this-reaction"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#rate-of-this-reaction"&gt;Rate of this reaction&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;There is some \(n(X_l)\). The rate per volume at which absorbers absorb photons (\(l \rightarrow u\)) is proportional to the density of photons of the appropriate energy and the number density of absorbers&lt;/p&gt;
&lt;p&gt;$$
\underset{\mathrm{populate\;level}\;u}{\left ( \frac{\mathrm{d}n_u}{\mathrm{d}t} \right )_{l \rightarrow u}} = \underset{\mathrm{depopulate\;level}\;l}{- \left ( \frac{\mathrm{d}n_l}{\mathrm{d}t} \right)}
$$&lt;/p&gt;
&lt;p&gt;$$
= n_l B_{lu} u_\nu
$$&lt;/p&gt;
&lt;p&gt;\(u_\nu\) is the radiation energy density per unit frequency.&lt;/p&gt;
&lt;p&gt;\(B_{lu}\) is a proportionality constant called the Einstein B coefficient for the transition \(l \rightarrow u\).&lt;/p&gt;




&lt;h2 id="emission-of-photons"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#emission-of-photons"&gt;Emission of photons&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="spontaneous-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#spontaneous-emission"&gt;Spontaneous emission&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;$$
X_u \rightarrow X_l + h \nu
$$&lt;/p&gt;
&lt;p&gt;Random process (independent of radiation field), and occurs with a probability per unit time \(A_{ul}\) called the Einstein A coefficient.&lt;/p&gt;




&lt;h3 id="stimulated-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#stimulated-emission"&gt;Stimulated emission&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;$$
X_u + h \nu \rightarrow X_l + 2 h \nu
$$&lt;/p&gt;
&lt;p&gt;Occurs if photons of &lt;em&gt;identical&lt;/em&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;frequency&lt;/li&gt;
&lt;li&gt;polarization&lt;/li&gt;
&lt;li&gt;direction
are present in a radiation field.&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="rate-of-emission"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#rate-of-emission"&gt;Rate of emission&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;From state \(u \rightarrow l\)&lt;/p&gt;
&lt;p&gt;$$
\left ( \frac{\mathrm{d}n_l}{\mathrm{d}t} \right) = - \left ( \frac{\mathrm{d}n_u}{\mathrm{d}t}\right) = n_u (A_{ul} + B_{ul} u_\nu)
$$&lt;/p&gt;
&lt;p&gt;\(B_{ul}\) is the Einstein B coefficient for the downward transition.&lt;/p&gt;




&lt;h2 id="einstein-coefficients-are-not-independent"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#einstein-coefficients-are-not-independent"&gt;Einstein coefficients are not independent&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;$$
B_{ul} = \frac{c^3}{8 \pi h \nu^3} A_{ul}
$$&lt;/p&gt;
&lt;p&gt;$$
B_{lu} = \frac{g_u}{g_l} B_{ul} = \frac{g_u}{g_l} \frac{c^3}{8 \pi h \nu^3} A_{ul}
$$&lt;/p&gt;
&lt;p&gt;Strength of stimulated emission (\(B_{ul}\)) and absorption (\(B_{lu}\)) are both determined by \(A_{ul}\) (spontaneous emission) and the ratio of the degeneracies \(\frac{g_u}{g_l}\).&lt;/p&gt;




&lt;h2 id="radiation-fields"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#radiation-fields"&gt;Radiation fields&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;(Review from Rybicki and Lightman, Ch 1)&lt;/p&gt;
&lt;p&gt;\(I_\nu \) is the &lt;em&gt;specific intensity&lt;/em&gt; of radiation, you can think of it as the energy carried along by an infinitesimal &amp;ldquo;bundle&amp;rdquo; of rays.&lt;/p&gt;
&lt;p&gt;It has dimensions
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{ster}^{-1}\;\mathrm{Hz}^{-1}
$$
I like calling \(I_\nu\) the &amp;ldquo;specific intensity,&amp;rdquo; and that seems to be common usage in astronomy. In non-astronomy settings, this is called &amp;ldquo;spectral intensity.&amp;rdquo; If it is integrated over all frequency, it&amp;rsquo;s sometimes called the radiant intensity \(I\).&lt;/p&gt;
&lt;p&gt;\(I_\nu \) can be a little mind-bending to think about&amp;hellip; it can be a function of&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;3D space&lt;/li&gt;
&lt;li&gt;direction&lt;/li&gt;
&lt;li&gt;frequency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Intensity itself is &lt;em&gt;not&lt;/em&gt; a vector quantity but it is a scalar field that &lt;em&gt;is&lt;/em&gt; function of direction. Draine and Rybicki and Lightman write the angular direction vector as \(\Omega\) and the solid angle surrounding that vector as \(\mathrm{d}\Omega\).&lt;/p&gt;
&lt;p&gt;If we have a defined reference frame, we would probably write \(\Omega\) as a vector in spherical coordinates and define the components along \(\hat{\phi}\) and \(\hat{\theta}\) and let \(\mathrm{d}\Omega = d\phi \sin \theta d\theta\).&lt;/p&gt;
&lt;p&gt;\(B_\nu\) is exactly the same type of variable, we just use it instead of \(I_\nu\) when we&amp;rsquo;re referring to blackbody radiation, specifically of the form
$$
B_\nu = \frac{2 h \nu^3}{c^2}\frac{1}{e^{h \nu / kT} - 1}
$$
it also has units of
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{ster}^{-1}\;\mathrm{Hz}^{-1}
$$&lt;/p&gt;




&lt;h3 id="flux"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#flux"&gt;Flux&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Once you&amp;rsquo;ve defined \(I_\nu\), then it&amp;rsquo;s relatively easy to calculate quantities like energy density, flux, momentum, etc, as integrals of the specific intensity field.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/flux.png"
 alt="How to calculate flux from specific intensity. Credit: Rybicki and Lightman Ch. 1"&gt;&lt;figcaption&gt;
 &lt;p&gt;How to calculate flux from specific intensity. Credit: Rybicki and Lightman Ch. 1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;$$
F_\nu = \int I_\nu \cos \theta d \Omega
$$
(intensity passing through some differential area \(dA\), lowered by the effective angle).&lt;/p&gt;
&lt;p&gt;\(F_\nu\) has units of
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}
$$
(i.e., angular dependence has been integrated out). I like using the unit of Jansky, which is
$$
1\,\mathrm{Jy} = 10^{-23} \mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{Hz}^{-1}
$$&lt;/p&gt;
&lt;p&gt;In astronomy settings, \(F_\nu\) is called the &lt;em&gt;spectral flux density&lt;/em&gt;. In non-astronomy settings, this is called spectral irradiance. Sometimes you will see \(F_\lambda\), which has units
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\;\mathrm{cm}^{-1}
$$
(or per Å, depending on your wavelength range).&lt;/p&gt;
&lt;p&gt;Note that \(F_\nu\) and \(F_\lambda\) are flux &lt;em&gt;densities&lt;/em&gt; or distribution function, which means that
$$
F_\nu \ne F_\lambda
$$
(their units are different)!&lt;/p&gt;
&lt;p&gt;What is true is
$$
F_\nu \mathrm{d}\nu = F_\lambda \mathrm{d}\lambda
$$
because we&amp;rsquo;re comparing two quantities with units
$$
\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}
$$&lt;/p&gt;
&lt;p&gt;You&amp;rsquo;ll sometimes also see a spectral energy distribution plotted
$$
F_\nu \nu = F_\lambda \lambda
$$
this comes about because
$$
\nu = \frac{c}{\lambda}
$$
$$
\frac{d \lambda}{d \nu} = -\frac{c}{\nu^2} = -\frac{\lambda}{\nu}
$$
and the minus sign goes away because fluxes are defined per &lt;em&gt;positive&lt;/em&gt; unit of frequency or wavelength.&lt;/p&gt;
&lt;p&gt;I find the question of whether we&amp;rsquo;re referring to&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(I_\nu\): specific intensity (Jy/ster)&lt;/li&gt;
&lt;li&gt;\(F_\nu\): spectral flux density (Jy)&lt;/li&gt;
&lt;li&gt;\(F\): Bolometric flux \(\mathrm{ergs}\;\mathrm{s}^{-1}\;\mathrm{cm}^{-2}\).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;to be endlessly confusing in conversation, because it&amp;rsquo;s very common to colloquially use &amp;ldquo;flux&amp;rdquo; or &amp;ldquo;brightness&amp;rdquo; to mean a range of quantities, even though they have specific definitions in many settings. I find the clearest thing is to state the variable \(I_\nu\) or \(F_\nu\) (or \(F_\lambda\)) &lt;em&gt;and&lt;/em&gt; the units.&lt;/p&gt;




&lt;h2 id="energy-density-and-photon-occupation-number"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#energy-density-and-photon-occupation-number"&gt;Energy density and photon occupation number&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The &amp;ldquo;mean&amp;rdquo; (directionally averaged) intensity is
$$
J_\nu = \frac{1}{4 \pi} \int I_\nu d \Omega = \frac{1}{4\pi} \int I_\nu \sin \theta d \theta d\phi.
$$
it has a different meaning than flux. In an isotropic radiation field, the net flux will be 0, whereas the mean intensity will have some positive value.&lt;/p&gt;
&lt;p&gt;and the mean radiation density is
$$
u_\nu = \frac{4 \pi}{c} J_\nu.
$$
these quantities &lt;em&gt;are not&lt;/em&gt; a function of direction, since we&amp;rsquo;ve averaged over it.&lt;/p&gt;
&lt;p&gt;For a thermal, blackbody spectrum, we have
$$
u_\nu = \frac{4 \pi}{c} B_\nu(T)
$$&lt;/p&gt;
&lt;p&gt;In equilibrium, the absorbers must have levels populated according to
$$
\frac{n_u}{n_l} = \frac{g_u}{g_l}e^{(E_l - E_u)/kT}
$$&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The following doesn&amp;rsquo;t depend on thermal equilibrium&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;We can also write down the photon occupation number \(n_\gamma\), i.e., a dimensionless quantity tracking how many photons exist &amp;ldquo;per given solid angle&amp;rdquo;&lt;/p&gt;
&lt;p&gt;$$
n_\gamma = \frac{c^2}{2 h \nu^3} I_\nu
$$&lt;/p&gt;
&lt;p&gt;which we can average over all directions to get
$$
n_{\mathrm{bar},\gamma} = \frac{c^3}{8 \pi h \nu^3} u_\nu
$$
(a truly dimensionless quantity).&lt;/p&gt;
&lt;p&gt;These photon occupation numbers make it easy to rewrite the transition rates.&lt;/p&gt;
&lt;p&gt;From \( u \rightarrow l\)
$$
\left ( \frac{dn_l}{dt}\right)= n_u A_{ul} (1 + n_{\mathrm{bar},\gamma})
$$&lt;/p&gt;
&lt;p&gt;Stimulated emission is not important when \( n_{\mathrm{bar},\gamma} \ll 1 \).&lt;/p&gt;
&lt;p&gt;From \( l \rightarrow u\)
$$
\left ( \frac{d n_u}{dt} \right ) = n_l \frac{g_u}{g_l} A_{ul} n_{\mathrm{bar},\gamma}
$$&lt;/p&gt;




&lt;h2 id="absorption-cross-section"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#absorption-cross-section"&gt;Absorption cross section&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Because we&amp;rsquo;re talking about photons, instead of a velocity-dependent cross section, we will write a frequency-dependent cross section, \(\sigma(\nu)\). The velocity will be the speed of light \(c\).&lt;/p&gt;
&lt;p&gt;Photon density [1/cm^3] per unit frequency
$$
\frac{u_\nu}{h \nu}
$$&lt;/p&gt;
&lt;p&gt;Like in Ch 2, we would rate a rate as
$$
n_A n_B \langle \sigma v \rangle
$$
we&amp;rsquo;ll do the same thing with \(n_l\), \(\frac{u_\nu}{h \nu}\), and \(\sigma(\nu)\).&lt;/p&gt;
&lt;p&gt;For the \(l \rightarrow u\) transition,
$$
\left ( \frac{d n_u}{d t} \right ) = n_l \int d \nu \sigma_{lu}(\nu) c \frac{u_\nu}{h \nu}.
$$&lt;/p&gt;
&lt;p&gt;We can relate this back to the Einstein B coefficient for absorption to find
$$
B_{lu} = \frac{c}{h \nu} \int d\nu \sigma_{lu}(\nu)
$$
relate this back to the \(A_{ul}\) coefficient,
$$
\int d_\nu \sigma_{lu}(\nu) = \frac{g_u}{g_l} \frac{c^2}{8 \pi \nu^2_{lu}} A_{ul}.
$$
Then solve for \(\sigma_{lu}(\nu)\)
$$
\sigma_{lu}(\nu) = \frac{g_u}{g_l} \frac{c^2}{8 \pi \nu^2_{lu}}A_{ul} \phi_\nu
$$
where \(\phi_\nu\) is a normalized line profile
$$
\int \phi_\nu d \nu = 1.
$$&lt;/p&gt;




&lt;h2 id="oscillator-strength"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#oscillator-strength"&gt;Oscillator strength&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;You can also write these relationships with something called the oscillator strength, \(f_{lu}\).
$$
A_{ul} = \frac{8 \pi^2 e^2 \nu_{lu}^2}{m_e c^3} \frac{g_l}{g_u} f_{lu}
$$&lt;/p&gt;




&lt;h2 id="intrinsic-line-profile"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/05-emission-and-absorption/#intrinsic-line-profile"&gt;Intrinsic line profile&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;A Lorentzian is a good description of the intrinsic line profile (need quantum mechanical calculation to get it exact).&lt;/p&gt;
&lt;p&gt;$$
\phi(\nu) = \frac{4 \gamma_{ul}}{16 \pi^2(\nu - \nu_{ul})^2 + \gamma_{ul}^2}
$$&lt;/p&gt;
&lt;p&gt;The intrinsic width of the absorption line reflects the uncertainty in the energies of levels \(l,u\) due to the finite lifetimes against transitions to other levels.&lt;/p&gt;
&lt;p&gt;Intrinsic widths of lines \(\propto h \nu\), which means x-ray transitions can have considerably larger line widths than radio transitions (assuming the medium is stationary).&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ll talk more about absorption line profiles, Doppler broadening, Voigt profiles, etc&amp;hellip; in a few lectures.&lt;/p&gt;</description></item><item><title>Energy Levels of Atoms, Ions, and Molecules</title><link>https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/</link><pubDate>Sun, 29 Aug 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+4A+Aug+30/1_7cregjt3"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 4 + 5&lt;/li&gt;
&lt;li&gt;Ryden and Pogge &amp;ldquo;Guide to Spectroscopic Notation&amp;rdquo; available as PDF from &lt;a href="https://www.cambridge.org/highereducation/books/interstellar-and-intergalactic-medium/A647BECF975E19E86F7F88F7BB103AA7/resources/general-resources/EE39FB29FEC5B2061B17B5D036647F0A/guide-to-spectroscopic-notation/6B0467B962DB4E3D8986F4893569207F"&gt;here&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Radiative Processes&lt;/em&gt; by Rybicki and Lightman, Ch. 11: Molecular Structure&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Physics of Atoms and Molecules&lt;/em&gt; by Bransden and Joachain&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Some of this lecture will likely be review from your quantum mechanics (and/or chemistry) courses. The goal is to revisit and organize our understanding of the energy levels of atoms, ions, and molecules to form a foundation for later astrophysical processes that use these emission/absorption lines.&lt;/p&gt;




&lt;h2 id="ionization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#ionization"&gt;Ionization&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;As we covered in the first lecture (with hydrogen), ionization state is denoted by a Roman numeral (especially for spectroscopic purposes)&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;O I = neutral oxygen&lt;/li&gt;
&lt;li&gt;O II = singly ionized oxygen&lt;/li&gt;
&lt;li&gt;Ne III = doubly ionized neon&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Sometimes, you may see the more conventional &amp;ldquo;chemical&amp;rdquo; notation when we&amp;rsquo;re just referring to the ion, i.e., \(\mathrm{Ne}^{4+}\).&lt;/p&gt;




&lt;h2 id="single-electron-orbitals"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#single-electron-orbitals"&gt;Single-Electron orbitals&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;\(n\) principal quantum number (electron energy or &amp;ldquo;shell&amp;rdquo; level). You may see these labeled using K, L, M for \(n=1,2,3\)&amp;hellip;, especially if you do x-ray science.&lt;/li&gt;
&lt;li&gt;\(l\) orbital angular momentum in units of \(\hbar\) (orbital type, &lt;em&gt;s, p, d, f&lt;/em&gt; corresponding to \(l=0,1,2,3\)). Letters themselves are a holdover from pre-quantum empirical lab studies.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For given \(n\), \(l\) can take on values \(0 \leq l &amp;lt; n\).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(m_l\) (Drain calls it \(m_z\)) is the projection of orbital angular momentum onto the \(z\) axis and called the &amp;ldquo;magnetic&amp;rdquo; quantum number. If no applied magnetic field, energy for two \(m_l\) values are the same. Can take on \(2l + 1\) different values: \(m_l = -l, \ldots, -1, 0, 1, \dots l\).&lt;/li&gt;
&lt;li&gt;\(m_s\) is the projection of spin onto the \(z\) axis. Electrons are spin \(1/2\) particles, so projection only takes on two values. Degenerate energies if no applied field.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So a given pair of quantum numbers \(nl\) refers to \(2(2l + 1)\) distinct wavefunctions.&lt;/p&gt;




&lt;h2 id="pauli-exclusion-principal"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#pauli-exclusion-principal"&gt;Pauli exclusion principal&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Two electrons cannot share the same wavefunction.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;each \(nl\) subshell can have at most \(2(2l + 1)\) electrons&lt;/li&gt;
&lt;li&gt;\(s = 2\)&lt;/li&gt;
&lt;li&gt;\(p = 6\)&lt;/li&gt;
&lt;li&gt;\(d = 10\)&lt;/li&gt;
&lt;li&gt;\(f = 14\)&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://chem.libretexts.org/Bookshelves/Physical_and_Theoretical_Chemistry_Textbook_Maps/Supplemental_Modules_%28Physical_and_Theoretical_Chemistry%29/Quantum_Mechanics/09._The_Hydrogen_Atom/Atomic_Theory/Electrons_in_Atoms/Electronic_Orbitals"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/Single_electron_orbitals.jpg"
 alt="Visualization of the electron orbitals. From Physical_and_Theoretical_Chemistry_Textbook_Maps, Libretexts"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Visualization of the electron orbitals. From Physical_and_Theoretical_Chemistry_Textbook_Maps, Libretexts&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Orbitals are filled in order of: 1s, 2s, 2p, 3s, 3p, 4s, 3d, 4p, 5s, 4d, 5p, 6s, 4f, 5d, 6p, 7s, 5f, 6d, 7p.&lt;/p&gt;
&lt;p&gt;And we can denote the state of electrons in an atom using superscripts, e.g., for \(\mathrm{Mg}\,\mathrm{I}\)
$$
1\mathrm{s}^2\,2\mathrm{s}^2\,2\mathrm{p}^6\,3\mathrm{s}^2
$$&lt;/p&gt;




&lt;h2 id="spectroscopic-terms"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#spectroscopic-terms"&gt;Spectroscopic terms&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The spectral lines we observe are most commonly the result of the interactions among the electrons in the outermost shell, called the &amp;ldquo;optically active&amp;rdquo; electrons.&lt;/p&gt;
&lt;p&gt;There are three relevant vectors and corresponding quantum numbers&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(\boldsymbol{L}\) is total orbital angular momentum of electrons in outermost shell, with corresponding quantum number \(m_L = \sum m_l\), the projection of the &lt;em&gt;total&lt;/em&gt; orbital angular momentum \(L \hbar\) onto the \(z\)-axis.&lt;/li&gt;
&lt;li&gt;\(\boldsymbol{S}\) is the total spin angular momentum of the electrons in the shell, and has an associated quantum number \(m_S = \sum m_s\), the projection of the &lt;em&gt;total&lt;/em&gt; spin angular momentum \( S \hbar\) onto the z-axis.&lt;/li&gt;
&lt;li&gt;\(\boldsymbol{J} = \boldsymbol{L} + \boldsymbol{S}\) is the total (orbital + spin) angular momentum of the electrons. This term arises because of the coupling between the orbital and spin angular momentum, called the L-S coupling.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="permitted-semi-forbidden-and-forbidden-radiative-transitions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#permitted-semi-forbidden-and-forbidden-radiative-transitions"&gt;Permitted, semi-forbidden, and forbidden radiative transitions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Dipole selection rules determine whether a transition is &amp;ldquo;permitted&amp;rdquo; or not.&lt;/p&gt;
&lt;p&gt;&amp;ldquo;Semi-forbidden&amp;rdquo; and &amp;ldquo;forbidden&amp;rdquo; radiative transitions are not &lt;em&gt;strictly&lt;/em&gt; forbidden. It&amp;rsquo;s just that their transition probabilities (Einstein A coefficients) are much smaller than permitted lines (with forbidden lines being less likely still).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Semi-forbidden \(\mathrm{O\,III}],\lambda1666 \)Å&lt;/li&gt;
&lt;li&gt;Forbidden \([\mathrm{O\,III}],\lambda5007 \)Å&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In terrestrial settings, it&amp;rsquo;s more likely that an atom will be collisionally de-excited than emit in said transition. So it&amp;rsquo;s more accurate to call these &amp;ldquo;terrestrially-forbidden&amp;rdquo; transitions.&lt;/p&gt;




&lt;h2 id="hyperfine-structure"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#hyperfine-structure"&gt;Hyperfine Structure&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;If the nucleus has a non-zero magnetic moment, the interaction of the magnetic fields of the nucleus and electrons gives rise to a small energy difference, which is called hyperfine structure. The hyperfine quantum number is
$$
\boldsymbol{F} = \boldsymbol{I} + \boldsymbol{J}
$$
where \(\boldsymbol{I}\) is the quantized spin angular momentum.&lt;/p&gt;
&lt;p&gt;The most famous of these is the 21.106 cm line of neutral hydrogen (H I), i.e., the &amp;ldquo;21-cm&amp;rdquo; line, corresponding to the electron &amp;ldquo;spin-flip.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;But see also the hyperfine lines of HCN and \(\mathrm{C}_2 \mathrm{H}\) in protoplanetary disks.&lt;/p&gt;




&lt;h2 id="molecular-spectroscopy"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#molecular-spectroscopy"&gt;Molecular Spectroscopy&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;Molecular spectroscopy is hard&lt;/em&gt;. But worthwhile. Why is molecular spectroscopy hard? Having two (or more) atoms destroys the nice symmetry that we had with atomic spectra. This means that there are &lt;em&gt;many more&lt;/em&gt; ways for a molecule to move, which means many more degrees of freedom, which means many more quantum states to transition between and thus many more discrete photon energies to emit.&lt;/p&gt;
&lt;p&gt;Most abundant, astrophysically-interesting molecules are diatomic molecules (having two atoms). Thankfully, this means we at least have a line of rotational symmetry along the &lt;em&gt;internuclear axis&lt;/em&gt;, the line from one nucleus to another.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;homonuclear: having the same nucleus (e.g., \(H_2\), \(O_2\))&lt;/li&gt;
&lt;li&gt;heteronuclear: having different nuclei (e.g., \(HD\), \(CO\) )&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The main types of molecular transitions that you will encounter in astrophysical contexts are either vibrational or rotational.&lt;/p&gt;




&lt;h2 id="vibrational-transitions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#vibrational-transitions"&gt;Vibrational Transitions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Due to the quantization of the vibrational energy as atoms stretch on the internuclear axis.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/rybicki-11-1.png"
 alt="The potential between two atoms in a molecule as a function of their separation. Attribution: Figure 11.1 from &amp;lsquo;Radiative Processes&amp;rsquo; by Rybicki and Lightman"&gt;&lt;figcaption&gt;
 &lt;p&gt;The potential between two atoms in a molecule as a function of their separation. Attribution: Figure 11.1 from &amp;lsquo;Radiative Processes&amp;rsquo; by Rybicki and Lightman&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Ground state is very near the bottom (but not completely so).&lt;/p&gt;
&lt;p&gt;Can be approximated as a &lt;em&gt;harmonic oscillator&lt;/em&gt; with quantum number \(v\), and vibrational energy levels
$$
E_\mathrm{vib}(v) = \hbar \omega_0 (v + 1/2).
$$
where
$$
\omega_0 = \sqrt{k/m_r}
$$
where in this case only, \(k\) is the spring constant for the harmonic oscillator. \(m_r\) is the reduced mass
$$
m_r = \frac{m_1 m_2}{m_1 + m_2}.
$$&lt;/p&gt;
&lt;p&gt;Typical vibrational energies \(\hbar \omega_0\) range from 0.2 - 0.5 eV, corresponding to spectral lines in the infrared.&lt;/p&gt;
&lt;p&gt;Even at the ground vibrational state (\(v =0\), a molecule still has a zero point energy
$$
\frac{1}{2} \hbar \omega_0
$$&lt;/p&gt;




&lt;h2 id="rotational-transitions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#rotational-transitions"&gt;Rotational Transitions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Due to quantization of rotational angular momentum as the molecule rotates about the axis &lt;em&gt;perpendicular&lt;/em&gt; to the internuclear axis.&lt;/p&gt;
&lt;figure&gt;&lt;a href="http://hyperphysics.phy-astr.gsu.edu/hbase/molecule/rotrig.html"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/rotation.png"
 alt="Moment of inertia for a diatomic molecule. Credit: Hyperphysics"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Moment of inertia for a diatomic molecule. Credit: Hyperphysics&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Has rotation quantum number \(J\) and rotational energy levels
$$
E_\mathrm{rot}(J) = \frac{\hbar}{2I}J(J +1)
$$
where \(I\) is the molecule&amp;rsquo;s moment of inertia. Typical rotational levels have energy differences approximately 100x smaller than vibrational levels, so each vibrational level is usually split into many rotational sub-levels.&lt;/p&gt;
&lt;p&gt;Moment of inertia given by
$$
I = m_1 r_1^2 + m_2 r_2^2 = \mu R^2
$$&lt;/p&gt;
&lt;p&gt;One would specify a transition using a combination of quantum numbers, like
$$
{}^{12}\mathrm{CO}\,v=0; J=1-0
$$
a purely rotational transition.&lt;/p&gt;




&lt;h2 id="electronic-transitions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/04-energy-levels-atoms/#electronic-transitions"&gt;Electronic transitions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Molecules can also have electronic transitions (i.e., w/ electrons), just like in atoms. And these typically have energies of a few eV and produce spectral lines in UV, optical and near-IR. Generally, though, when we&amp;rsquo;re talking about observations of (diatomic) molecular spectra, we&amp;rsquo;re usually dealing with rotational (radio) and sometimes vibrational (IR) transitions.&lt;/p&gt;</description></item><item><title>Statistical Mechanics and Thermodynamic Equilibrium</title><link>https://iancze.github.io/courses/astro542/lectures/03-stat-mech/</link><pubDate>Wed, 25 Aug 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/03-stat-mech/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+3+Aug+27/1_jxunbxmk"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Draine Ch. 3&lt;/p&gt;
&lt;p&gt;Also good review is &amp;ldquo;An Introduction to Thermal Physics&amp;rdquo; by Daniel V. Schroeder, esp. Ch 6 &amp;ldquo;Boltzmann Statistics.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The ISM and IGM are generally far from thermodynamic equilibrium&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;But, to understand the nonequilibrium conditions, we first need to understand the equilibrium conditions, which are useful for relating forward and reverse reaction rates of ionization and recombination processes important to the ISM.&lt;/p&gt;
&lt;p&gt;Local thermodynamic equilibrium: emission properties can be calculated from measured properties and temperature. Key point is that we can define temperature &lt;em&gt;and&lt;/em&gt; it has some bearing on the microphysics of the fluid.&lt;/p&gt;
&lt;p&gt;For example, LTE exists in a glass of water containing a melting ice cube. Temperature can be defined at any point, and if the energies of the molecules at a specific point are measured, they will follow a Maxwell-Boltzmann distribution with that temperature (Wikipedia).&lt;/p&gt;
&lt;p&gt;LTE also exists in the photospheres of stars.&lt;/p&gt;




&lt;h2 id="partition-functions"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#partition-functions"&gt;Partition functions&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;The partition function \(Z\) is a dimensionless number describing a physical system, e.g., some set of particles in a box with volume \(V\). It allows you to understand the macrostates of the system from the microstates of the system.&lt;/p&gt;
&lt;p&gt;If you know the partition function, you can calculate energy, pressure, magnetization, entropy, etc.&lt;/p&gt;
&lt;p&gt;For a classical discrete system, the partition function is
$$
Z(T) = \sum_s e^{- E(s)/kT}
$$
sum is over all possible states \(s\) of the system, and \(E(s)\) is the energy of state \(s\).&lt;/p&gt;
&lt;p&gt;For a continuous system, the partition function is expressed using an integral (since the set of microstates as a function of position and momentum is infinite)&lt;/p&gt;
&lt;p&gt;$$
Z(X, T) = \frac{V}{h^3} \int_0^\infty 4 \pi p^2 \, \mathrm{d}p \; e^{-p^2/2 M_x k T}
$$
where \(M_X\) is the mass of the particle.&lt;/p&gt;
&lt;p&gt;For dilute gases, we generally need to talk about both types of partition functions, since the gas particles can have both translational motion and internal energy (i.e., quantum states)&lt;/p&gt;
&lt;p&gt;$$
Z(T) = Z_\mathrm{tran}(T) z_\mathrm{int}(T)
$$&lt;/p&gt;
&lt;p&gt;the translational partition function is&lt;/p&gt;
&lt;p&gt;$$
Z_\mathrm{tran} = \frac{(2 \pi M_X k T)^{3/2}}{h^3} V
$$&lt;/p&gt;
&lt;p&gt;The internal partition function is a sum over the possible internal states of \(X\)
$$
z_\mathrm{int}(X; T) = \sum_i g_i e^{E_i/kT}
$$
where \(g_i\) is the degeneracy of level \(i\), or the number of quantum states grouped together in the same energy level.&lt;/p&gt;
&lt;p&gt;For example, a free electron has just two internal states (spin up and down), both of which have \(E_i = 0\), so \(z_\mathrm{int} = 2\).&lt;/p&gt;
&lt;p&gt;Finally, we will define the &lt;em&gt;partition function per unit volume&lt;/em&gt;
$$
f(X; T) = \frac{Z}{V} = \left [ \frac{(2 \pi M_X k T)^{3/2}}{h^3} \right] z_\mathrm{int}(X; T)
$$&lt;/p&gt;




&lt;h2 id="detailed-balance-law-of-mass-action"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#detailed-balance-law-of-mass-action"&gt;Detailed balance: law of mass action&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Suppose there is a chemical reaction \(A + B \leftrightarrow C\), which is in &lt;strong&gt;local thermodynamic equilibrium&lt;/strong&gt;. Statistical mechanics tells us that the LTE abundance of species \(X\) is proportional to \(f(X)\).&lt;/p&gt;
&lt;p&gt;And therefore the number densities of the species satisfy the law of mass action&lt;/p&gt;
&lt;p&gt;$$
\frac{n_C}{n_A n_B} = \frac{f(C)}{f(A) f(B)}.
$$&lt;/p&gt;
&lt;p&gt;Basically, in LTE, the ratio of the product of number densities is equal to the ratio of the product of partition functions
$$
\frac{\prod_i n_{\mathrm{left},i}}{\prod_i n_{\mathrm{right},i}} = \frac{\prod_i f_{\mathrm{left},i}}{\prod_i f_{\mathrm{right},i}}
$$&lt;/p&gt;
&lt;p&gt;more details in Draine 3.2.&lt;/p&gt;
&lt;p&gt;As we mentioned, the partition function is useful for describing the macrostates of the system. If we want to find the ratio of number densities of a species, then we can just use the ratio of the partition functions.&lt;/p&gt;




&lt;h2 id="ionization-and-recombination"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#ionization-and-recombination"&gt;Ionization and Recombination&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s apply this understanding to the case of recombination and ionization, where an electron recombines with an ionized atom, moving its energy level in the process
$$
e^- + X_u^{+r +1} \leftrightarrow X_l^{+r}
$$
Say we know the abundance of \(X_u^{+r+1}\) and \(n_e\), that the system is in LTE, and the energies of the two states (\(E_{r,l}\) and \(E_{r+1, u}\)). What is the abundance of \(X_l^{+r}\)?&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s remember our left-hand-side / right-hand-side balance&lt;/p&gt;
&lt;p&gt;$$
\frac{n(X_l^{+r})}{n(X_u^{+r+1})n(e^-)} = \frac{z_{\mathrm{int},\mathrm{X_l^{+r}}}}{z_{\mathrm{int},\mathrm{X_u^{+r+1}}}z_{\mathrm{int},\mathrm{e^-}}}.
$$&lt;/p&gt;
&lt;p&gt;We just calculated \(z_{\mathrm{int},\mathrm{e^-}}\), and its 2. So we have&lt;/p&gt;
&lt;p&gt;$$
n(X_l^{+r}) = n(X_u^{+r+1})n(e^-) \frac{1}{2} \left ( \frac{z_{\mathrm{int},\mathrm{X_l^{+r}}}}{z_{\mathrm{int},\mathrm{X_u^{+r+1}}}} \right )
$$&lt;/p&gt;
&lt;p&gt;looking back to the partition function definition, the bracketed term is (relatively) easy to calculate, because we only have a single energy level under consideration for each species (no sum)&lt;/p&gt;
&lt;p&gt;$$
\frac{z_{\mathrm{int},\mathrm{X_l^{+r}}}}{z_{\mathrm{int},\mathrm{X_u^{+r+1}}}} = \frac{g(X_l^{+r}) e^{-E_{r,l}/kT}}{g(X_u^{+r+1}) e^{-E_{r+1,u}/kT}}
$$&lt;/p&gt;
&lt;p&gt;and there, we&amp;rsquo;ve solved for \(n(X_l^{+r})\).&lt;/p&gt;




&lt;h2 id="saha-equation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#saha-equation"&gt;Saha Equation&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Bengali physicist Meghnad Saha in 1920. Combining ideas of quantum mechanics and statistical mechanics to explain spectral classification of stars. Is the reason why we see drastically different spectral absorption lines in stars as a function of temperature, even though all stars are majority hydrogen composition.&lt;/p&gt;
&lt;p&gt;Lets expand the previous example by considering all energy levels of \(X^{+r}\) and \(X^{+r+1}\).&lt;/p&gt;
&lt;p&gt;$$
e^- + X^{+r +1} \leftrightarrow X^{+r}
$$&lt;/p&gt;
&lt;p&gt;same thing with the left-hand-side / right-hand-side balance, only though now we need to consider all levels&lt;/p&gt;
&lt;p&gt;$$
\frac{n(X^{+r})}{n(X^{+r+1})n(e^-)} = \frac{z_{\mathrm{int},\mathrm{X^{+r}}}}{z_{\mathrm{int},\mathrm{X^{+r+1}}}z_{\mathrm{int},\mathrm{e^-}}}.
$$&lt;/p&gt;
&lt;p&gt;which means we need to consider all levels in the partition function calculation.&lt;/p&gt;
&lt;p&gt;$$
\frac{z_{\mathrm{int},\mathrm{X^{+r}}}}{z_{\mathrm{int},\mathrm{X^{+r+1}}}} = \frac{\sum_j g_{r,j} e^{-E_{r,j}/kT}}{\sum_j g_{r+1,j} e^{-E_{r+1,j}/kT}}
$$&lt;/p&gt;
&lt;p&gt;If the temperature is sufficiently low that only the lowest energy level states are significant in the calculation of the partition function (\(e^{-E_i/kT}\)). If \(T\) is large, then \(e^{-E_i/kT} \rightarrow 0\) as \(E_i\) increases.&lt;/p&gt;
&lt;p&gt;Let the ionization energy be given by
$$
\Phi_r = E_{r+1,1} - E_{r,1}
$$
then the Saha equation is
$$
\frac{n(e^-) n(X^{+r+1})}{n(X^{+r})} \approx \frac{2(2 \pi m_e kT)^{3/2}}{h^3} \frac{g_{r+1,1}}{g_{r,1}}e^{-\Phi_r/kT}
$$&lt;/p&gt;
&lt;p&gt;again, all of this is assuming &lt;em&gt;local thermodynamic equilibrium&lt;/em&gt;. Good for stellar interiors, but &lt;em&gt;not&lt;/em&gt; a good approximation in the ISM or IGM.&lt;/p&gt;




&lt;h2 id="ratios-of-rate-coefficients"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#ratios-of-rate-coefficients"&gt;Ratios of rate coefficients&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Assuming we are in local thermodynamic equilibrium and that the reaction is in equilibrium, how can we use the law of mass action to calculate the ratio of rate coefficients?&lt;/p&gt;
&lt;p&gt;Consider the inelastic scattering reaction
$$
X(l) + Y \rightarrow X(u) + Y
$$&lt;/p&gt;
&lt;p&gt;Since we are in LTE, you can think of there being a &amp;ldquo;bath&amp;rdquo; of species \(Y\) that, given the opportunity, will react with &lt;em&gt;both&lt;/em&gt; \(X(l)\) and \(X(u)\). Assume \(Y\) is a collisional partner that does not change internal state during the interaction.&lt;/p&gt;
&lt;p&gt;What do we know from law of mass action?&lt;/p&gt;
&lt;p&gt;$$
\frac{n(X(u)) n(Y)}{n(X(l))n(Y)} = \frac{f(X(u))f(Y)}{f(X(l))f(Y)} = \frac{z_\mathrm{int}(X(u))}{z_\mathrm{int}(X(l))}
$$&lt;/p&gt;
&lt;p&gt;What do we know from conservation of mass?
$$
n(X(u)) n(Y) \langle \sigma v \rangle_{l \rightarrow u} = n(X(u)) n(Y) \langle \sigma v \rangle_{u \rightarrow l}.
$$&lt;/p&gt;
&lt;p&gt;Rearrange these to find
$$
\frac{\langle \sigma v \rangle_{l \rightarrow u}}{\langle \sigma v \rangle_{u \rightarrow l}} = \frac{z_\mathrm{int}(X(u))}{z_\mathrm{int}(X(l))} = \frac{g_u}{g_l} e^{-(E_u - E_l)/kT}
$$&lt;/p&gt;
&lt;p&gt;More info in Draine 3.5.&lt;/p&gt;
&lt;p&gt;These relationships can also be used to put conditions on the ratios of cross sections. See Draine 3.6 for more details.&lt;/p&gt;




&lt;h2 id="three-body-recombination"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#three-body-recombination"&gt;Three body recombination&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Can be important for hydrogen recombination, especially higher energy levels.&lt;/p&gt;
&lt;p&gt;$$
H^+ + e^- + e^- \rightarrow H(n) + e^-
$$&lt;/p&gt;
&lt;p&gt;Draine 3.7 has some calculations for this, the interesting result being that the number density \(n(H(n)) \propto n^2\). You can think of this as one electron loosing just enough energy to become bound at a higher energy level.&lt;/p&gt;




&lt;h2 id="departure-coefficients"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/#departure-coefficients"&gt;Departure coefficients&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Draine 3.8&lt;/p&gt;
&lt;p&gt;The departure coefficient is the ratio of the actual number density to the number density if one were in LTE&lt;/p&gt;
&lt;p&gt;$$
b_n \equiv \frac{n[H(n)]}{n_\mathrm{LTE}[H(n)]}
$$&lt;/p&gt;
&lt;p&gt;If we were &lt;em&gt;only&lt;/em&gt; considering collisional processes like collisional ionization and three-body recombination, then \(b_n = 1\). Important to recognize that &lt;em&gt;collisions&lt;/em&gt; do the thermalization that brings a parcel of gas into LTE. This is why most higher density things can be assumed to be in LTE (Sun&amp;rsquo;s photosphere, earth&amp;rsquo;s atmosphere, regions of protoplanetary disks, etc&amp;hellip;) but we can get in trouble when we go to less dense regions.&lt;/p&gt;
&lt;p&gt;In less dense regions, the excited states can be depopulated by &lt;em&gt;spontaneous emission&lt;/em&gt; at rates comparable or exceeding collisional dexcitation (or recombination). The spontaneous emission rate is larger for smaller \(n\), yielding the following plot for hydrogen atoms&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/03-stat-mech/fig_3_1.jpg"
 alt="Departure coefficient \(b_n\) as a function of principle quantum number \(n\), for hydrogen atoms in thermal plasma with \(T=10^{3.9}\) K, for three different densities. The departure coefficient gets quite large for diffuse regions. Attribution: Draine Figure 3.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Departure coefficient \(b_n\) as a function of principle quantum number \(n\), for hydrogen atoms in thermal plasma with \(T=10^{3.9}\) K, for three different densities. The departure coefficient gets quite large for diffuse regions. Attribution: Draine Figure 3.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;High \(n\) levels are interesting things. Transitions from
$$
n + 1 \rightarrow 1
$$
principal states are called \(n\alpha\) transitions. E.g., Lyman \(\alpha\) is the \(1\alpha\) transition.&lt;/p&gt;
&lt;p&gt;When \(n\) becomes very large, the photon is in the radio band of the spectrum, and these are called &amp;ldquo;radio recombination lines.&amp;rdquo; E.g., we might talk about the \(166\alpha\) or \(159\alpha\) transitions.&lt;/p&gt;
&lt;p&gt;The curve of \(b_n(n)\) also sets up conditions so that &lt;em&gt;maser amplification&lt;/em&gt; can occur.&lt;/p&gt;</description></item><item><title>Collisional Processes</title><link>https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/</link><pubDate>Sun, 22 Aug 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+2+Aug+25/1_gp4w3at4"&gt;Zoom link&lt;/a&gt;&lt;/p&gt;




&lt;h2 id="references-for-this-lecture"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#references-for-this-lecture"&gt;References for this lecture&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Draine Ch. 2 (primary)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="intro"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#intro"&gt;Intro&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Collisions are one of the main ways in which energy is transferred about the ISM, so in order to understand the physics of the ISM, we need to understand the physics of collisions.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;thermal, electrical, diffusion coefficients&lt;/li&gt;
&lt;li&gt;produce most excitations of atoms and molecules (which then emit photons)&lt;/li&gt;
&lt;li&gt;recombine ions + electrons&lt;/li&gt;
&lt;li&gt;chemical reactions&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Goal&lt;/strong&gt;: set up a basic physical framework for understanding different types of collisions, and how their rates depend on density and temperature.&lt;/p&gt;
&lt;p&gt;Types of collisional interactions and how we&amp;rsquo;ll understand them&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;long range Coulomb (charges, ions, etc&amp;hellip;) \(\propto 1/r\) potential: impact approximation&lt;/li&gt;
&lt;li&gt;intermediate range \(\propto r^{-4}\) potential induced dipole between ions and neutral atoms or molecules: scattering by \(\propto r^{-4}\) potential&lt;/li&gt;
&lt;li&gt;interactions between electrons and neutrals: experimental data&lt;/li&gt;
&lt;li&gt;short-range interactions between neutrals: &amp;ldquo;hard-sphere&amp;rdquo; estimates&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="collisional-rates-and-rate-coefficients"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#collisional-rates-and-rate-coefficients"&gt;Collisional rates and rate coefficients&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Consider a general two-body collisional process&lt;/p&gt;
&lt;p&gt;$$
A + B \rightarrow \mathrm{products}.
$$&lt;/p&gt;
&lt;p&gt;We can write the (average) reaction rate per unit volume (units \(\mathrm{cm}^{-3} \mathrm{s}^{-1}\)) as&lt;/p&gt;
&lt;p&gt;$$
n_A n_B \langle \sigma v \rangle_{AB}
$$&lt;/p&gt;
&lt;p&gt;i.e., the rate is proportional to the number density of each species and the &lt;strong&gt;two-body collisional rate coefficient&lt;/strong&gt;, \( \langle \sigma v \rangle_{AB} \), which has units of \(\mathrm{cm}^3 \mathrm{s}^{-1}\). Once you know the rate coefficient, this is a pretty simple relationship, just multiplying terms together. But let&amp;rsquo;s unpack how we calculate the rate coefficient.&lt;/p&gt;
&lt;p&gt;How do we calculate this coefficient?&lt;/p&gt;
&lt;p&gt;First, let&amp;rsquo;s consider the cross-section of the reaction&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://en.wikipedia.org/wiki/Cross_section_%28physics%29"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/cross-section.png"
 alt="Scattering Cross Section. Attribution: Wikipedia: Qwerty123uiop"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;Scattering Cross Section. Attribution: Wikipedia: Qwerty123uiop&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;$$
\sigma = \frac{1}{n \lambda}
$$&lt;/p&gt;
&lt;p&gt;where \(n\) is number density and \(\lambda\) is the mean free path. Generally, \(\sigma \) can be thought of as the cross-sectional area (hence the name) for the interaction, and it has units of area (\(\mathrm{cm}^2\)), just like a cross-section of a solid. I.e., if the cross-section of the spheres are larger, then they&amp;rsquo;re more likely to hit each other (interact). However, if the cross-section is very small, it&amp;rsquo;s unlikely that they&amp;rsquo;ll hit each other (interact).&lt;/p&gt;
&lt;p&gt;For the types of interactions we&amp;rsquo;re talking about, the cross-section of the interaction can be dependent on the relative velocity between the particles, \(\sigma(v)\).&lt;/p&gt;
&lt;p&gt;For a gas, what can we generally say about the distribution of velocities of the particles? For a fluid in thermal equilibrium, the distribution function of the relative speed of encounters between particles is given by a Maxwellian velocity distribution&lt;/p&gt;
&lt;p&gt;$$
f_v \, \mathrm{d}v = 4 \pi \left (\frac{\mu}{2 \pi k_B T }\right)^{3/2} \exp(- \mu v^2/2 k_B T) \, v^2 \mathrm{d} v
$$&lt;/p&gt;
&lt;p&gt;where
$$
\mu = \frac{m_A m_B}{m_A + m_B}
$$
is the reduced mass of the collisional partners. \(f_v\) is a distribution function, so to find the fraction of encounters that have relative speeds \(v_a &amp;lt; v &amp;lt; v_b\), we would just need to integrate over the relevant range
$$
\int_{v_a}^{v_b} f_v \, \mathrm{d}v,
$$
and to find the average value (i.e., expectation value) of some quantity \(X(v)\), we&amp;rsquo;d do
$$
\langle X \rangle = \int_{0}^{\infty} X(v) f_v \, \mathrm{d}v.
$$&lt;/p&gt;
&lt;p&gt;Getting back to the original goal, we want to calculate the &lt;strong&gt;two-body collisional rate coefficient&lt;/strong&gt;, \( \langle \sigma v \rangle_{AB} \), where \(X = \sigma v \). The cross section \(\sigma\) is a function of velocity \(\sigma(v)\), so we have \(X = \sigma(v) v \), or
$$
\langle \sigma v \rangle_{AB} = \int_0^\infty \sigma_{AB}(v) v f_v \, \mathrm{d}v.
$$&lt;/p&gt;
&lt;p&gt;Following Draine (S2.1), we can also do a change of variables to center of mass energy \(E = \mu v^2/2\) to make the calculations more convenient.&lt;/p&gt;
&lt;p&gt;Three-body collisions may be important in some situations, such as for populating the high-\(n\) levels of hydrogen. Their reaction rate is
$$
k_{ABC} n_A n_B n_C
$$
where \(k_{ABC}\) is the three-body collisional rate coefficient.&lt;/p&gt;




&lt;h2 id="inverse-square-law-forces"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#inverse-square-law-forces"&gt;Inverse-Square Law Forces&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Let&amp;rsquo;s consider the scattering by two particles interacting through an inverse square law \(1/r^2\), called Rutherford or Coulomb Scattering.&lt;/p&gt;




&lt;h3 id="question-time"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#question-time"&gt;Question time&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;How can we quickly get an estimate of the &lt;em&gt;momentum&lt;/em&gt; transferred between two particles when they interact?&lt;/li&gt;
&lt;li&gt;What are some quick ways we could estimate this? If we needed to be very accurate, what would we need to do?&lt;/li&gt;
&lt;/ul&gt;




&lt;h3 id="impact-approximation"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#impact-approximation"&gt;Impact approximation&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;A proper treatment involves some tedious integrals (see the &lt;a href="https://en.wikipedia.org/wiki/Cross_section_(physics)#Differential_cross_section"&gt;Wikipedia article&lt;/a&gt; for reference), but we can get to an order-of-magnitude estimate using the &amp;ldquo;impact approximation.&amp;rdquo;&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/fig_2_1.jpg"
 alt="Coordinates for the Impact Approximation. Attribution: Draine Figure 2.1"&gt;&lt;figcaption&gt;
 &lt;p&gt;Coordinates for the Impact Approximation. Attribution: Draine Figure 2.1&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Coulomb force&lt;/p&gt;
&lt;p&gt;$$
F_{12} = \frac{Z_1 Z_2 e^2}{r^2}
$$&lt;/p&gt;




&lt;h3 id="super-simple-way"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#super-simple-way"&gt;Super simple way&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Change in momentum is an &lt;em&gt;impulse&lt;/em&gt;,
$$
\Delta p = F \Delta t = m \Delta v.
$$
And we&amp;rsquo;ll calculate the impulse given by the product of the perpendicular force at closest approach times the characteristic interaction time. This is likely to be the &amp;ldquo;most wrong&amp;rdquo; calculation, but it should at least get us in the ball park in terms of order of magnitude.&lt;/p&gt;
&lt;p&gt;The instantaneous force perpendicular to the trajectory is
$$
F_\bot = \frac{Z_1 Z_2 e^2}{(b \cos \theta)^2} \cos \theta = \frac{Z_1 Z_2 e^2}{b^2} \cos^3 \theta
$$&lt;/p&gt;
&lt;p&gt;What is &amp;ldquo;characteristic time&amp;rdquo; for the interaction? A decent guess is \(b/v_1\). Then the change in momentum is just&lt;/p&gt;
&lt;p&gt;$$
\Delta p = \frac{Z_1 Z_2 e^2}{b v_1}.
$$&lt;/p&gt;
&lt;p&gt;A (slightly) more thorough treatment in the textbook (Draine 2.2.1) yields a more accurate
$$
\Delta p = 2 \frac{Z_1 Z_2 e^2}{b v_1}.
$$&lt;/p&gt;
&lt;p&gt;Great, how can we use this to estimate things relevant to the ISM, such as the ionization rate of an atom with a single bound electron?&lt;/p&gt;




&lt;h3 id="collisional-ionization"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#collisional-ionization"&gt;Collisional Ionization&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;Let \(I\) be the energy required to ionize the atom, i.e., completely unbind the electron.&lt;/p&gt;
&lt;p&gt;We will consider the ionizer to be (another) electron that is moving fast enough such that its kinetic energy is \(\gg I\). If we could &lt;em&gt;directly&lt;/em&gt; translate this energy to unbinding the electron, it would. But, we&amp;rsquo;re considering glancing (and more distant) interactions, so it&amp;rsquo;s not immediately clear how much we&amp;rsquo;d transfer.&lt;/p&gt;
&lt;p&gt;If \(b\) is small, more energy will transferred. If \(b\) is large, less energy will be transferred. There will be some \(b\) so large that an insufficient amount of energy will be transferred to unbind the electron and ionize the atom. Let&amp;rsquo;s use our impact approximation to calculate this.&lt;/p&gt;
&lt;p&gt;The criterion is
$$
\frac{(\Delta p)^2}{2 m_e} &amp;gt; I
$$
i.e., transfers of momentum exceedingFy the ionization energy.&lt;/p&gt;
&lt;p&gt;We use this and the \(\Delta p\) equation above to solve for a maximum impact parameter&lt;/p&gt;
&lt;p&gt;$$
b_\mathrm{max} = \left [ \frac{2 Z_p^2 e^4}{m_e v^2 I} \right ]^{1/2}
$$&lt;/p&gt;
&lt;p&gt;We can use the impact parameter as the &amp;ldquo;radius&amp;rdquo; of the cross-section, therefore the cross-section area for an ionizing interaction is
$$
\sigma(v) \approx \pi b_\mathrm{max}^2(v).
$$&lt;/p&gt;
&lt;p&gt;Now we calculate the thermal rate coefficient
$$
\langle \sigma v \rangle = \int_{v_\mathrm{min}}^\infty \sigma(v) v f_v \, \mathrm{d}v.
$$
\(f_v\) is still given by the Maxwellian distribution, so we have
$$
\langle \sigma v \rangle = Z_p^2 \left (\frac{8 \pi}{m_e kT} \right)^{1/2} \frac{e^4}{I}e^{-I/kT}.
$$&lt;/p&gt;
&lt;p&gt;For a hydrogen atom, the ionization energy is
$$
I = \frac{13.602}{n^2} \, [\mathrm{eV}]
$$
where \(n\) is the principal quantum number. For highly excited energy (\(n \approx 100\)), the ionization energy becomes small and the collisional ionization rate becomes very large.&lt;/p&gt;




&lt;h3 id="electron-ion-inelastic-scattering-collisional-strength-omega_ul"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#electron-ion-inelastic-scattering-collisional-strength-omega_ul"&gt;Electron-ion inelastic scattering: Collisional strength \(\Omega_{ul}\)&lt;/a&gt;
&lt;/h3&gt;
&lt;p&gt;We just discussed elastic scattering of electrons by ions, whereby momentum transfer between the two is dominated by &amp;ldquo;distant&amp;rdquo; encounters with impact parameters much larger than the relevant scales of the atom.&lt;/p&gt;
&lt;p&gt;When the fast-moving electron &lt;em&gt;does&lt;/em&gt; pass very close to the atom with bound electron, however, quantum mechanics plays a role (as it always does&amp;hellip;) and the bound electron may transition to another energetically allowed state.&lt;/p&gt;
&lt;p&gt;It&amp;rsquo;s common to write the cross-section for these interactions using a dimensionless quantity \(\Omega_{ul}\) called the collision strength. You can see S 2.3 of Draine for the full details, but the important part is that
$$
\langle \sigma v \rangle_{u \rightarrow l} \propto \frac{\Omega_{ul}}{g_u \sqrt{T}}.
$$
The good news is that&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;\(\Omega_{ul}\) is approximately indepedent of \(T\) for temperatures below 10,000 K&lt;/li&gt;
&lt;li&gt;typical values of \(\Omega_{ul}\) are 1 - 10&lt;/li&gt;
&lt;li&gt;this means collisional rate coefficient \(\langle \sigma v \rangle_{u \rightarrow l}\) is \(\propto T^{-1/2}\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Inelastic collisions play a role in line radiation from hot gas, since they leave the atom in an excited state, from which it decays by emitting a photon.&lt;/p&gt;




&lt;h2 id="ion-neutral-collision-rates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#ion-neutral-collision-rates"&gt;Ion-Neutral Collision Rates&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;A charged ion particle (more than just an electron) interacting with neutral particles (atoms or molecules). If the two are separated by more than a few Å, the charged particle creates an electric field \(\vec{E}\) which &lt;em&gt;polarizes&lt;/em&gt; the neutral.&lt;/p&gt;
&lt;p&gt;In the case of a simple hydrogen atom, you can think of this as the proton and electron moving to separate sides. The neutral acquires a dipole moment \(\vec{P} = \alpha_N \vec{E}\), where \(\alpha_N\) is the polarizability.&lt;/p&gt;
&lt;p&gt;This creates an interaction potential which looks like
$$
U(r) = -\frac{1}{2}\frac{\alpha_N Z^2 e^2}{r^4}.
$$&lt;/p&gt;
&lt;p&gt;Interactions in this potential are interesting.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/fig_2_2.jpg"
 alt="Three trajectories in a \(1/r^4\) potential. Attribution: Draine Figure 2.2"&gt;&lt;figcaption&gt;
 &lt;p&gt;Three trajectories in a \(1/r^4\) potential. Attribution: Draine Figure 2.2&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;When \(b &amp;lt; b_0\), the charged particle can undergo an &amp;ldquo;orbiting&amp;rdquo; trajectory, which has cross section for interaction \(\pi b_0^2\). If you do the math, this unique cross section leads to a rate coefficient \(\langle \sigma v \rangle\) that is independent of temperature.&lt;/p&gt;
&lt;p&gt;See Draine Table 2.1 for a list of Ion-Neutral scattering parameters. We&amp;rsquo;re talking about neutrals like H, He, \(\mathrm{H}_2\), and O, and ions like \(H^+, C^+, \mathrm{H}_2^+\).&lt;/p&gt;
&lt;p&gt;Orbiting trajectories bring ion and neutral into close contact, and if there is an energetically allowed outcome&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;collisional dexcitation&lt;/li&gt;
&lt;li&gt;exothermic charge exchange&lt;/li&gt;
&lt;li&gt;chemical exchange reaction
and the reaction rate coefficients for these reactions become comparable to the orbiting rate coefficient.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Because the rate coefficients are independent of temperature (all others we&amp;rsquo;ve examined &lt;em&gt;are&lt;/em&gt; dependent on temperature), exothermic ion-neutral reactions play a major role in chemistry of &lt;em&gt;cool&lt;/em&gt; interstellar gas.&lt;/p&gt;




&lt;h2 id="electron-neutral-collision-rates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#electron-neutral-collision-rates"&gt;Electron-Neutral Collision Rates&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Elastic scattering of electrons by neutrals. Can be important in &lt;em&gt;very low ionization&lt;/em&gt; environments (e.g., protoplanetary disks). Primary collision partner is \(H_2\). Rates are determined experimentally.&lt;/p&gt;




&lt;h2 id="neutral-neutral-collision-rates"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#neutral-neutral-collision-rates"&gt;Neutral-Neutral Collision Rates&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Interactions between two neutral species&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;at small separations is repulsive&lt;/li&gt;
&lt;li&gt;at larger separations is attractive (van der Waals) \(U(r) \propto r^{-6}\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The fact that attractive interaction is very weak and the onset of repulsive interaction sufficiently rapid that we can treat the two partners as &amp;ldquo;hard spheres,&amp;rdquo; i.e., the cross-sectional area is really just the characteristic size of the atom or molecule (~1 Å). Think of throwing two basketballs at each other. They will only interact if \(b &amp;lt; R_1 + R_2\), i.e., if the balls would hit each other. No need to consider effective cross sectional area for charges. For temperatures less than 100 K, the rate coefficient for neutral-neutral scattering is more than 10x smaller than the rate coefficient for ion-neutral scattering.&lt;/p&gt;




&lt;h2 id="review"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/02-collisional-processes/#review"&gt;Review&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Collisions are fundamental to ISM physics&lt;/li&gt;
&lt;li&gt;We&amp;rsquo;ve introduced a simple framework for understanding collisional rate coefficients for different types of interactions&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Introduction and Course Overview</title><link>https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/</link><pubDate>Sat, 21 Aug 2021 00:00:00 +0000</pubDate><guid>https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/</guid><description>&lt;p&gt;&lt;a href="https://psu.mediaspace.kaltura.com/media/Astro+542A+Lecture+1+Aug+23/1_vl7lkx54"&gt;Zoom Recording&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;The Interstellar medium course is at once a course about nothing and a course about everything. What I mean by that is we&amp;rsquo;re talking about the &lt;em&gt;space between things&lt;/em&gt;, some of the least dense parts of the galaxy. But we&amp;rsquo;re also going to touch on nearly all aspects of astrophysics:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;quantum mechanics of atoms and molecules&lt;/li&gt;
&lt;li&gt;radiative transfer&lt;/li&gt;
&lt;li&gt;star formation&lt;/li&gt;
&lt;li&gt;planet formation&lt;/li&gt;
&lt;li&gt;galactic stellar populations&lt;/li&gt;
&lt;li&gt;and even cosmology (slightly)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;That&amp;rsquo;s because this course is fundamentally about the interactions between all of these more easily-defined &amp;ldquo;units&amp;rdquo; of astrophysics. The science lectures in this course will basically fall into one or two categories:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;coverage/review of fundamental concepts/theories (e.g., quantum mechanics, radiative transfer, etc&amp;hellip;)&lt;/li&gt;
&lt;li&gt;applying/integrating those concepts to understanding the interstellar medium (e.g., dust emission, star formation, etc&amp;hellip;)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We&amp;rsquo;ll be talking about many fundamental astrophysical concepts in some detail (warning: not as much detail as courses designed to cover these subjects, but enough detail so that you&amp;rsquo;ll be able to follow along if you haven&amp;rsquo;t taken these courses already, or if you forgot)!&lt;/p&gt;




&lt;h2 id="what-is-a-graduate-course"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#what-is-a-graduate-course"&gt;What is a graduate course?&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Compared to undergraduate courses, graduate courses are one step closer to operating at the interface between established knowledge and full-blown research. Graduate courses try to distill (sometimes very recently gained) knowledge into an &lt;em&gt;efficient&lt;/em&gt; format for learning. Or, at least, a more efficient format than the alternative, which would be trying to assemble the knowledge oneself through reading various scientific articles. At the end of your Ph.D., you&amp;rsquo;ll look back on the series of papers you&amp;rsquo;ve written and the talks you&amp;rsquo;ve given at various conferences, and you&amp;rsquo;ll have your own set of new discoveries and topics that you know in better detail than nearly everyone else on this planet. Now, the question is, how can you efficiently teach a new generation the core concepts? That&amp;rsquo;s essentially what a graduate course is trying to do.&lt;/p&gt;




&lt;h2 id="course-schedule"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#course-schedule"&gt;Course Schedule&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;First third&lt;/strong&gt;: coverage/review of quantum mechanics, collisional processes, statistical mechanics, radiative transfer, absorption and emission lines, radiation fields, etc&amp;hellip;, with a focus on the processes necessary for understanding the components of the interstellar medium. &lt;em&gt;References&lt;/em&gt;: Draine textbook Chs 2 - 14&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Second third&lt;/strong&gt;: star formation, generally: clouds, astrochemistry, shocks, CO surveys, protostars. &lt;em&gt;References&lt;/em&gt;: Draine textbook Chs 14 - 42 (not all covered)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Final third&lt;/strong&gt;: protoplanetary disks, planet formation, exoplanets, radio astronomy, ALMA proposals. &lt;em&gt;References&lt;/em&gt;: additional resources.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="course-syllabus"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#course-syllabus"&gt;Course Syllabus&lt;/a&gt;
&lt;/h2&gt;




&lt;h3 id="format"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#format"&gt;Format&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;3 midterm exams (total 40%)&lt;/li&gt;
&lt;li&gt;three problem sets (total 20%)&lt;/li&gt;
&lt;li&gt;paper presentation (choose two dates Monday, August 30th) (15%)&lt;/li&gt;
&lt;li&gt;reviews from mock TAC (5%)&lt;/li&gt;
&lt;li&gt;ALMA proposal (20%)&lt;/li&gt;
&lt;li&gt;no final&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a href="https://iancze.github.io/courses/astro542/syllabus/"&gt;Link&lt;/a&gt;&lt;/p&gt;




&lt;h2 id="what-is-the-ism"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#what-is-the-ism"&gt;What is the ISM?&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;primarily, the disks of galaxies&lt;/li&gt;
&lt;li&gt;Gigantic range of \(T\) (\(10 - 10^6\)) and number density \(n\) (\(10^{-3} - 10^{-6} \mathrm{cm}^{-3}\))&lt;/li&gt;
&lt;li&gt;wide range of ionization states&lt;/li&gt;
&lt;li&gt;spatially inhomogeneous&lt;/li&gt;
&lt;li&gt;roughly solar abundances&lt;/li&gt;
&lt;li&gt;multi-phase&lt;/li&gt;
&lt;li&gt;far from equilibrium/steady-state&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://asd.gsfc.nasa.gov/archive/mwmw/mmw_product.html#slides" target="multi-wavelength Milky Way"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/mwmw.jpg"
 alt="A multi-wavelength view of the Milky Way. Image Credit: NASA"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;A multi-wavelength view of the Milky Way. Image Credit: NASA&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="ism-nomenclature"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#ism-nomenclature"&gt;ISM nomenclature&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Throughout this course we will be frequently discussing neutral atomic hydrogen, ionized atomic hydrogen (i.e., protons), and (neutral )molecular hydrogen. Atomic spectroscopists denote ionization states with Roman numerals, with &lt;strong&gt;I&lt;/strong&gt; corresponding to the neutral case.&lt;/p&gt;
&lt;p&gt;Take Calcium, for example. Neutral Calcium is Ca I, singly ionized Calcium is Ca II, and five-times ionized Calcium is Ca IV.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Neutral Hydrogen: H I&lt;/li&gt;
&lt;li&gt;Singly-ionized Hydrogen (i.e., a proton): H II&lt;/li&gt;
&lt;li&gt;Molecular Hydrogen: \(H_2\)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;When we say &amp;ldquo;H two,&amp;rdquo; which do we mean? In this course, at least, we&amp;rsquo;ll strive to maintain the convention that H II &lt;code&gt;==&lt;/code&gt; &amp;ldquo;H two,&amp;rdquo; and when we are referring to \(H_2\), we will say &amp;ldquo;molecular hydrogen.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;ISM is mostly hydrogen. Hydrogen states by mass (following Draine Table 1.2).&lt;/p&gt;
&lt;div id='vis'&gt;&lt;/div&gt;
&lt;script type="text/javascript"&gt;
 var spec = 'bar-chart.vg.json';
 vegaEmbed('#vis', spec).then(function (result) {
 
 }).catch(console.error);
&lt;/script&gt;




&lt;h2 id="phases-of-the-ism"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#phases-of-the-ism"&gt;Phases of the ISM&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Why does it matter that we have a clear distinction between the forms of Hydrogen? Because Hydrogen is the most abundant element (cosmologically speaking), and these trace the phases of the ISM.&lt;/p&gt;
&lt;p&gt;What do we mean by phases of the ISM? Broadly speaking, these are characteristic states of the baryons in the ISM. Different authors will make different levels of distinction, i.e., &amp;ldquo;two-phase&amp;rdquo; or &amp;ldquo;three-phase&amp;rdquo; ISM. In the case of Draine (Table 1.3), we have seven phases. In the case of Ryden and Pogge, we have five phases. Recognize that ultimately the &amp;ldquo;phases&amp;rdquo; are a continuum of states, and that it makes sense to group things into phases based upon the physical processes that occur with them.&lt;/p&gt;
&lt;p&gt;Don&amp;rsquo;t worry too much about the exact boundaries of each phase, but instead focus on the huge, &lt;em&gt;orders of magnitude&lt;/em&gt; range in densities, temperatures, and volume fractions. If you wanted to chose a most-important phase based on &lt;em&gt;volume&lt;/em&gt;, you&amp;rsquo;re going to get a different answer than if you chose one based on &lt;em&gt;mass&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;For now, let&amp;rsquo;s just reflect on Draine Table 1.3 / Ryden and Pogge Table 1.1.&lt;/p&gt;
&lt;p&gt;We see combinations of &lt;em&gt;temperature&lt;/em&gt;, &lt;em&gt;density&lt;/em&gt;, and &lt;em&gt;ionization state&lt;/em&gt;. The point of this course is basically understanding the interaction between these phases. Following Draine,&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Coronal gas, &amp;ldquo;hot ionized medium&amp;rdquo; or &amp;ldquo;HIM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;H II gas, dense H II regions, diffuse H II regions, &amp;ldquo;warm ionized medium&amp;rdquo; or &amp;ldquo;WIM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;Warm H I, &amp;ldquo;warm neutral medium&amp;rdquo; or &amp;ldquo;WNM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;Cool H I, &amp;ldquo;cold neutral medium&amp;rdquo; or &amp;ldquo;CNM&amp;rdquo;&lt;/li&gt;
&lt;li&gt;Diffuse molecular gas&lt;/li&gt;
&lt;li&gt;Dense molecular gas&lt;/li&gt;
&lt;li&gt;Stellar outflows&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Ryden and Pogge lump all molecular gas together, and don&amp;rsquo;t differentiate stellar outflows.&lt;/p&gt;
&lt;p&gt;The ISM is dynamic, and baryons in each of these phases can change into others. Ionizing photons will turn cold molecular gas to hot ionized hydrogen (HII); hot gas can cool to lower temperatures through radiative cooling; ions and electrons can recombine into atoms, and atoms can recombine to form \(H_2\) molecules.&lt;/p&gt;
&lt;p&gt;Let&amp;rsquo;s explore these phases &lt;a href="https://observablehq.com/@iancze/phases-of-the-ism"&gt;through graphs&lt;/a&gt;.&lt;/p&gt;


&lt;iframe width="100%" height="1034" frameborder="0"
 src="https://observablehq.com/embed/@iancze/phases-of-the-ism?cells=temp%2Cdensity%2Cvolume"&gt;&lt;/iframe&gt;
 




&lt;h2 id="sources-of-energy-in-ism"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#sources-of-energy-in-ism"&gt;Sources of Energy in ISM&lt;/a&gt;
&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Photons: CMB photons, infrared photons emitted from dust, and starlight&lt;/li&gt;
&lt;li&gt;Cosmic rays&lt;/li&gt;
&lt;li&gt;Magnetic field energy&lt;/li&gt;
&lt;li&gt;Bulk kinetic energy&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;All energy densities are approximately similar, and makes sense from a coupling standpoint.&lt;/p&gt;
&lt;figure&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/draine_1.3.png"
 alt="Flow of energy in the Milky Way (Draine Figure 1.3)"&gt;&lt;figcaption&gt;
 &lt;p&gt;Flow of energy in the Milky Way (Draine Figure 1.3)&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h2 id="historical-beginnings-of-ism-observations"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#historical-beginnings-of-ism-observations"&gt;Historical beginnings of ISM observations&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Ryden and Pogge Ch 1.1&lt;/p&gt;




&lt;h3 id="nebulae"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#nebulae"&gt;Nebulae&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Orion nebulae (1610), gas clouds&lt;/li&gt;
&lt;li&gt;Messier catalog (1787) contains many nebulae&lt;/li&gt;
&lt;li&gt;Kant and Laplace (1755/1797) nebular hypothesis&lt;/li&gt;
&lt;li&gt;William Huggins (1860s) spectra of planetary nebula, diffuse gas&lt;/li&gt;
&lt;li&gt;Henry Draper (1880) photographs orion nebula&lt;/li&gt;
&lt;/ul&gt;
&lt;figure&gt;&lt;a href="https://www.amateurastrophotography.com/top-5-orion-nebula-images"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/orion_nebula.jpg"
 alt="The Orion Nebula. Image Credit: Alejandro Lopez"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The Orion Nebula. Image Credit: Alejandro Lopez&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;





&lt;h3 id="interstellar-absorption-lines"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#interstellar-absorption-lines"&gt;Interstellar absorption lines&lt;/a&gt;
&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Hartmann (1904): stationary narrow line in a broad oscillating Ca II stellar line in a binary \(\delta\)-Ori (O9.5 star).&lt;/li&gt;
&lt;li&gt;Up until space astronomy, high spectral resolutions of absorption lines in stars was the main way to study cool interstellar clouds&lt;/li&gt;
&lt;li&gt;UV satellites like Copernicus, then IUE, HST, and FUSE allowed main transitions of HI, Lyman alpha, H2, etc. to be studied.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2 id="dark-clouds"&gt;
 &lt;a class="Heading-link u-clickable" href="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/#dark-clouds"&gt;Dark clouds&lt;/a&gt;
&lt;/h2&gt;
&lt;p&gt;Modern image:&lt;/p&gt;
&lt;figure&gt;&lt;a href="https://www.eso.org/public/images/eso9934b/"&gt;&lt;img src="https://iancze.github.io/courses/astro542/lectures/01-introduction-and-overview/barnard_68.jpg"
 alt="The dark cloud B68 at different wavelegths. Credit: ESO"&gt;&lt;/a&gt;&lt;figcaption&gt;
 &lt;p&gt;The dark cloud B68 at different wavelegths. Credit: ESO&lt;/p&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;What&amp;rsquo;s going on here?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;extinction diminishes with wavelength&lt;/li&gt;
&lt;li&gt;centers of clouds are denser than peripheries&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We see these in the visible light image of the Milky Way.&lt;/p&gt;</description></item></channel></rss>