Brought to you by:

A publishing partnership

The following article is Open access

The PHANGS-HST Survey: Physics at High Angular Resolution in Nearby Galaxies with the Hubble Space Telescope

, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and

Published 2022 January 10 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Janice C. Lee et al 2022 ApJS 258 10 DOI 10.3847/1538-4365/ac1fe5

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0067-0049/258/1/10

Abstract

The PHANGS program is building the first data set to enable the multiphase, multiscale study of star formation across the nearby spiral galaxy population. This effort is enabled by large survey programs with the Atacama Large Millimeter/submillimeter Array (ALMA), MUSE on the Very Large Telescope, and the Hubble Space Telescope (HST), with which we have obtained CO(2–1) imaging, optical spectroscopic mapping, and high-resolution UV–optical imaging, respectively. Here, we present PHANGS-HST, which has obtained NUV–UBVI imaging of the disks of 38 spiral galaxies at distances of 4–23 Mpc, and parallel V- and I-band imaging of their halos, to provide a census of tens of thousands of compact star clusters and multiscale stellar associations. The combination of HST, ALMA, and VLT/MUSE observations will yield an unprecedented joint catalog of the observed and physical properties of ∼100,000 star clusters, associations, H ii regions, and molecular clouds. With these basic units of star formation, PHANGS will systematically chart the evolutionary cycling between gas and stars across a diversity of galactic environments found in nearby galaxies. We discuss the design of the PHANGS-HST survey and provide an overview of the HST data processing pipeline and first results. We highlight new methods for selecting star cluster candidates, morphological classification of candidates with convolutional neural networks, and identification of stellar associations over a range of physical scales with a watershed algorithm. We describe the cross-observatory imaging, catalogs, and software products to be released. The PHANGS high-level science products will seed a broad range of investigations, in particular, the study of embedded stellar populations and dust with the James Webb Space Telescope, for which a PHANGS Cycle 1 Treasury program to obtain eight-band 2–21 μm imaging has been approved.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

How do stars form from the complex multiphase interstellar medium (ISM) in galaxies? This question lies at the heart of astrophysics, as star formation is a key mechanism governing the evolution of baryons in the universe (e.g., Péroux & Howk 2020). Star formation converts interstellar matter into stars and their planetary systems, depletes galaxies of gas, and feeds back metals, energy, and momentum into the ISM, which may reach the halos of galaxies and beyond. In turn, this feedback, together with galactic-scale inflows and dynamics, impacts the state of the gas and the future of star formation.

Many processes underlie this star formation cycle, and nearly all have been the focus of dedicated study. Star formation typically occurs in molecular clouds (e.g., Blitz 1993; Heyer & Dame 2015; Miville-Deschênes et al. 2017), parts of which can become gravitationally unstable and contract until new stars are born (e.g., Krumholz & McKee 2005; Dobbs et al. 2014; Chevance et al. 2020). This process is controlled by the intricate interplay between self-gravity and various opposing agents, such as supersonic turbulence, magnetic fields, radiation, and gas and cosmic-ray pressure (e.g., Elmegreen 2000; McKee & Ostriker 2007; Girichidis et al. 2020). The flow patterns within the galactic ISM help determine where and at what rate stars form, and are themselves influenced by the energy and momentum input from massive stars (e.g., Mac Low & Klessen 2004; McKee & Ostriker 2007; Hennebelle & Falgarone 2012; Federrath 2013). That is, the local process of stellar birth is impacted by the supply, organization, and stability of cloud-scale natal gas as governed by large-scale galaxy dynamics, including spiral arm features or perturbations from satellite galaxies or accretion of fresh gas from the cosmic web (e.g., Kennicutt 1998; Dobbs et al. 2006; Dobbs 2008; Leroy et al. 2008, 2013; Meidt et al. 2013, 2020). Stellar feedback, in the form of radiation, winds, and supernova explosions, creates a hierarchy of highly nonlinear feedback loops that impact ISM dynamics across a wide range of physical scales (e.g., Hopkins et al. 2014; Lopez et al. 2014; Walch et al. 2015; Gnedin 2016; Rahner et al. 2017; Olivier et al. 2021), thereby determining the chemical and thermal state of the ISM and affecting subsequent star formation (e.g., Klessen & Glover 2016).

We now understand from decades of study and observations across the electromagnetic spectrum that all of these processes that drive, regulate, and extinguish star formation operate together over a vast range of stellar, interstellar, galactic, and circumgalactic scales. Accordingly, we have come to recognize that systematic observations—spanning essential spatial scales and phases of the star formation cycle, across different galactic environments—are necessary for the development of a robust, unified model of star formation and galaxy evolution.

Here, we present the PHANGS−Hubble Space Telescope (HST) Treasury survey, which as part of the Physics at High Angular Resolution in Nearby Galaxies 40 (PHANGS) program, is building a data set for the systematic multiscale, multiphase study of star formation. PHANGS is charting the connections between giant molecular clouds, H ii regions, and young stars throughout a diversity of galactic environments in the local universe by combining observations from large surveys with the Atacama Large Millimeter/submillimeter Array (ALMA), MUSE on the Very Large Telescope (VLT), and HST. Supporting data including Very Large Array (VLA) H i and Astrosat far-ultraviolet/near-ultraviolet imaging, as well as the wealth of panchromatic ground- and space-based survey observations obtained for the nearby galaxies in the sample over the past three decades, have also been assembled. There are currently three major components of PHANGS:

PHANGS-ALMA: The foundation of PHANGS has been built with the transformative capabilities of ALMA. Through a Cycle 5 PHANGS-ALMA Large Program (PI E. Schinnerer) and smaller precursor programs, PHANGS has obtained ∼1'' resolution CO(2–1) maps for a sample of 74 massive spiral galaxies at distances of 4–23 Mpc. At these distances, ALMA can detect individual giant molecular clouds with better than 2.4 km s−1 velocity resolution and physical resolutions of ∼50–100 pc, while still efficiently covering the star-forming disk (Sun et al. 2018; Leroy et al. 2021a, 2021b).

PHANGS-MUSE and PHANGS-Hα: For 19 of these galaxies, with the VLT Multi Unit Spectroscopic Explorer (MUSE), PHANGS-MUSE (PI E. Schinnerer) has obtained IFU spectroscopy with ∼2.5 Å spectral resolution and ∼0farcs7 spatial resolution to deliver a 3D view of the ionized (104 K) gas, stellar populations, and kinematics via various gas and stellar tracers in the optical from 4800 to 9300 Å (Emsellem et al. 2021; see first results in Kreckel et al. 2018; Ho et al. 2019; Kreckel et al. 2019, 2020). To supplement the MUSE observations, the PHANGS-Hα survey (A. Razza et al. 2021, in preparation) has obtained seeing-limited (∼1'') narrowband Hα imaging to provide star formation rate (SFR) maps and catalogs of ionized nebulae for the full PHANGS-ALMA sample. The Hα imaging was obtained using WFI on the ESO/MPG 2.2 m telescope at La Silla, and the DirectCCD on the du Pont 2.5 m telescope at Las Campanas Observatory.

PHANGS-HST, the subject of this paper, is a Cycle 26 HST Treasury survey (PI J. C. Lee) that has obtained NUV–UBVI imaging of the disks of 38 galaxies from the parent PHANGS-ALMA sample and parallel V- and I-band imaging of their halos. The HST sample includes all 19 galaxies with MUSE IFU spectroscopy.

The high-resolution capabilities of HST (∼0farcs08) have enabled the study of compact star clusters and associations in galaxies out to distances of several tens of Mpc (e.g., Whitmore et al. 1999; Linden et al. 2017; Adamo et al. 2020). These structures, which typically have half-light radii of a few parsecs (Portegies Zwart et al. 2010; Ryon et al. 2017; Krumholz et al. 2019), have been the focus of much recent work and are not only important to study in their own right (e.g., Whitmore et al. 2007; Chandar et al. 2010a; Kruijssen 2012; Krumholz et al. 2019; Adamo et al. 2020), but also have great utility as "clocks"—effectively single-aged stellar populations that can be age-dated and used to time various star formation and ISM processes. The PHANGS-HST UV–optical imaging enables inventories of young star clusters and associations down to a few thousand solar masses, with age and mass determinations from spectral energy distribution (SED) fitting accurate to a factor of ∼2 (e.g., Turner et al. 2021) on average.

Altogether, PHANGS will yield an unprecedented sample of ∼100,000 star clusters, associations, H ii regions, and molecular clouds in diverse galactic environments to provide answers to the following open questions:

  • 1.  
    What are the timescales for the onset of star formation in clouds, the destruction of clouds, and the removal of gas from young star clusters?
  • 2.  
    How are the mass functions of star clusters/associations related to those of clouds? What are the implied star formation efficiencies?
  • 3.  
    How are star formation and gas organized into multiscale structures? How do their relative spatial distributions evolve with time?

These questions, particularly those examining the relationship between molecular clouds and star clusters, have been posed in the context of the Milky Way (Murray 2011; Lee et al.2016) and select Local Group and Nearby galaxies (e.g., M51: Hughes et al. 2013; Grasha et al. 2019; NGC 7793: Grasha et al. 2018; NGC 300: Kruijssen et al. 2019). But whether the answers to the questions vary with galactic environment is still unclear, as there has not yet been a systematic study on the cluster scale across a well-defined sample of galaxies spanning a broad range of global properties. PHANGS will provide not only the answers to these questions, but moreover will show how they depend on galactic properties such as the phase balance and physical conditions of the ISM, stellar mass, gas mass, SFR (as well as their surface densities), metallicity, and the presence of dynamical features such as rings, bars, and spiral arms.

Here, we present the design of the PHANGS-HST survey and provide an overview of the data processing pipeline developed to generate the data products required for the investigation of these questions. The remainder of this paper is organized as follows. In Section 2, we describe how galaxies are selected from the parent PHANGS sample for observation with HST and summarize the global properties of the sample. Our HST imaging observations with WFC3 and ACS are described in Section 3. Section 4 describes the PHANGS-HST pipeline used to produce catalogs of compact star clusters and associations and to measure their observed and physical properties. This high-level description is intended to provide the framework for a series of papers, as summarized in Section 4, that document each of the major components in detail, in particular new methods for selecting star cluster candidates, morphological classification of candidates, and identification of stellar associations over a range of physical scales. Data products resulting from this pipeline that will be released to support community science are described in Section 5. In Section 6, we conclude with a summary and look ahead to upcoming work on dust and young embedded stellar populations with the James Webb Space Telescope (JWST). Through a Cycle 1 Treasury program, we will add a fourth major component of PHANGS and obtain eight-band 2–21 μm imaging for the 19 galaxies with the full suite of PHANGS-ALMA, MUSE, and HST observations.

Magnitudes in this and other PHANGS-HST pipeline papers are given in the Vega system, unless otherwise noted, to facilitate comparison with prior HST studies of resolved stellar populations.

2. Galaxy Sample

Galaxies are chosen for HST observations from the PHANGS-ALMA Large Program sample (PI E. Schinnerer). PHANGS-ALMA has obtained CO(2–1) maps for a complete sample of 74 southern galaxies 41 (−75° < δ < 20°; i.e., ALMA-observable), which were selected to be massive (M* ≳ 109.75 M), star-forming, not edge-on to the line of sight, and at distances ≲17 Mpc. 42 CO(2–1) observations were obtained for this sample via an ALMA Cycle 5 Large Program (2017.1.00886.L), which builds upon and incorporates several smaller precursor programs in Cycles 2–3. 43 A full description of the PHANGS-ALMA Large Program sample criteria and derivation of integrated properties used for selection, such as stellar mass (M*), SFR, and integrated CO luminosities, is given in Leroy et al. (2021b).

With HST, we target the galaxies best suited for joint HST-ALMA analysis of resolved young stellar populations and giant molecular clouds. That is, we select galaxies from the PHANGS-ALMA parent sample that (1) have inclinations i ≲ 70° to minimize source blending and attenuation along the line of sight due to dust within the target, (2) avoid the Galactic plane (∣b∣ > 15°) to minimize the impact of Milky Way reddening and foreground stars, and (3) are sufficiently active (SFR ≳ 0.3 M yr−1) to ensure that wide-spread molecular cloud and star cluster populations are available for joint study. The resulting set of 38 galaxies chosen for HST observations is given in Table 1 along with basic properties (which have been refined and updated since the sample was first compiled Leroy et al. 2021b) relevant to their selection. The 19 galaxies for which VLT/MUSE optical integral field spectroscopy has been obtained are all included by the HST selection criteria.

Table 1. PHANGS-HST Galaxy Sample

Galaxy α δ b D σ(D)Method D(Ref) i T SFRtot SFRlog M* ΣCO
 (J2000)(J2000)(deg)(Mpc)(Mpc)  (deg) (M yr−1)(M yr−1)(log M)(M kpc−2)
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)
IC 195403h31m31fs39−51°54'17farcs4−51.20112.82.05NAM+TF3+4+5573.30.360.349.61.1
IC 5332*23h34m27fs49−36°06'03farcs9−71.3669.010.41TRGB18276.80.410.119.5−999
NGC 628*01h36m41fs75+15°47'01farcs2−45.7059.840.63TRGB295.21.750.9310.21.5
NGC 68501h47m42fs81−52°45'42farcs5−62.30419.942.99NAM4+5235.40.420.379.90.6
NGC 1087*02h46m25fs16−00°29'55farcs1−51.65115.852.24Group6435.21.311.259.91.3
NGC 109702h46m19fs05−30°16'29farcs6−64.68113.582.04NAM4+5493.34.743.8510.72.2
NGC 1300*03h19m41fs08−19°24'40farcs9−55.22318.992.85NAM4+53241.171.0210.61.0
NGC 131703h22m44fs29−37°06'13farcs3−56.69319.110.84Group2230.80.480.3710.61.7
NGC 1365*03h33m36fs37−36°08'25farcs4−54.59819.570.78TRGB2553.216.9012.8810.82.7
NGC 1385*03h37m28fs85−24°30'01farcs1−52.70617.222.58NAM4+5445.92.091.969.91.3
NGC 1433*03h42m01fs55−47°13'19farcs5−51.19518.940.56PNLF7291.51.130.5610.41.3
NGC 1512*04h03m54fs28−43°20'55farcs9−48.16617.930.88PNLF7431.21.280.5910.61.1
NGC 155904h17m35fs77−62°47'01farcs2−41.19819.440.44Mira8655.93.763.7610.21.5
NGC 1566*04h20m00fs42−54°56'16farcs1−43.39317.692.00Group63044.543.2110.72.0
NGC 1672*04h45m42fs50−59°14'49farcs9−38.99019.402.91NAM4+5433.37.606.6010.62.1
NGC 179205h05m14fs45−37°58'50farcs7−36.45316.202.43NAM4+56543.703.2310.51.8
NGC 277509h10m20fs12+07°02'16farcs633.98823.153.47NAM4+5411.60.870.7011.11.2
NGC 2835*09h17m52fs91−22°21'16farcs818.50912.220.94TRGB184151.240.579.80.9
NGC 290309h32m10fs11+21°30'03farcs044.5409.610.39TRGB196743.082.3910.61.7
NGC 3351*10h43m57fs70+11°42'13farcs756.3689.960.33TRGB2453.11.320.8710.31.4
NGC 362111h18m16fs51−32°48'50farcs626.0997.060.28TRGB18666.90.990.7810.01.4
NGC 3627*11h20m14fs96+12°59'29farcs564.41811.320.48TRGB2573.13.843.3110.71.9
NGC 4254*12h18m49fs60+14°24'59farcs475.19013.12.8SCM14345.23.072.7710.32.2
NGC 429812h21m32fs76+14°36'22farcs275.67314.921.37TRGB18595.10.460.439.91.4
NGC 4303*12h21m54fs90+04°28'25farcs166.27616.993.04Group62445.334.2510.62.1
NGC 4321*12h22m54fs83+15°49'18farcs576.89815.210.49Cepheid103943.562.4310.71.6
NGC 4535*12h34m20fs31+08°11'51farcs970.64115.770.37Cepheid104552.161.2310.51.2
NGC 453612h34m27fs05+02°11'17farcs364.73016.251.13TRGB2664.33.453.1210.21.4
NGC 454812h35m26fs45+14°29'46farcs876.83016.220.38Cepheid10383.10.520.3410.71.0
NGC 456912h36m49fs79+13°09'46farcs675.62315.762.36Group6702.41.321.2210.81.4
NGC 457112h36m56fs38+14°13'02farcs576.65414.91.2Cepheid16336.40.290.2610.00.9
NGC 465412h43m56fs58+13°07'36farcs075.88921.981.16Group10565.93.793.0610.51.4
NGC 468912h47m45fs56+13°45'46farcs176.60715.02.25NAM+TF3+4+5394.70.40.410.10.9
NGC 482612h56m43fs64+21°40'58farcs784.4234.410.19TRGB18592.20.20.1710.21.5
NGC 5068*13h18m54fs81−21°02'20farcs841.3765.200.21TRGB183660.280.209.40.8
NGC 524813h37m32fs02+08°53'06farcs668.75114.871.34Group64742.291.6610.31.8
NGC 674419h09m46fs10−63°51'27farcs1−26.1469.390.43TRGB185342.410.7110.71.0
NGC 7496*23h09m47fs29−43°25'40farcs6−63.80118.722.81NAM4+5363.22.262.009.81.2

Note. Column 1: Galaxy name. * Indicates PHANGS-MUSE integral field spectroscopy available, and will be observed in the PHANGS-JWST Treasury Survey. Columns 2−3: R.A. and decl. Column 4: Galactic latitude. Columns 5−8: Galaxy distances, uncertainties, and references as follows: (1) Karachentsev et al. (2004), (2) Jacobs et al. (2009), (3) Tully et al. (2016), (4) Shaya et al. (2017), (5) Kourkchi et al. (2020a, 2020b), (6) Kourkchi & Tully (2017), (7) F. Scheuermann et al. (2021, in preparation), (8) Huang et al. (2020), (9) Leonard et al. (2003), (10) Freedman et al. (2001), (11) Olivares E. et al. (2010), (12) Barbarino et al. (2015), (13) Tonry et al. (2001), (14) Nugent et al. (2006), (15) Reid et al. (2019), (16) Pierce et al. (1994), (17) Ruiz-Lapuente (1996), (18) Anand et al. (2020), (19) this paper. Column 9: Galaxy inclination. Following Leroy et al. (2021b) and adopted from Lang et al. (2020). Column 10: Morphological T-type. Column 11: Total galaxy star formation rate. Based on GALEX far-UV and WISE W4 imaging with SFR prescription calibrated to match results from population synthesis modeling of Salim et al. (2016, 2018) as in Leroy et al. (2021b). For NGC 685 and NGC 4689, GALEX far-UV imaging is not available, and the SFRs are based only on WISE W4 data. Column 12: Star formation rate, as computed for column 11, but limited to areas studied by PHANGS-HST, as shown in the footprints illustrated in Figure 3 and provided for the full galaxy sample at https://archive.stsci.edu/hlsp/phangs-hst. Column 13: Galaxy stellar mass. Following Leroy et al. (2021a), based on Spitzer IRAC 3.6 μm when available, or WISE 3.4 μm, and mass-to-light ratio prescription of Leroy et al. (2019) calculated as a function of radius in the galaxy. Column 14: Here, we calculate Σmol adopting a fixed ${\alpha }_{\mathrm{CO}}^{2-1}=6:25\,{M}_{\odot }\,{\mathrm{pc}}^{-2}$ (K km s−1)−1, appropriate for a Galactic conversion factor and a typical CO(2−1)/CO(1−0) line ratio (Sun et al. 2018). Thus, the x-axis indicates mean CO surface brightness in units of mass surface density.

Download table as:  ASCIITypeset image

The 38 PHANGS-HST galaxies probe a full range of global properties covered by the PHANGS-ALMA parent sample, which is itself representative of the overall present-day spiral galaxy population. This is illustrated in Figure 1 (upper left panel), which shows M* and SFR of the PHANGS sample and the subset targeted with HST, overlaid on the locus occupied by star-forming galaxies from the Sloan Digital Sky Survey. Figure 1 shows that the PHANGS samples provide excellent coverage of the galaxy "main sequence" between stellar masses of ∼109.5–1011 M. Main-sequence galaxies in this stellar mass range are representative of the environments where the bulk of molecular gas and present-day star formation are found (Salim et al. 2007; Saintonge et al. 2017). The bottom panels of Figure 1 illustrate the coverage of specific SFR as a function of morphological type and the SFR as a function of the molecular gas surface density. The PHANGS-HST observations include spiral galaxies with morphological types of Sa through Sd, sSFRs of ∼10−10.5–10−9 yr−1, SFRs of ∼0.2–17 M yr−1, and Σmol of ∼100.5–102.7 M pc−2.

Figure 1.

Figure 1. Upper left: coverage of the SFR–M* plane by galaxies in the parent PHANGS sample (N = 74; orange open and filled points) and those targeted for HST observations (N = 38; orange filled points). The PHANGS-HST subsample is representative of the d ≲ 20 Mpc massive galaxy population on the local star-forming main sequence, which contains the bulk of molecular gas and present-day star formation. Shown for context are: an SDSS local galaxy sample, which extends over a much larger volume to z ≲ 0.3 (grayscale; Salim et al. 2016); the HST LEGUS sample (blue; Calzetti et al. 2015b), which focuses on the nearest galaxies (d ≲ 11 Mpc) and includes a significant number of lower mass dwarfs; and luminous infrared galaxies within ∼350 Mpc from the GOALS sample (red; U et al. 2012). Upper right: distribution of distances of galaxies in the PHANGS parent sample and those selected for HST observations. All distances are provided in Table 1 and are taken from the compilation of Anand et al. (2020), with an update for one new TRGB distance derived from PHANGS-HST observations taken after the publication of that paper (see Section 3.2). Altogether, the PHANGS-HST V- and I-band parallel imaging have yielded eight TRGB measurements, which represent the best-available distances for those galaxies. Bottom panels: PHANGS-HST also provides coverage of a full range of specific SFRs, molecular gas surface densities, and spiral galaxy morphologies. Galaxies with MUSE IFU spectroscopy are indicated (N = 19) with crosses in the upper left and bottom plots. These 19 galaxies comprise the sample targeted by the PHANGS-JWST Cycle 1 Treasury program (see Section 6).

Standard image High-resolution image

To place the PHANGS-HST survey in further context, nearby galaxies from two other complementary HST imaging programs are also shown in the upper left panel of Figure 1: HST LEGUS (Legacy ExtraGalactic Ultraviolet Survey; Calzetti et al. 2015a), and GOALS (Great Observatories All-Sky LIRG Survey; Armus et al. 2009). As a Cycle 21 Treasury program, LEGUS also obtained five-band UV–optical imaging to study star clusters in a representative sample of 50 galaxies (Adamo et al. 2017), but focused on the nearest late-type systems (≲11 Mpc; Kennicutt et al. 2008; Lee et al. 2011) to also enable the reconstruction of star formation histories from individually resolved stars (Cignoni et al. 2018, 2019; Sacchi et al. 2018). LEGUS is therefore naturally dominated by lower mass, local volume dwarf and irregular galaxies (e.g., Lee et al.2009), which comprise about half of its 50 galaxy sample (Cook et al. 2019). PHANGS-HST builds on the groundwork laid by programs such as LEGUS, and earlier transformative work by the Panchromatic Hubble Andromeda Treasury (PHAT; Dalcanton et al. 2012) and the WFC3 Early Release Science (ERS) program. 44 The WFC3 imaging observations obtained by these successive programs, which provide multiwavelength coverage beginning in the near-UV (with ERS using up to 11 filters including both broad and narrow bands), have enabled the study of the physical properties of star clusters in samples of main-sequence galaxies that have grown from a few to over one hundred in the past decade (e.g., Chandar et al. 2010b, 2014; Johnson et al. 2016, 2017; Adamo et al. 2017; Grasha et al. 2017; Cook et al. 2019). At the edge of the parameter space occupied by nearby star-forming galaxies, GOALS has obtained HST imaging in the B, I, and H filters for ∼90 luminous infrared galaxies with thermal IR (8–1000 μm) dust emission greater than 1011 L (Haan et al. 2011; Kim et al.2013). (A subsample of 64 LIRGS from U et al. 2012 with stellar masses derived from SED fitting is shown in Figure 1.) Such highly active star-forming galaxies are rare in the present-day universe, so the GOALS sample extends over much larger distances compared to LEGUS and PHANGS-HST. GOALS galaxies are located at distances of up to ∼350 Mpc (z ≲ 0.08), and hence sample-wide studies of the stellar populations have focused on larger "clumps" (∼90 pc, Larson et al. 2020, but also see Linden et al. 2017). Currently PHANGS is the only program with uniform ALMA CO observations for a significant sample of nearby galaxies, but ultimately, analysis of all of these programs together are needed to fully understand the impact of galactic environment on star formation.

3. HST Observations

Imaging observations for the PHANGS-HST Treasury program (Cycle 26, PID 15654) were conducted from 2019 April to 2021 April with an allocation of 122 orbits. Previous to this program, no HST wide-field UV imaging was available for 80% of the PHANGS-HST sample, and 60% also did not have any optical imaging with either WFC3 or ACS. Thus, PHANGS-HST provides a critical augmentation to the HST archive for the set of nearby spiral galaxies where both star clusters and molecular clouds can be efficiently detected by HST and ALMA over galactic scales.

As discussed in Section 2, the PHANGS-HST sample contains a total of 38 galaxies. No new observations were conducted for four of these galaxies, since sufficient imaging for those targets were previously obtained by the LEGUS program. 45 Altogether, new observations were planned for 43 fields in 34 galaxies. Initial observations for eight fields were corrupted due to guiding failures, but were all rescheduled and successfully observed on the second attempt. These reobservations were the primary reason the execution of the program spanned over two years instead of one.

To illustrate the PHANGS coverage of each galaxy, footprints of the HST and ALMA observations, together with those for VLT/MUSE when available, are provided at MAST at https://archive.stsci.edu/hlsp/phangs-hst/. Examples of these footprint overlays are given in Figures 2 and 3 for six galaxies chosen to span the range of molecular gas surface densities, specific SFRs, and dynamical features (rings and bars) present in the sample. Figure 2 focuses on the PHANGS-HST prime imaging area in the context of the PHANGS-ALMA CO(2–1) maps. Figure 3 shows a wider areal view of each galaxy with Digitized Sky Survey images, 46 and also includes footprints of the PHANGS-HST parallel, PHANGS-ALMA, and PHANGS-MUSE observations.

Figure 2.

Figure 2. PHANGS-HST WFC3 UVIS footprints (162'' × 162'') overlaid on PHANGS CO(2–1) ALMA maps for six galaxies in our sample of 38, showing the enormous diversity of molecular gas content and morphology in present-day massive star-forming galaxies. CO maps are overlaid on wider field DSS imaging. Scale bars and galaxy distances are shown in the upper and lower right corners, respectively. Top: Targets showing decreasing molecular gas surface density and specific SFR, from left to right. Bottom: impact of dynamical features on the gas distribution; two examples showing differing responses of the gas to the influence of a bar (left, middle) and an example of a ring feature (right). Our WFC3/UVIS observations allow us to find and characterize young stellar clusters and associations over the same area covered by these detailed CO maps, to create the first combined atlas of clouds and clusters across a representative sample of massive main-sequence galaxies in the local universe.

Standard image High-resolution image
Figure 3.

Figure 3. Figures showing the overlap of the PHANGS-HST WFC3 UVIS (blue), ALMA (red), and MUSE (where available; cyan) observation footprints, overlaid on DSS imaging for the same six galaxies as in Figure 2. A larger field (20'×20') is shown relative to Figure 2 to illustrate the placement of the HST ACS parallel pointing (dashed lines). The WFC3 UVIS field of view is 162'' × 162'' and the ACS field of view is 202'' × 202''. Such footprint maps for the full PHANGS-HST sample can be found at https://archive.stsci.edu/hlsp/phangs-hst/.

Standard image High-resolution image

3.1. Prime Observations with WFC3

For each of the 43 new fields, we aimed to cover the region mapped in CO(2–1) by ALMA in five filters: F275W (NUV), F336W (U), F438W (B), F555W (V), and F814W (I). F275W is the shortest wavelength filter that avoids the 2175 Å dust feature, and in combination with the U and B bands serves to break the age-extinction degeneracy. The V and I bands are less affected by extinction and variation in the mass-to-light ratio, and serve to constrain the stellar mass.

For 31 fields (in 26 galaxies), new observations with the WFC3 UVIS camera were needed in all five filters. Three exposures with subpixel dithering were taken in each filter (the average pixel size of WFC3 UVIS is 0farcs04), and all 15 exposures were obtained in a three-orbit visit, yielding total exposure times of ∼2200 s (NUV), ∼1100 s (U), ∼1100 s (B), ∼670 s (V), and ∼830 s (I). A post flash of 5–10 e is applied for each exposure, with larger values in the shorter wavelength filters, to increase the background to 12 e, which is the recommended value to mitigate issues due to charge transfer efficiency (CTE) losses. The dither sequence is optimized to cover the WFC3 chip gap and to help recover the undersampled point-spread function (PSF).

For the other 12 pointings (in eight galaxies), suitable WFC3 or ACS data in one or more of these filters were taken by prior programs. The available archival data were obtained from MAST and processed in a consistent manner with our new observations, and new imaging was obtained only for the missing filters within a two-orbit visit.

3.2. Parallel Observations with ACS

While the primary observational goal of the PHANGS-HST survey is to obtain UV–optical imaging of the star-forming disk, we also simultaneously observe the galaxy halo with the Advanced Camera for Surveys Wide Field Channel (ACS/WFC) in "parallel" mode. Such parallel observations can potentially yield measurements of the galaxy distance if the tip of the red giant branch (TRGB) can be identified in a color–magnitude diagram of halo stars. Thus, we have designed our observations so that ACS imaging in F606W and F814W, filters commonly used for TRGB analysis (e.g., McQuinn et al. 2017; Anand et al. 2018) accompanies each WFC3 "prime" observation.

For the range of distances and angular sizes of the PHANGS-HST galaxies, the ACS field of view generally falls on the halo of the target galaxy when WFC3 is centered on the galaxy itself (Figure 3). Given that the science requirements of PHANGS-HST constrain the positioning of the prime pointings, optimizing placement of the parallel fields (as in a focused TRGB program) is a secondary priority and is restricted by the fixed spatial offset of the two cameras on the focal plane. For galaxies with relatively large angular sizes, the parallel observations may include portions of the outer disk. For smaller galaxies, the parallels may be too far to capture a significant number of halo stars. To the extent possible, ORIENT constraints were imposed to prevent the ACS field from entirely falling on the galaxy disk, on nearby galaxy neighbors, and/or on extremely bright foreground stars. For some targets with large angular sizes where it was not possible to avoid the disk completely, the field was positioned along the major axis to help differentiate between disk and halo stars. In several cases, the desired ORIENT constraints were lifted or relaxed to allow guide stars to fall into the area of the focal plane accessible to the Fine Guidance Sensors.

The five-band prime observations with WFC3/UVIS were sequenced in each orbit to optimize exposure time in the parallel observations without impacting the primary observations. As discussed in the previous section, WFC3 observations for each pointing required two or three orbits, depending on whether suitable HST archival observations of the galaxy disk were already available. For two-orbit visit pointings, the total exposure times in the ACS parallel V and I images are ∼2100 s each, while for the three-orbit visits they are about ∼3500 s and 3200 s, respectively. In both cases, three exposures were taken in each filter.

Distance constraints resulting from analysis of the TRGB based on parallel imaging obtained in the first year of the PHANGS-HST program (for 30 galaxies through 2020 July) are presented in Anand et al. (2020). We were able to measure TRGB distances for 10 PHANGS galaxies, 4 of which are the first published TRGB distance measurements for those galaxies (IC 5332, NGC 2835, NGC 4298, and NGC 4321) and 7 of which represent the best-available distances (IC 5332, NGC 2835, NGC 3621, NGC 4298, NGC 4826, NGC 5068, and NGC 6744). Analysis of the remaining six galaxies (with seven parallel fields: IC 1954, NGC 0685, NGC 1097, NGC 2903-N/S, NGC 5068, 47 and NGC 7496) yields one additional TRGB constraint. A distance of 9.61 ± 0.39 Mpc is found for NGC 2903-N, and represents the best-available distance for that galaxy. All PHANGS-HST TRGB measurements and accompanying color–magnitude diagrams will be available from the Extragalactic Distance Database (EDD; Tully et al. 2009; Anand et al. 2021). 48 In total, the PHANGS-HST ACS parallel observations have provided eight new TRGB measurements which are the current best-available distances. The eight measurements span from 4.41 ± 0.19 Mpc (NGC 4826) to 14.9 ± 1.4 Mpc (NGC 4298), and are listed in Table 1 together with other adopted distances from the literature compiled by Anand et al. (2020).

4. Data Processing Pipeline

To enable the joint HST-ALMA-MUSE study of star formation in basic units of star clusters, associations, molecular clouds, and H ii regions, we have developed an extensive HST data processing pipeline that produces inventories of stars (point sources), compact star clusters, and stellar associations across multiple physical scales in each galaxy. The pipeline yields aligned, mosaicked images in all five filters as well as catalogs of observed (e.g., photometry and morphological parameters) and physical properties (stellar masses, ages, and reddenings, derived through SED fitting). Ultimately, the PHANGS-HST star cluster and association catalogs will be cross-correlated with the PHANGS-ALMA molecular cloud catalogs (Rosolowsky et al. 2021, A. Hughes et al. 2021, in preparation) and PHANGS-MUSE H ii region catalogs (Santoro et al. 2021). Python packages and high-level science products from the HST pipeline are being publicly released and are described in Section 5.

Here, we summarize the overall strategy of the pipeline and provide a framework for subsequent papers that document each of the major components in detail. The key steps in the PHANGS-HST pipeline are:

  • 1.  
    Image drizzling, mosaicking, and astrometric calibration (this paper, Section 4.1).
  • 2.  
    Source detection and aperture photometry (Thilker et al. 2022, Section 4.2).
  • 3.  
    Identification of bright, isolated star clusters for determining aperture corrections (Deger et al. 2021, Section 4.3).
  • 4.  
    Derivation of aperture corrections for star clusters (Deger et al. 2021, Section 4.3).
  • 5.  
    Selection of candidate star clusters (Thilker et al. 2022, Section 4.4).
  • 6.  
    Morphological classification of candidate star clusters (Wei et al. 2020; Whitmore et al. 2021, Section 4.5).
  • 7.  
    SED fitting (Turner et al. 2021, Section 4.6).
  • 8.  
    Identification and photometry of stellar associations (K. Larson et al. 2021, in preparation, Section 4.7).

Our processing workflow reflects common practice for the production of catalogs of compact star clusters in nearby galaxies (e.g., Whitmore et al. 2010; Adamo et al. 2017), with the following key augmentations. First, selection criteria are based on measurement of a series of concentration indices (CI; the difference in photometry measured with circular apertures of two different radii) rather than a single concentration index (Section 4.4). Second, we inject model star clusters into the HST imaging to aid the definition of selection criteria to separate candidate clusters from point sources and other interlopers; this also provides a foundation to estimate completeness in future work (Section 4.4). Third, we utilize convolutional neural network (CNN) models, as discussed in Wei et al. (2020), to supplement human visual inspection, with the goal of eventually automating morphological classification of candidate star clusters, as this has been a limiting step in past cluster studies (e.g., Adamo et al. 2017; Whitmore et al. 2021). Fourth, we separate the process for selecting multipeaked stellar associations from single-peaked compact stellar clusters, by applying a watershed algorithm to identify associations at physical scales from 8 pc to 64 pc (Section 4.7). This produces a far more complete inventory of young stellar populations, which is crucial for robust comparisons with molecular clouds.

A flowchart illustrating the steps in the pipeline as summarized below is provided in Figure 5.

4.1. Drizzling, Mosaicking, Astrometric Calibration

The process used to drizzle and mosaic the HST imaging data follows current standard procedures.

Data acquired for PHANGS-HST are first obtained from MAST, along with other suitable archival data taken by previous programs. These "FLT" exposures have been processed through the standard Pyraf/STSDAS CALACS or CALWFC3 software in the archive, which performs initial data quality flagging, bias subtraction, gain correction, bias stripe removal, correction for CTE losses, dark current subtraction, flat-fielding, and photometric calibration, resulting in "FLC" FITS files for each ACS/WFC and WFC3/UVIS exposure.

The PHANGS-HST pipeline is based on the STScI-supported software package DRIZZLEPAC to combine exposures and improve sampling of the PSF and is used for the prime observations targeting the galaxy disk in a two-step drizzle procedure. The parallel observations are treated separately as discussed in Anand et al. (2020).

The pipeline takes the FLC FITS files retrieved from MAST as input to produce combined images for each filter, which are all aligned and drizzled onto a common grid with a pixel scale of 0farcs04 (the native WFC3 pixel scale), with astrometry calibrated with GAIA DR2 sources (Gaia Collaboration et al. 2018; Lindegren et al. 2018). The latter is essential for the proper alignment of the HST imaging with ALMA and VLT/MUSE data and to facilitate a joint study of the three data sets.

The V-band imaging (WFC3 F555W) is used as the reference for the positioning of the images in all other filters (NUV, U, B, and I) and to define the common pixel grid. Using the F555W FLC files from MAST for each pointing, the sky positions of the centers and corners of the images are calculated to define a search area to query the ESA DR2 GAIA catalog. The TWEAKREG routine then matches the GAIA sources to the objects detected in the F555W drizzled image and calculates average shifts (with an accuracy typically better than 0.1 pixel) to correct the astrometric solution. The number of GAIA sources found in a given F555W HST pointing varies from as few as 15 sources to a maximum of 317 with an average of 40 sources. The TWEAKBACK routine is then used to propagate the corrected WCS solution back to the original F555W FLC images. Finally, TWEAKREG and TWEAKBACK are used again, but now with the drizzled images for the other four filters to find sources in common with those detected in F555W and to align to the F555W image.

The final drizzle combination is carried out with sky subtraction using the "globalmin+match" method for sky calculation. ASTRODRIZZLE first finds a minimum "global" sky value for each chip/image extension in all input images and then uses the "match" method to compute differences in sky values between images in common sky regions and to equalize the sky values between images. The final DRC FITS files are in units of e s−1 and are registered with north up and east left as usual. "EXP" (exposure time) and "ERR" (error) weight (WHT) maps, calculated from the contribution of all input exposures to a given output pixel, are also produced. The ERR maps includes all noise sources from detector and sky and are used to compute uncertainties in photometry downstream in the pipeline.

Color composites of the drizzled, mosaicked HST imaging are shown in Figure 4 for the same six galaxies featured in Figures 2 and 3.

Figure 4.

Figure 4. Color composites of PHANGS-HST imaging (Red: WFC3/UVIS F814W, green: WFC3/UVIS F555W, blue: WFC3/UVIS F438W+F336W+F275W), overlaid on DSS imaging for the same six galaxies as in Figures 2 and 3.

Standard image High-resolution image

4.2. Source Detection and Photometry

The principal method of source detection in the PHANGS-HST pipeline is provided by the DOLPHOT photometry package (v2.0; Dolphin 2016), which is based on PSF fitting and operates on the FLC files to detect and deblend sources in HST imaging. We elect to use DOLPHOT for source detection in order to provide a common starting point for the identification of both single-peaked compact star clusters, and multipeaked stellar associations over a range of physical scales. The drizzled V-band image is used as the positional reference and sources are detected to 3.5σ with PSF fitting performed at the same image positions in all bands.

Compact star clusters have effective radii between 0.5 pc to about 10 pc (Portegies Zwart et al. 2010; Ryon et al. 2017). At the distances of the galaxies in the PHANGS-HST sample, such objects will have angular sizes close to the HST WFC3 resolution (i.e., 2 pixels, ∼0farcs08; from 1.7 to 9 pc for the range of distances for the galaxy sample) and will appear sufficiently point-like to be captured by DOLPHOT. To ensure that the catalogs include star clusters with light profiles that may be more extended than the sources detected by DOLPHOT, source detection is also performed on the V-band DRC image with DAOStarFinder (the Python implementation of DAOPHOT within the photutils astropy-affiliated package), with a kernel FWHM of 2.5 pixels and an effective signal-to-noise ratio (S/N) consistent with the detection threshold used for DOLPHOT. The merged DOLPHOT and DAOStarFinder catalogs provide the source lists from which compact star cluster candidates are identified, while the branch of the pipeline that identifies multipeaked stellar associations (Section 4.7) relies only upon the DOLPHOT catalogs, as indicated in the flowchart in Figure 5.

Figure 5.

Figure 5. Flowchart illustrating the PHANGS-HST star cluster and multiscale stellar association catalog pipeline. The chart begins at the upper left corner with the acquisition of HST imaging and proceeds from left to right along the rows, which represent the major components of the pipeline, and concludes with the production of catalogs at the bottom right of the chart. Dashed lines indicate high-level science products for public release via MAST, CADC, and the PHANGS collaboration website https://sites.google.com/view/phangs/home. † indicates that the galaxy distance is used in this step.

Standard image High-resolution image

Following source detection, the star cluster pipeline then uses the positions from the merged DOLPHOT-DAOStarFinder catalog to perform aperture photometry in all five filters. For all galaxies photometry is measured in circular apertures with a 4 pixel radius, with the background determined in an annulus between 7 and 8 pixels around the aperture. An aperture correction (Section 4.3) is applied that yields the total fluxes for sources ultimately identified as compact star clusters. Photometry is also measured in a series of circular apertures (with radii from 1 to 5 pixels) to compute concentration indices that are used to distinguish star cluster candidates from stars and other sources/artifacts (Section 4.4). Catalogs with these observed parameters are produced for each galaxy, which on average contain about ∼500,000 sources. The contribution of nonredundant detections from DAOStarFinder to these catalogs is less than ∼1% of objects with a V-band aperture photometry S/N > 10, the limit subsequently applied to select star cluster candidates as discussed in Section 4.4.

Thilker et al. (2022) discuss in detail the parameter choices for both the DOLPHOT and DAOStarFinder routines, and the procedures for obtaining aperture photometry and computing errors.

4.3. Star Cluster Aperture Corrections

Star clusters, in particular those that are young, are generally found in crowded regions. Direct, accurate measurement of the total flux is often not possible since the outer light profile is frequently contaminated by other sources. Thus, HST photometry of compact star clusters in galaxies beyond the Local Group is typically measured with a limited aperture that captures ∼50% of the total flux, and then a correction, determined from bright, isolated clusters, is applied (e.g., Chandar et al. 2010b; Adamo et al. 2017; Cook et al. 2019).

Deger et al. (2021) present a detailed discussion of procedures used to determine average aperture corrections and uncertainties for each galaxy. In summary, the V-band images are visually inspected to identify a few dozen well-detected, isolated, compact clusters, and these objects are used to compute an average correction for each field. Fixed offsets, based on the change in the encircled energy distributions of the WFC3 PSF with wavelength (which has a minimum FWHM of 0farcs067 in the V band, and increases to ∼0farcs075 in the NUV and I bands) 49 , 50 are used to calculate the corresponding corrections for photometry in the NUV, U, B, and I bands, since direct measurements from growth curves in those filters for bright sources in the V band that are very red or blue can be noisy (Cook et al. 2019; Deger et al. 2021). By construction, the resulting V-band corrections are ∼0.75 mag (for the adopted photometric aperture radius of 4 pixels, i.e., 0farcs16). The corrections are larger by 0.19, 0.12, 0.03, and 0.12 mag for the NUV, U, B, and I bands, respectively.

4.4. Star Cluster Candidate Selection

The PHANGS-HST pipeline identifies cluster candidates from the source lists described in Section 4.2 using photometric and morphological properties measured in the V band. In previous work, the concentration index (computed as the difference between photometry measured in circular apertures with radii of 1 and 3 pixels, CI13 or simply CI) has been generally used to remove sources likely to be stars from consideration (e.g., Whitmore et al. 2010; Adamo et al. 2017; Cook et al. 2019). To determine the threshold to separate stars from cluster candidates, CIs have been typically measured for a few dozen isolated, bright point sources in an image. These measurements are then compared with the CI distribution for an analogous sample of compact star clusters (identified through visual inspection and also used to derive aperture corrections).

For PHANGS-HST, Thilker et al. (2022) build upon this method to develop candidate selection criteria based on multiple concentration indices (MCI), rather than a single concentration index. We measure fluxes in circular apertures with radii of 1.0, 1.5, 2.0, 2.5, 3.0, 4.0, and 5.0 pixels, and we define two metrics to characterize the light profiles between 1 and 2.5 pixels (MCIin) and 2.5–5 pixels (MCIout), where

Equation (1)

where a, b, c, and d represent the radii of the apertures, and

Equation (2)

is a CI normalized by CIij,fiducial, defined to be the CI of a relatively compact cluster; i.e., a PSF-convolved Moffat/EFF87 function (Moffat 1969; Elson et al. 1987, EFF87) with a FWHM of 2 pixels, and a power-law slope of 3 describing the surface brightness profile of the extended halo. The choice of normalization is arbitrary, and enables the various CIs to be meaningfully averaged. With these definitions, clusters measured on HST optical images generally have values of −0.5 ≲ MCIin ≲ 0.2, −2 ≲ MCIout ≲ 0.6 (see Figures 7 and 6). MCIin is anticorrelated with the standard CI13, as would be expected from their definitions; i.e., more compact sources are characterized by larger MCI values (i.e., more concentrated), which is the opposite of the sense of the standard CI13. By construction, the fiducial cluster lies at the origin of the MCIin–MCIout plane.

We define selection regions in two ways on the MCIin–MCIout plane (Figure 6) to generate two types of candidate lists. (1) The first set of selection regions are contours in the MCI plane, based on the loci of synthetic star clusters inserted into the V-band imaging for each individual target galaxy. This selection strategy yields up to several thousand candidates per target, and CNN models are used for their classification. (2) The second are smaller polygonal regions, the same set of which are used for all galaxies. Definition of these polygons is based on the loci of human inspected and verified clusters, together with the inner synthetic star cluster contours (which encloses the highest concentration of models) for the first few PHANGS-HST galaxies studied. This second selection produces smaller samples, more suitable for time-intensive, expert human classification; it is intended to yield an average of ∼1000 candidates per field, so that inspection over the full set of 38 PHANGS-HST galaxies is manageable. Once defined, it provides a simpler, faster way of selecting cluster candidates for inspection by bypassing the synthetic star cluster analysis required to generate contours for a given galaxy. More specifically:

  • 1.  
    Selection criteria in the MCI plane are defined based on model star clusters. Synthetic clusters are generated with Moffat profiles, which again are parameterized by the effective radius (in units of the intrinsic/pre-PSF-convolution FWHM), and the power-law slope describing the surface brightness profile of the extended halo. The modeling includes 216 distinct Moffat profiles, spanning 0.5 pix ≤FWHM ≤ 7 pix and power-law slopes from 0.75 to 4. Models are generated for each of these profile types with a distribution of ∼4000 apparent magnitudes. The magnitudes are computed from the distance of the galaxy and V-band luminosities based on solar metallicity, single-aged stellar population models of Bruzual & Charlot (2003), for a grid of masses (103–105 M), ages (1–1000 Myr), and extinctions (0 ≤ AV ≤ 0.5 mag). The synthetic clusters are randomly inserted into the V-band images 200 at a time, and aperture photometry is performed with the same procedure used to measure real sources. In total, ∼4 × 106 clusters are inserted. The MCI values from these synthetic clusters are plotted on the MCI plane to form a 2D histogram with a bin size of 0.01, and a contour is derived. Contours are also generated for MCI bin sizes of 0.02 and 0.04, resulting in three nested model selection regions for each field (Figure 6). Typically, the inner model region contains the majority of candidates (∼50%–70%), while the two larger contours each add comparable numbers of candidates (20%–30%). Currently, the outer contour (i.e., corresponding to the largest bin size) is used to produce candidate lists for automated classification by CNNs (see Section 4.5).
  • 2.  
    Selection criteria in the MCI plane are also defined semi-empirically. For the first few PHANGS-HST galaxies studied (e.g., NGC 1559, Wei et al. 2020), sources were inspected over a wide swath of the MCI plane (essentially spanning the outer model contour). The loci of human verified clusters were used in combination with the tightest model contour to define the left, right, and top edges of the polygon. Catalogs of visually inspected clusters published by the HST LEGUS program for 34 HST fields in 30 galaxies 51 were then used to verify that the majority of LEGUS clusters would be captured by this selection (Figure 7). The lower bound of MCIout increases as the distance of the host galaxies increases—clusters will be less resolved and appear more compact at larger distances, and both the model contours and the loci of the human verified cluster populations do indeed shrink with increasing distance. Using the ensemble of cluster populations identified in the first few PHANGS-HST galaxies studied, plus those in the LEGUS catalogs, we adopt limits of MCIout = −1.7, −1.1, and −0.7 for galaxies with distances ≤8 Mpc, 8 Mpc <d < 14 Mpc, and ≥14 Mpc, respectively (Figure 7). This initial analysis over a wide swath of the MCI plane for PHANGS-HST sources suggests that the density of clusters rapidly drops beyond the boundaries of the polygon, but a more careful investigation of such completeness issues will be the subject of future work. These semi-empirical polygon selection regions are used to produce candidate lists for human inspection and classification (see Section 4.5 and Whitmore et al. 2021) to a total V-band limit of ∼24 mag (corresponding to absolute V-band limits between about −5.5 mag and −8 mag for the distances of the galaxies in the sample). The exact value of the magnitude limit depends on the number of candidates, since, again, the primary purpose of this second selection is to reduce the number of candidates for visual inspection to a manageable level (i.e., ≲1500 per galaxy) while maximizing the yield of true clusters. So, if the total number of candidates is low (several hundred) then the limit is fainter, whereas if the number of candidates is large, the limit is somewhat brighter.

Figure 6.

Figure 6. Star cluster candidate selection regions defined in the multiple concentration index plane (MCI; see Equation (1)) based on synthetic star clusters (large contours) for four galaxies at increasing distances from left to right (NGC 3627, NGC 1792, NGC 1559, and NGC 1365). Selection regions are also defined semi-empirically (polygon; see also Figure 7). Candidates are selected from detections that satisfy basic signal-to-noise ratio criteria (gray), and exclude those in the stellar (point-source dominated) region (small red contours). Candidates within the empirical selection region are slated for visual inspection to a V-band magnitude limit of ∼24 mag. The classification of the much larger samples of candidates identified with outermost model contours is automated using convolutional neural network models. In all panels, visually classified class 1 (blue) and class 2 (green) clusters, are shown. Some clusters appear outside the polygon—these result from ad hoc human inspection to confirm that the density of clusters rapidly declines outside this selection region, as well as from the inspection of sources brighter than the Humphreys–Davidson limit.

Standard image High-resolution image
Figure 7.

Figure 7. Semi-empirically defined selection regions for compact star cluster candidates in the multiple concentration index (MCI; see Equation (1)) plane. This selection is devised to produce candidate sample sizes amenable for human inspection, while maximizing the yield of true clusters. We verify that the selection region includes the majority of visually verified class 1 and 2 clusters from 30 galaxies studied by the LEGUS program (Calzetti et al. 2015a; Adamo et al. 2017, colored points; https://archive.stsci.edu/prepds/legus/dataproducts-public.html). The larger set of candidates from LEGUS are also shown (gray points). Clusters are color coded based on the distance of their host galaxy. The lower bound of MCIout increases as the distance of the host galaxy increases (see also Figure 6), since the physical resolution decreases and clusters appear more compact. We adopt different lower bounds for the three different distance ranges indicated in the legend.

Standard image High-resolution image

For both model-based (contours) and semi-empirical (polygon) selection methods:

  • 1.  
    The regions in the MCI plane dominated by stars (point sources) as detected by DOLPHOT, are defined for each field and used to exclude objects. Figure 6 shows both the synthetic and semi-empirical cluster selection regions for four galaxies at a range of distances together with their stellar exclusion regions.
  • 2.  
    Candidates must also satisfy basic criteria: V-band photometry measured in a 4 pixel radius must have S/N ≥ 10, and the source must also be detected in at least two other bands with photometric error ≤0.3 mag. The faintest sources in the resulting candidate lists for fields with the standard V-band exposure time of 670 s have total V-band magnitudes of ∼24.6, which corresponds to absolute magnitudes between −4.2 and −7.8 for the distance range spanned by the PHANGS-HST galaxies.
  • 3.  
    All sources with total, absolute V-band magnitudes brighter than −10 mag and −0.55 ≤ MCIin ≤ 0.45 (i.e., MCIin values plausible for real objects) are kept as candidates to help ensure high completeness for the brightest clusters. Sources brighter than −10 mag exceed the Humphreys–Davidson (HD) limit, the observed maximum luminosity of individual stars in the LMC, thought to be due to a modified Eddington Limit (Humphreys & Davidson 1979; Lamers & Levesque 2017). These "super HD sources" also undergo classification to remove interlopers such as saturated stars and background galaxies.

The number of candidates identified using the empirical MCI selection regions varies from many hundred to several thousand sources for each field, with a median of ∼1000 (Figure 8), and should ultimately yield sample sizes of compact star clusters similar to those studied in previous work. The variation in candidate sample sizes reflects the variation in global sSFRs for the PHANGS-HST galaxies that span a factor of ∼10 (Figure 1). The synthetic cluster MCI selection casts a wider net for potential clusters (Figure 6) and yields candidate samples about a factor two larger than the empirical selection (Figure 8). These larger samples enable analysis of potential incompleteness in previous star cluster studies, in particular for more diffuse clusters that appear to be rare.

Figure 8.

Figure 8. The number of compact star cluster candidates found in the first 15 galaxies processed through the PHANGS-HST pipeline, shown as a function of the sSFR. Candidates identified using the semi-empirical selection (polygon region) are shown in red, while those resulting from the larger model-based selection contours are shown in gray. The cluster candidates undergo a process of inspection to further remove contaminants and sort the objects into three morphological categories as summarized in Section 4.5, and described in detail in Wei et al. (2020) and Whitmore et al. (2021). The smaller samples resulting from the semi-empirical (polygon) selection are designed for human inspection, while the larger samples from the model contours are fed to convolutional neural networks for automated classification.

Standard image High-resolution image

These new selection methods, based on the measurement of multiple CI and the use of model star clusters, provides a solid foundation for quantitative investigation of structural properties including: which model clusters actually exist in nature, whether certain clusters are likely to be bound or unbound, and how their morphologies evolve with time. Our model grid of clusters also facilitates future work to characterize cluster completeness levels (e.g., Johnson et al. 2015). More discussion on the utility of the approach to cluster selection is provided in Thilker et al. (2022).

4.5. Star Cluster Candidate Inspection and Morphological Classification

The candidate star clusters then undergo a process of inspection to further remove contaminants and sort the clusters into different morphological categories. A combination of human visual inspection and automated inspection with CNN models is performed as follows:

  • 1.  
    The smaller lists of candidates resulting from the semi-empirical MCI selection region (polygons) are visually inspected by co-author B.C.W. Objects with a total V-band magnitude to a limit of ∼24 mag receive visual classifications.
  • 2.  
    The samples of candidates identified in the largest synthetic cluster MCI selection region (contours) are classified by CNN models, as described below. Candidates identified using the semi-empirical selection region generally lie within this synthetic cluster selection area (Figure 6), and hence will have both neural network and human visual classifications.
  • 3.  
    Human visual inspection of fainter candidates and those beyond the boundaries of the empirical selection region are performed on an ad hoc basis to evaluate and monitor the performance of the neural network models.

For PHANGS-HST, we adopt the general classification scheme used by LEGUS as described in Adamo et al. (2017) and Cook et al. (2019):

  • 1.  
    Class 1: compact star cluster—single peak, circularly symmetric, but radial profile more extended relative to point source.
  • 2.  
    Class 2: compact star cluster—similar to Class 1, but elongated or asymmetric.
  • 3.  
    Class 3: compact stellar association—asymmetric, multiple peaks.
  • 4.  
    Class 4: not a compact star cluster or compact stellar association (e.g., image artifacts, background galaxies, individual stars or pairs of stars).

Examples of Class 1, 2, and 3 objects are shown in Figure 9. Whitmore et al. (2021) provide a detailed description of the process of visual inspection and morphological classification, and discuss differences in the application of this scheme for Class 3 objects relative to the LEGUS project (i.e., we require evidence of four or more peaks within a radius of 5 pixels in PHANGS-HST). A brief history of star cluster classification is also given there (also see Wei et al. 2020, Section 2). While we continue to include Class 3 objects in our compact cluster catalogs, we note that this is mainly for historical continuity with the LEGUS project. The PHANGS-HST pipeline is optimized to identify single-peaked compact clusters, and this leads to a high level of incompleteness for multipeaked stellar associations. Instead, we introduce a new identification process for stellar associations, based on a watershed algorithm, which provides a far more complete inventory of young stellar populations and the star formation hierarchy at multiple physical scales, as will be summarized in Section 4.7 and presented in K. Larson et al. (2021, in preparation).

Figure 9.

Figure 9. Structures across the physical scales of the star formation hierarchy in NGC 3351, identified by the PHANGS-HST pipeline, from single-peaked compact star clusters, the densest structures, to larger scale multipeaked stellar associations. Bottom left: color composite of WFC3/UVIS F275W+F336W (blue), WFC3/UVIS F435W+F555W (green), and WFC3/UVIS F814W (red). Bottom right: young stellar associations (<10 Myr) traced by the watershed-based method of K. Larson et al. (2021, in preparation, blue contours), together with all compact clusters and associations with human visual classifications (Class 1: circles, Class 2: squares, Class 3: diamonds; color coded by age as indicated), overlaid on the PHANGS-ALMA CO(2–1) map. A 650 pc section of the outer ring (yellow box) is shown in more detail in the top left and middle panels. All three classes of compact clusters and associations are represented in the selected section, and the magnified view allows all four levels traced by the watershed method (64 pc, 32 pc, 16 pc, and 8 pc) to be clearly shown. Top left: magnified view using an Hα map constructed from the VLT/MUSE IFU data cube. Top middle: magnified view using a color-composite image where CO is now shown in red. Top right: further magnification of 180 pc areas centered on examples of the three classes of compact clusters and associations found in the selected 650 pc section of the outer ring in all PHANGS-HST filters.

Standard image High-resolution image

We note that it is debated whether such classifications for a given object can distinguish between gravitationally bound clusters and unbound associations, which may form and evolve under distinct conditions (Gieles & Portegies Zwart 2011; Kruijssen 2012; Ryon et al. 2017; Krumholz et al. 2019; Ward et al. 2020). Statistically however, it is expected that Class 1 should contain the highest percentage of bound clusters, and Class 3 should have the highest percentage of unbound associations (see Whitmore et al. 2021, for further discussion).

Ultimately, PHANGS-HST will generate up to ∼80,000 star cluster candidates for inspection and classification. In previous large studies of star clusters, the process of visual inspection has been a limiting step, which motivated the investigation of automated machine-learning techniques (Messa et al. 2018; Grasha et al. 2019; Perez et al. 2021). In Wei et al. (2020), we studied the application of deep transfer learning techniques to train CNN to classify star cluster candidates according to the scheme above. Deep transfer learning involves the tuning of a pretrained network, for example, based on the ImageNet library of everyday objects. 52 In principle, this approach enables CNNs to be successfully trained with relatively small samples (i.e., hundreds to a thousand images), which is the current size of HST star cluster samples with visual classifications. It provides an alternative to the process of training all network layers from scratch, which requires samples that are an order of magnitude larger. The results of Wei et al. (2020) were encouraging, as the prediction accuracies (70%, 40%, 40%–50%, and 50%–70% for Class 1, 2, and 3 star clusters and Class 4 nonclusters, respectively) were found to be competitive with the classification consistency between different human classifiers. The neural network models presented in Wei et al. (2020) provide a starting point for automated classification of the PHANGS-HST and other HST star cluster candidate samples, which can continue to be optimized. Whitmore et al. (2021) present results of the current Wei et al. (2020) models applied to clusters candidates in five PHANGS-HST galaxies, which also have classifications published by the LEGUS project. The Whitmore et al. (2021) analysis includes a detailed comparison between human and automated classifications, from overall prediction accuracy to differences in the distribution of ages, UBVI color–color diagrams, and stellar mass functions, and also includes a discussion of additional future work to improve performance.

4.6. SED Fitting

To derive ages, masses, and reddenings for the sources classified as star clusters and associations, we use a modified version of cigale 53 (Code Investigating GALaxy Emission; Burgarella et al. 2005; Noll et al. 2009; Boquien et al. 2019), a publicly available SED-fitting package developed for galaxies. Our modifications, which support the fitting of single-age populations and provide modeling options to facilitate comparison to prior SED modeling results for star clusters, are available in dedicated branches, SSP and SSPmag respectively, of the public git repository 54 of cigale. Turner et al. (2021) reports on these modifications and the analyses performed to validate the code.

SED fitting with cigale is performed on the five-band photometry (NUV–UBVI) for both compact star clusters and stellar associations. The fitting is based on the simple (single-aged) population synthesis models of Bruzual & Charlot (2003), assuming solar metallicity and a Chabrier (2003) IMF (with standard mass limits of 0.1–100 M), and no addition of nebular emission. The Cardelli et al. (1989) extinction curve with RV = 3.1 is used and a maximum E(BV) = 1.5 mag is imposed. The reasoning for these choices is discussed in Turner et al. (2021).

When CIGALE is run with the same assumptions and theoretical models as used in LEGUS, Turner et al. (2021) find very similar cluster ages and masses. Quantitatively, fits to identical cluster photometry yield (logarithmic) medians in the ratios of ages and masses for the two surveys of 0.001 ± 0.017 dex and 0.003 ± 0.011 dex, respectively.

4.7. Stellar Association Identification

The majority of star formation occurs in stellar associations (Lada & Lada 2003; Ward & Kruijssen 2018; Ward et al. 2020; Wright 2020, and references therein). Compact star clusters, the focus of the previous sections, are formed only in the densest peaks of the star formation hierarchy (Elmegreen 2008; Kruijssen 2012) and contain between 1% ∼ 50% of the total star formation in galaxies (Kruijssen 2012; Adamo et al. 2015, 2020; Johnson et al. 2016; Chandar et al. 2017; Krumholz et al. 2019). To produce catalogs of stellar associations, methods distinct from those used to identify single-peaked compact clusters are needed to segment the light distribution over larger physical scales and to probe further into the star formation hierarchy. Development of such methods is particularly important for obtaining complete inventories of the youngest stellar populations (≲10 Myr), which are a requirement for a robust joint analysis with molecular clouds and H ii regions.

For PHANGS-HST, K. Larson et al. (2021, in preparation) develop a technique to produce catalogs of stellar associations spanning scales from 8 to 64 pc. The technique builds upon the watershed routine in the scikit-image Python package (skimage.segmentation.watershed, van der Walt et al. 2014), which is based on the concept of geological watersheds. The routine identifies regions by "flooding" an image, given a set of markers as the starting points. In addition to the image on which associations are to be identified (i.e., requiring segmentation), the inputs needed by watershed are a list of marker positions and an image mask that defines the areas over which regions are allowed to grow.

Our adopted technique deploys watershed on a smoothed, filtered map of the positions of point sources (rather than directly on the HST images) and uses a two-parameter procedure to determine the marker positions and to produce the image mask from these position maps. The smoothed, filtered, position maps are produced as follows. Point sources are selected from the DOLPHOT catalogs that satisfy basic requirements on signal-to-noise ratio, sharpness, and data quality. The positions of these point sources are used to create maps with the same pixel grid as the PHANGS-HST DRC images, containing values of 1's corresponding to the DOLPHOT positions, and 0's otherwise. The maps are then smoothed with Gaussian profiles with FWHM of 2n pc for n = 3, 4, 5, and 6 (i.e., 8, 16, 32, and 64 pc), computed given the distance of the galaxy. Finally, a high-pass filter is applied by subtracting a map that has been smoothed with a kernel that is four times larger.

Two parameters are then defined that are tied to the characteristics of a single object on these smoothed, filtered position maps. The peak threshold parameter is the level above which markers (local maxima) are identified; it is currently defined to be 1.5 times the maximum value for a single object so that the resulting regions are multipeaked. The edge threshold parameter is the minimum surface "brightness" level beyond which the regions are not allowed to expand further, and used to create a mask image for watershed; it is defined to be the surface "brightness" at the FWHM of a single object.

The smoothed, filtered, position maps enable the identification of structures on the physical scales over which the maps have been smoothed. The 8 pc smoothed maps allow associations that overlap in size with sources in the compact cluster catalog to be studied, and the maps that are smoothed over larger scales enable the greater star formation hierarchy to be traced. Structures are identified based on the NUV and V-band DOLPHOT point-source catalogs. The resulting NUV-band selected association catalogs will predominantly contain young structures (≲100 Myr). In comparison, V-band selected catalogs will include structures over a larger range of ages and will facilitate comparison with the compact cluster samples, which have also been V-band selected.

K. Larson et al. (2021, in preparation) demonstrate the validity of this technique for identification of stellar associations in the PHANGS-HST galaxy sample based on an analysis of NGC 3351 and NGC 1566.

Figure 9 shows the watershed-identified stellar associations in NGC 3351 together with visually classified objects from the compact star cluster pipeline (Section 4.4). Three different magnifications of the galaxy are shown to illustrate the full disk and star-forming ring (bottom panels), the multiscale associations traced by the watershed method (top left and middle panels), and individual compact star clusters and associations (series of postage stamps at top right). The structures are overlaid on color composites of the HST imaging, ALMA CO(2–1) map, and a color composite of the Hα and [N ii] λ6583 maps from MUSE. Only the youngest (<10 Myr) watershed associations are shown to illustrate the correlation with the blue starlight, molecular clouds, and H ii regions. Ages are derived using CIGALE as summarized in Section 4.6 and documented in Turner et al. (2021). Details on the procedures used to measure fluxes in the regions are presented in K. Larson et al. (2021, in preparation).

From examination of the properties of the resultant watershed structures, we find:

  • 1.  
    Sample sizes of several hundred up to a few thousand associations in each galaxy. For associations identified on physical scales comparable to the aperture sizes used for selecting compact clusters candidates (a 4 pixel radius, which corresponds to 8 pc at a distance of 10 Mpc; Section 4.4), the numbers of associations identified are about ∼2–4 times larger than the numbers of visually classified compact clusters.
  • 2.  
    The process is shown to successfully identify structures at the defined scale. The size distributions are well defined, and approximately log-normal with medians near the FWHM of the smoothing kernel.
  • 3.  
    Fluxes computed within the boundaries of regions identified on the 8 and 16 pc smoothed images yield colors that are consistent with single-aged stellar population tracks on the UBVI color–color diagram. The loci are similar to previously studied samples of compact star cluster and associations (Figure 10). The similarity between two panels of Figure 10 demonstrates that these structures, which have been identified with a somewhat complicated algorithm, do behave as one might expect for groups of stars that are physically associated and born close in time. The associations may not strictly be singled-aged populations, but rather composite populations that can be approximated by a single-aged model. The level of deviation from a single-aged population, particularly as a function of the size of the region, is an issue that will be the topic of future work. Such work will be facilitated by the PHANGS-HST pipelines use of CIGALE, which allows for the self-consistent modeling of both single-aged and composite stellar populations as discussed in Turner et al. (2021).
  • 4.  
    Maps of the youngest associations (<5 Myr), show excellent correspondence to H ii regions observed in narrowband Hα imaging. As the age of the regions increases, they become more anticorrelated with the H ii regions, as would be expected (Figure 11). This provides some evidence that the age-dating of the youngest structures appears to produce reasonable results, despite the simplifying assumption of a single-aged population in the SED modeling and the complexities of the overall process to define the regions and measure their photometry.

Figure 10.

Figure 10. Comparison of UBVI color–color diagrams for multipeak stellar associations identified with the PHANGS-HST watershed-based procedure (left panel), and compact star clusters and associations from the LEGUS program (right panel). The stellar associations are based on a V-band map of point-source positions smoothed with a 16 pc FWHM Gaussian kernel.

Standard image High-resolution image
Figure 11.

Figure 11. Stellar associations in NGC 1566 color coded by SED-fit age. Associations were identified based on a map of F555W point-source positions smoothed with a 32 pc FWHM Gaussian kernel, and SED fitting was performed with CIGALE assuming a single-aged stellar population. Left: the youngest associations (≤3 Myr: blue, 3–5 Myr: green) overlaid on PHANGS ground-based Hα narrowband imaging (A. Razza et al. 2021, in preparation). The HST data do not extend to the corners of the imaging, and associations have not been traced in those areas. Middle: same as the left panel, but expanded to show greater detail in a portion of the northern spiral arm. Right: the same region as the middle panel, but now overlaid on the F555W (V-band) imaging and including older associations (5–10 Myr: yellow, 10–60 Myr: orange, >60 Myr: red). Age ranges are exclusive of the lower bound and inclusive of the upper bound.

Standard image High-resolution image

Initial testing of our watershed-based methods for identifying stellar association have been performed with NGC 3351 and NGC 1566. These two galaxies were chosen as they span a significant range of distances in the sample (from ∼10–18 Mpc), and because catalogs of clusters and compact associations are available from LEGUS for comparison. As with our new methods for identifying clusters, the completeness of our samples of multiscale associations, particularly possible systematics as a function of distance, will need to be studied with simulations.

5. Data Products

The PHANGS-HST data set will enable science extending well beyond the primary goals of the PHANGS collaboration. To enable the research community to make full use of the PHANGS-HST data, high-level science products from our star cluster and association catalog pipeline are being released. The following will be available through the PHANGS homepage at Mikulski Archive for Space Telescopes (MAST) 55 with doi:10.17909/t9-r08f-dq31.

5.1. Imaging

For imaging of the star-forming disk in NUV–UBVI bands for the 38 PHANGS-HST galaxies:

  • 1.  
    FLC FITS file for each exposure with astrometric solutions updated based on GAIA DR2 sources.
  • 2.  
    Combined DRC FITS images of individual pointings in each filter, each drizzled onto a common pixel grid defined for the galaxy target, also with astrometric solutions based on calibration to GAIA DR2 sources.
  • 3.  
    Mosaicked DRC FITS images in each filter for 13 galaxies covered by multiple pointings (NGC 628, NGC 1097, NGC 1300, NGC 1512, NGC 1672, NGC 2903, NGC 3351, NGC 3621, NGC 3627, NGC 4254, NGC 4321, NGC 4536, an NGC 6744).
  • 4.  
    ERR and EXP weight FITS images for individual pointings as well as mosaics.

5.2. Catalogs

  • 1.  
    DOLPHOT catalogs with five-band PSF-fitting photometry.
  • 2.  
    Compact star cluster and stellar association candidate catalogs, including position, five-band aperture photometry, stellar mass, age, reddening, CNN morphological classification, visual classifications for a subset of candidates in the empirical selection region, multiple concentration indices (MCI), and standard concentration index (CI) values.
  • 3.  
    Catalogs of stellar associations detected at 8, 16, 32, and 64 pc scales, including five-band region photometry, stellar mass, age, reddening, and effective radius, together with DS9 region files providing peak position and boundaries of regions, and FITS masks of regions.

5.3. Software

  • 1.  
    The Python routines that constitute the PHANGS-HST compact star cluster and association pipeline will be released at https://github.com/PhangsTeam.
  • 2.  
    CIGALE augmentations for SED fitting of single-aged stellar populations are available in dedicated branches, SSP and SSPmag, respectively, of the public git repository 56 of cigale.
  • 3.  
    CNN models for cluster candidate classification as described in Wei et al. (2020) and Whitmore et al. (2021). An annotated Python notebook containing scripts to run the models will be provided. Future updates shown to be improvements over the current models will be released as they are developed.

5.4. ALMA CO and MUSE Data Products

PHANGS-ALMA data have been released for the full PHANGS parent sample through the ALMA Archive and the Canadian Astronomy Data Centre (CADC 57 ), and also linked to the PHANGS portal at MAST. The PHANGS-ALMA products include the 12CO(2–1) spectral line data cubes, signal masks, and derived products such as the integrated intensity, line-of-sight velocity estimate, and spectral line widths. A description of the data reduction, products, and release is provided in Leroy et al. (2021a).

Likewise, PHANGS-MUSE integral field spectrograph data of the 19 galaxies targeted in the course of VLT Large Programme (ESO 1100.B-0651) has been released via the ESO Science Archive 58 and CADC, and also linked to the PHANGS portal at MAST. The released MUSE data include reduced and fully mosaicked data cubes as well as a series of two-dimensional maps associated with the gas and stellar tracers: broadband reconstructed images, emission line distribution and kinematics, stellar kinematics, star formation histories (mass and light-weighted age and metallicity maps), extinction maps from Balmer decrement, and stellar continuum fitting. The details of the data reduction and analysis processes are provided in Emsellem et al. (2021).

Links to the archive locations for all released PHANGS products are available at the survey webpage (http://phangs.org/data).

6. Summary

For decades, investigations of extragalactic molecular clouds and young resolved stellar populations have proceeded independently, and an integrated analysis has been performed only for case studies of select nearby galaxies. With the transformative capabilities of ALMA and HST working in concert, PHANGS will help bridge the fields of star formation and galaxy evolution by investigating how small-scale physics, which creates the basic quanta of star formation, may depend on the physical conditions of the greater galactic environment, and conspire to produce the scaling relationships that characterize the global properties of galaxies.

With five-band NUV–UBVI imaging of the disks of 38 spiral galaxies at distances of 4–23 Mpc, and parallel V- and I-band imaging of their halos, PHANGS-HST is providing a census of tens of thousands of compact star clusters and associations, which will be combined with PHANGS-ALMA giant molecular clouds (and PHANGS-MUSE H ii regions for 19 galaxies in the sample). Previous to this program, no HST wide-field UV imaging existed for 80% of the PHANGS-HST sample, and 60% did not have optical imaging with either WFC3 or ACS. Thus, PHANGS-HST provides a critical augmentation to the HST archive for nearby spiral galaxies in which both star clusters and molecular clouds can be efficiently detected by HST and ALMA over galactic scales. Altogether, PHANGS will provide an unprecedented joint catalog of the observed and physical parameters for ∼100,000 star clusters, associations, H ii regions, and molecular clouds.

In this paper, we have described the ensemble global properties of the 38 galaxies targeted for HST observations, and how they were selected from the parent PHANGS-ALMA sample of nearby massive galaxies on the star-forming main sequence. The acquisition and processing of the HST observations to produce aligned, drizzled, science-ready images were described in detail. An overview of the major components of the pipeline developed to produce catalogs of single-peak compact star clusters, and a parallel pipeline for multiscale stellar associations, was provided as a framework for forthcoming detailed papers on each of those components. We highlight new methods involving multiple concentration index (MCI) parameters, synthetic star clusters, and CNN models for cluster candidate selection and morphological classifications, as well as a watershed algorithm based procedure for identifying stellar associations from smoothed, filtered maps of point-source positions. We described the data products to be released via MAST at https://archive.stsci.edu/hlsp/phangs-hst/, including imaging, catalogs, and software, which will enable community science beyond the main goals of PHANGS.

These data products and the PHANGS census of star clusters, associations, H ii regions, and molecular clouds will provide the context needed for meaningful study of the earliest phases of dust enshrouded star formation and ISM physics with JWST. Molecular clouds and UV bright clusters/associations are the precursors and descendants of the youngest dusty clusters in nearby galaxies to be uncovered by JWST through their infrared emission. With HST-matched resolution in the near-IR (PSF FWHM 0farcs066 at 2 μm) and order-of-magnitude improved resolution compared to Spitzer in the mid-IR (PSF FWHM of 0farcs665 at 21 μm), molecular clouds with embedded sources can be identified, enabling a key test of our census of inactive clouds, and measurement of time to star formation onset.

Through a PHANGS Cycle 1 Treasury program 59 with an allocation of 107 hours, we will obtain NIRCAM and MIRI imaging in eight bands from 2 to 21 μm for the 19 galaxies with the full set of PHANGS-ALMA, MUSE, and HST observations. Imaging in the F200W, F300M, and F360M filters will provide a low obscuration view of stellar photospheric emission (with some contribution from hot dust to F300M and F360M). F335M, F770W, and F1130W will capture polycyclic aromatic hydrocarbon (PAH) emission, tracing a combination of size and charge, with the F300M and F360M filters enabling continuum subtraction for the 3.3 μm PAH feature. F1000W and F2100W will provide measurements of the warm dust continuum (with some contribution to the F1000W band by silicate absorption).

By resolving the infrared emission across these 19 morphologically diverse galaxies into individual regions and clusters (5–50 pc scales), the PHANGS-JWST observations will complete the inventory of star formation activity in our targets, and reveal the physical state of the small dust grains that heat the ISM. We will calculate mass functions,and spatial distributions for young embedded sources, study their relation to those of other populations, and more clearly identify the conditions that ignite star formation. The combination of uniform, systematic observations from JWST combined with those already in hand from HST, ALMA, and VLT/MUSE will significantly advance our understanding of the multiscale process of star formation, and the progression from clouds to visible stars in a galactic context.

Based on observations made with the NASA/ESA Hubble Space Telescope, obtained from the data archive at the Space Telescope Science Institute. STScI is operated by the Association of Universities for Research in Astronomy, Inc. under NASA contract NAS 5-26555. Support for Program number 15654 was provided through a grant from the STScI under NASA contract NAS5-26555.

J.C.L. acknowledges the W.M. Keck Institute for Space Studies (KISS) for its support of PHANGS-HST collaboration meetings where key work for this paper was performed. The PHANGS-HST survey benefited from discussions at the 2014 KISS workshop, "Bridging the Gap: Observations and Theory of Star Formation Meet on Large and Small Scales."

J.M.D.K. and M.C. gratefully acknowledge funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through an Emmy Noether Research Group (grant No. KR4801/1-1) and the DFG Sachbeihilfe (grant No. KR4801/2-1) and from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program via the ERC Starting Grant MUSTANG (grant agreement No. 714907).

F.B. and A.B. acknowledge funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program (grant agreement No. 726384/Empire).

R.S.K. and S.C.O.G. acknowledge financial support from the DFG via the collaborative research center (SFB 881, Project-ID 138713538) "The Milky Way System" (subprojects A1, B1, B2, and B8). They also acknowledge subsidies from the Heidelberg Cluster of Excellence STRUCTURES in the framework of Germany's Excellence Strategy (grant EXC-2181/1-390900948) and funding from the ERC via the ERC Synergy Grant ECOGAL (grant 855130).

K.K., O.E., and F.S. gratefully acknowledge funding from the German Research Foundation (DFG) in the form of an Emmy Noether Research Group (grant No. KR4598/2-1, PI Kreckel).

E.W. acknowledges support from the DFG via SFB 881 'The Milky Way System' (project-ID 138713538; subproject P2).

T.G.W. acknowledges funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program (grant agreement No. 694343).

E.R. acknowledges the support of the Natural Sciences and Engineering Research Council of Canada (NSERC), funding reference number RGPIN-2017-03987.

This paper makes use of the following ALMA data, which have been processed as part of the PHANGS-ALMA survey: ADS/JAO.ALMA#2012.1.00650.S ADS/JAO.ALMA#2013.1.01161.S ADS/JAO.ALMA#2015.1.00925.S ADS/JAO.ALMA#2015.1.00956.S ADS/JAO.ALMA#2017.1.00886.L.

This research has made use of the NASA/IPAC Extragalactic Database (NED) which is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with NASA.

Software: PyRAF (Science Software Branch at STScI 2012), Astrodrizzle (STSCI Development Team 2012), DOLPHOT (v2.0; Dolphin 2016), Photutils (Bradley et al. 2019), CIGALE (Burgarella et al. 2005; Noll et al. 2009; Boquien et al. 2019).

Footnotes

Please wait… references are loading.
10.3847/1538-4365/ac1fe5