PACMAN Control File (.pcf)

To run the different stages of PACMAN, the pipeline requires a so-called PACMAN control file (.pcf) where stage-specific parameters are defined (e.g. aperture size, path to the data, etc.).

In the following, we look at the contents of the pcf and include an example entry.

# This is the pcf file
# NOTE: DO NOT USE SPACES IN LISTS!

# 00
rundir                       /home/zieba/Desktop/Projects/Observations/Hubble/GJ1214_13021 # location of run dir
datadir                      /home/zieba/Desktop/Data/GJ1214_Hubble13021        # location of data dir
suffix                       ima                                                # data suffix (only ima supported right now)
which_visits                 [5,6]                                              # which visits to use; Options: list (e.g., [0,1,3]) or everything

save_obs_times_plot          True
show_obs_times_plot          False


## 02 barycorr
save_barycorr_plot           True                                               # save a plot with locations of HST?
show_barycorr_plot           False


## 03
Teff                         3250                                               # effective temperature of the star
logg                         5.026                                              # surface gravity of the star
MH                           0.29                                               # metallicity of the star
sm                           k93models                                          # stellar model. Options: blackbody, k93models, ck04models or phoenix

smooth                       True                                               # smooth stellar spectrum using a gaussian kernel
smooth_sigma                 441

save_smooth_plot             True
show_smooth_plot             False

save_refspec_plot            True                                               # save a plot showing the stellar model, bandpass and the product of them?
show_refspec_plot            False

## 10
di_rmin                      120                                                # coordinates (row min max and column min max) where the star in the direct image is
di_rmax                      160
di_cmin                      5
di_cmax                      50

save_image_plot              True                                               # save plot with direct image and best fit?
show_image_plot              False

di_multi                     median


## 20
s20_testing                  False
n_testing                    4

rmin                         5
rmax                         261

window                       12

opt_extract                  True
sig_cut                      15                                                 # optimal extraction, for cosmic rays etc
nsmooth                      9                                                  # optimal extraction, created smoothed spatial profile, medial smoothing filter

output                       True

correct_wave_shift           True
correct_wave_shift_refspec   True

background_thld              1000                                               # background threshold in counts

background_box               False
bg_rmin                      100
bg_rmax                      400
bg_cmin                      40
bg_cmax                      100

save_optextr_plot            False

save_sp2d_plot               False                                              # save plot of 2d spectrum
show_sp2d_plot               False

save_trace_plot              False                                              # save plot of trace
show_trace_plot              False

save_bkg_hist_plot           False                                              # save histogram of fluxes 
show_bkg_hist_plot           False

save_utr_plot                False                                              # save plot of individual up-the-ramps
show_utr_plot                False

save_sp1d_plot               False                                              # save plot of 1d spectrum
show_sp1d_plot               False

save_bkg_evo_plot            True                                               # save plot of the background flux over time
show_bkg_evo_plot            False

save_sp1d_diff_plot          False                                              # save difference plot of consecutive 1d spectra
show_sp1d_diff_plot          False

save_utr_aper_evo_plot       True                                               # save plot of aperture size over time
show_utr_aper_evo_plot       False

save_refspec_fit_plot        False                                              # save plot of fit of the 1d spectrum to the refspec
show_refspec_fit_plot        False

save_drift_plot              True                                               # save plot of 1d spectrum drift over time
show_drift_plot              False


##21

s21_most_recent_s20          True
s21_spec_dir_path_s20        None

wvl_min                      1.135
wvl_max                      1.642
wvl_bins                     11

use_wvl_list                 False
wvl_edge_list                [11400,12200,12600,13000,14600,15000,15400,15800,16200]


##30
s30_myfuncs                  ['constant','upstream_downstream','model_ramp','polynomial1','transit']
#s30_myfuncs                  ['constant','upstream_downstream','model_ramp','polynomial1','transit','uncmulti']

s30_fit_white                True
s30_most_recent_s20          True
s30_white_file_path          None

s30_fit_spec                 False
s30_most_recent_s21          False
s30_spec_dir_path            None

remove_first_exp             True
remove_first_orb             True
remove_which_orb             [0]

rescale_uncert               True
rescale_uncert_smaller_1     False

run_clipiters                0
run_clipsigma                0

white_sys_path               None

ld_model                     1
fix_ld                       False
ld_file                      /home/zieba/Desktop/Projects/Open_source/wfc3-pipeline/wfc3_reduction/limb-darkening/results/ld_outputfile.txt

toffset                      2456365

run_verbose                  True

save_allan_plot              True
save_raw_lc_plot             True
save_fit_lc_plot             True

run_lsq                      True
run_mcmc                     True
run_nested                   False

ncpu                         1

#emcee
run_nsteps                   4000
run_nwalkers                 30
run_nburn                    2000

#dynesty static
run_dlogz                    0.01
run_nlive                    400

#dynesty dynamic
run_dynamic                  True
run_dlogz_init               0.01
run_nlive_init               100
run_nlive_batch              50
run_maxbatch                 50

run_bound                    multi
run_sample                   rwalk

Stage 00

rundir

Example: rundir   /home/zieba/Desktop/Projects/Observations/Hubble/GJ1214_13021

The directory where you want PACMAN to run and save data to. If you downloaded or cloned the GitHub repository it includes a run_files directory. These three files can also be downloaded under this link: Download here. You have to copy these files into your run directory. It should include three files:

  • pacman_script.py: The run script

  • obs_par.pcf: The pcf file

  • fit_par.txt: The fit_par file with the fit parameters (only used for Stage 30)

Warning

Your path is not allowed to have any spaces in it. E.g., /home/USER/run 1 is not a valid path.

datadir

Example: datadir   /home/zieba/Desktop/Data/GJ1214_Hubble13021

This path should be correspond to the location of your data.

Warning

Your path is not allowed to have any spaces in it. E.g., /home/USER/data GJ1214 is not a valid path.

suffix

Example: suffix   ima

The only extension which is supported currently: ima. From the WFC3 data handbook (Types of WFC3 Files): “For the IR detector, an intermediate MultiAccum (ima) file is the result after all calibrations are applied (dark subtraction, linearity correction, flat fielding, etc.) to all of the individual readouts of the IR exposure.”

which_visits

Example: which_visits   [0,2]
Example: which_visits   everything

If your datadir contains several HST observations (called visits), you can select which ones to analyze. If you are interested in all visits in datadir, use everything here.

PACMAN automatically numbers the visits the datadir in chronological order, starting with zero. For example, HST GO 13021 has 15 visits in total. If you have all 15 visits in your datadir but only want to analyze the last two visits for now, you would enter [13,14] here. If your datadir only contained these two visits (and not the previous 13 visits before it), you can either write everything or [0,1].

save_obs_times_plot/show_obs_times_plot

Example: True This plot consists of one table and two subplots.

  • The table lists the number of orbits in each individual visit and the start time of the visit.

  • The left subplot shows when the visit was observed.

  • The right subplot shows when observations where taken during an visit as a function of time elapsed since the first exposure in the visit.

_images/obs_dates_all.png

If the user didn’t set which_visits   everything, two figures like this will be generated. One with all visits and the other with only the ones set with which_visits.

Stage 02

save_barycorr_plot/show_barycorr_plot

Example: True

Saves or shows a plot with the downloaded X,Y,Z positions of HST from the HORIZONS system by JPL during the observations.

_images/bjdcorr_horizons_results_v0.png

Stage 03

Teff, logg, MH

Example: Teff  3250
Example: logg  5.026
Example: MH    0.29

effective Temperature (Teff), surface gravity (logg) and metallicity (MH) of the star.

Used to generate the stellar model. If the user only wants to use a blackbody spectrum, the code ignores logg and metallicity.

sm

Example: phoenix

The stellar model one wants to use. It will be then multiplied with the grism throughput (either G102 or G141) to create a reference spectrum for the wavelength calibration of the spectra.

Options:

PACMAN currently offers the following stellar models:
- k93models: THE 1993 KURUCZ STELLAR ATMOSPHERES ATLAS
- ck04models: THE STELLAR ATMOSPHERE MODELS BY CASTELLI AND KURUCZ 2004
- phoenix: THE PHOENIX MODELS BY FRANCE ALLARD AND COLLABORATORS
- blackbody: A blackbody spectrum using Planck’s law

The stellar models (exluding the blackbody) are retrieved from https://archive.stsci.edu/hlsps/reference-atlases/cdbs/grid/

smooth/smooth_sigma

Example: smooth        True
Example: smooth_sigma  50

If smooth is True, applies a Gaussian kernel smoothing to the stellar spectrum. This is recommended since the Kurucz and Phoenix stellar models have a higher resolution than the WFC3 grisms, which have native resolution of:

  • G141: 46.9 Angstrom/pixel dispersion

  • G102: 24.6 Angstrom/pixel dispersion

Gaussian smoothing the reference spectrum is discussed in detail in Deming et al. 2013. To follow their example and convolve the reference spectrum with a Gaussian with full-width at half maximum (FWHM) of 4 pixels, we use the relation FWHM = 2.35 * sigma. The value for the G141 grism is then 2.35 * 4 * 46.9 = 441

save_smooth_plot/show_smooth_plot

Example: True

Shows how the stellar spectrum was smoothed.

_images/smooth.png

save_refspec_plot/show_refspec_plot

Example: True

Saves or shows a plot of the reference spectrum (stellar spectrum * bandpass).

_images/refspec.png

Stage 10

di_rmin, di_rmax, di_cmin, di_cmax

Example: di_rmin  120
Example: di_rmax  160
Example: di_cmin  5
Example: di_cmax  50

These values specify a cutout around the target star. Below you will find an example:

_images/quick_di0.png

You can see that the cutout (red box in the plot) includes the star, so the centroid can be determined.

save_image_plot/show_image_plot

Example: True

Saves two plots for every direct image. The first one just shows the image with the location of the direct image cutout. The second plot shows the cutout on the left and the result of the 2D gaussian fit on the right. The gaussian fit only considers the cutout for the fit.

_images/quick_di0.png _images/di_0.png

di_multi

Example: di_multi  median
Example: di_multi  latest

Options: median or latest

Some observations have more than one direct image per orbit which were taken at the start of the orbit. In these cases the user can decide if they want to only use the most recent DI or a median of the DI positions in the orbit.

Stage 20

s20_testing/n_testing

Example: s20_testing  True
Example: n_testing    1

Runs s20 in testing mode. Only the first n_testing files will be analyzed then. E.g. if n_testing = 1, only the first file will be analyzed.

rmin/rmax

Example: rmin  5
Example: rmax  261

Can be set to remove rows from the top and bottom of the 2D array. E.g. If the frame has the size of 266x266 and the user wants to cut off the upper and lower 5 pixels they can use the same settings as in the example above. Typically, for a 266x266 array, the user would set rmin  5 and rmax  261. For 522x522 array, rmin  5 and rmax  517.

window

Example: window  10

Sets the size of the extraction window. PACMAN adaptively determines the best aperture in two steps:

  1. identify the rows with the largest gradient in count rate

  2. add window additional rows above and below the rows identified in step 1

For example, if the biggest flux gradient occurs in row 35 and 55, and the user sets window  10, the extraction aperture will be between rows 25 and 65.

background_thld

Example: background_thld  1000

Sets a threshold for the background calculation. Pixels with a flux lower than background_thld electrons/second will be considered background. The background flux is then determined by taking the median flux of the pixels below this threshold.

opt_extract

Example: opt_extract  True

Extracts the spectrum using the optimal extraction routine from Horne et al. 1986. If set to false, PACMAN performs a quick box extraction (simply adding up all the counts in the aperture). This gives a quick look at the spectra, but the following stages will break if opt_extract is false.

sig_cut, nsmooth

Example: sig_cut  15
Example: nsmooth  9

sig_cut: Specifies the outlier threshold for the optimal extraction. Outliers greater than sig_cut are masked.

If you have a lot of nans in your light curve you might want to increase the sig_cut value.

smooth: Number of pixels used for median-smoothing to create the spatial profile.

save_optextr_plot

Save plot showing some diagnostics from the optimal extraction.

_images/optextr0-0.png

correct_wave_shift

Example: correct_wave_shift  True

Interpolates each spectrum to the wavelength scale of the reference spectrum, to account for spectral drift over the observation.

correct_wave_shift_refspec

uses the created reference spectrum during Stage 03 for the wavelength calibration.

output

Example: output  True

Saves the flux as a function of time and wavelength.

background_box & bg_rmin etc.

Do you want to calculate the median flux in a box to use as an estimate of the background flux?

Example: background_box               False
Example: bg_rmin                      100
Example: bg_rmax                      400
Example: bg_cmin                      40
Example: bg_cmax                      100

save_sp2d_plot/show_sp2d_plot

2D spectrum of the target with a low vmax value to see the background better.

_images/sp2d_0.png

save_trace_plot/show_trace_plot

2D spectrum with the expected position of the trace based on the direct image.

_images/trace_0.png

save_bkg_hist_plot/show_bkg_hist_plot

Shows a histogram of the flux based on the difference of two consecutive up-the-ramp samples. This is why the user might see also negative fluxes.

_images/bkg_hist0-0.png

save_utr_plot/show_utr_plot

Shows an up-the-ramp sample with the determined highest flux changes between two rows.

_images/utr0-0.png

save_sp1d_plot/show_sp1d_plot

The 1D spectrum. The optimal extraction is shown in black, and the box extraction is shown in red.

_images/sp1d_0.png

save_bkg_evo_plot/show_bkg_evo_plot

The determined background flux versus time.

_images/bkg_evo.png

save_sp1d_diff_plot/show_sp1d_diff_plot

The difference between two consecutive 1D spectra.

_images/sp1d_diff_0.png

save_utr_aper_evo_plot/show_utr_aper_evo_plot

The determined aperture size over time. In this case the rows with the highest change in flux were either 7 or 8 pixels apart.

_images/utr_aper_evo.png

save_refspec_fit_plot/show_refspec_fit_plot

The fit of the 1D spectrum compared to the reference spectrum. In the first exposure in a visit, this reference spectrum is the product of the stellar spectrum with the grism throughput.

_images/refspec_fit_0.png

In the other exposures, the reference spectrum is the wavelength-calibrated first exposure in a visit.

_images/refspec_fit_1.png

save_drift_plot/show_drift_plot

The fitted wavelength calibration parameters over time. We do a linear fit (a + b * wavelength) to the reference spectrum (which can be either a stellar model * instrument throughput or the first exposure in a visit depending on what the user chose in correct_wave_shift_refspec) and fit for the height. If a reference spectrum was a stellar model * the throughput, the first fit parameters of the fit of the 1d to the sm*throughput wont be shown in that plot.

First panel: a Second panel: b Third panel: height

_images/drift.png

Stage 21

s21_most_recent_s20/s21_spec_dir_path_s20

Example: s21_most_recent_s20    True
Example: s21_spec_dir_path_s20  None

If s21_most_recent_s20 is set to True the most recent s20 run will be used. If s21_most_recent_s20 is set to False, the user can set a path with the extracted data after s20:

Example: ``s21_most_recent_s20 False ``
Example: s21_spec_dir_path_s20  /home/zieba/Desktop/Projects/Open_source/PACMAN/run/run_2022-01-25_19-12-59_GJ1214_Hubble13021/extracted_lc/2022-02-11_17-44-56

wvl_min/wvl_max/wvl_bins

Example: wvl_min   1.125
Example: wvl_max   1.65
Example: wvl_bins  12

Start and end wavelengths for the spectroscopic light curves (microns). The wvl_bins parameter sets the number of wavelength channels.

use_wvl_list/wvl_edge_list

Example: use_wvl_list   True
Example: wvl_edge_list  [1.1,1.3,1.5,1.7]

If the user wants to use a custom wavelength list for the binning, set use_wvl_list to True. The units here are always microns.

There are two formats for a custom wvl_edge_list:

If entering a 1D array like wvl_edge_list  [1.1,1.3,1.5,1.7], PACMAN will always create NON-OVERLAPPING bins. In this example, you would get 3 bins covering the following ranges: 1.1-1.3, 1.3-1.5, 1.5-1.7.

If entering a 2D array like wvl_edge_list  [[1.0579,1.0677],[1.0604,1.0701],[1.0628,1.0726]], you will get the following three bins: 1.0579-1.0677, 1.0604-1.0701, 1.0628-1.0726. This might be useful if you want to create OVERLAPPING bins. An example for that is searching for Helium with the G102 grism around 1083 nm, like in Spake et al., 2018.

Make sure you do not use spaces in any of these lists, otherwise it’s not readable to the code.

Stage 30

s30_myfuncs

Choose the functions to fit the data. The available functions are listed in models.

s30_fit_white

Fit the white light curve created in Stage 20.

s30_most_recent_s20

Use the most recent Stage 20 run for the white light curve fit.

s30_white_file_path

If s30_most_recent_s20 was set to False, the user can put a path to the white light curve file here.

s30_fit_spec

Fit the spectroscopic light curves created in Stage 21.

s30_most_recent_s21

Use the most recent Stage 21 run for the spectroscopic light curve fit.

s30_spec_dir_path

If s30_most_recent_s21 was set to False, the user can put a path here.

remove_first_exp

Removes the first exposure from every orbit.

remove_first_orb

Removes the first orbit from every visit.

remove_which_orb

Which orbits do you want to remove? E.g., if only the first choose [0]. If the first two, use [0,1].

rescale_uncert

Rescales the uncertainties for the sampler (MCMC or nested sampling), so that the reduced chi2red = 1. Note: This only happens if chi2red < 1 after least squared fit.

An alternative is to use uncmulti as a model in your fit. This will rescale the errorbars at every step of the sampler. The uncmulti value which the user will get after the sampling should be approximately the square root of the reduced chi square of the fit (i.e., uncmulti ~ sqrt(chi2_red)).

rescale_uncert_smaller_1

If this is True, PACMAN will also rescale the errorbars if chi2_red < 1, so that chi2_red = 1. This parameter was motivated by Colón+2020 (https://arxiv.org/abs/2005.05153): “For light curves where the best fit had a reduced chi-squared (chi2_red) greater than unity, we rescaled the per point uncertainties to achieve chi2_red = 1 before we ran the MCMC.”

So if one wants the same setup as mentioned in Colon+2020, one has to set rescale_uncert to True and rescale_uncert_smaller_1 to False.

run_clipiters

NOT TESTED

run_clipsigma

NOT TESTED

white_sys_path

Location of the white_systematics.txt file. Only needed if the user wants to use ‘divide_white’ as a model.

ld_model

Example: ld_model  1

Options:

1 = “linear” limb darkening

2 = “quadratic” limb darkening

kipping2013 = quadratic limb darkening in the Kipping2013 parameterization.

fix_ld

If true, PACMAN will use ExoTiC-LD to calculate limb darkening parameters and fix them during the fits.

ld_file

NOT TESTED but should give the possibility to use your own limb darkening file.

toffset

Subtracts an offset from the time stamps so that there is no problem with floating precision due to the size of dates in BJD.

run_verbose

If set to True, additional information is being outputted when running.

save_allan_plot

The Allan deviation plot.

_images/corr_plot_bin0_wvl1.400.png

save_raw_lc_plot

Raw light curve plot

_images/raw_lc_bin0_wvl1.400.png

save_fit_lc_plot

Plots the light curve fit without the instrumental systematics and only the astrophysical signal.

_images/lsq_lc_bin0_wvl1.400.png

run_lsq

Runs the least square routine. Has to be True currently.

run_mcmc

Runs an MCMC using the emcee package.

run_nested

Runs nested sampling using the dynesty package.

ncpu

Number of cores used by emcee or dynesty. Use ncpu = 1, if you don’t want to use parallelization.

run_nsteps/run_nwalkers/run_nburn

Parameters for emcee.

nwalkers is explained here in the emcee API and run_nsteps here.

run_dlogz/run_nlive etc

Example: run_dlogz                    0.01
Example: run_nlive                    400
Example: run_dynamic                  False
Example: run_dlogz_init               0.01
Example: run_nlive_init               80
Example: run_nlive_batch              100
Example: run_maxbatch                 20
Example: run_bound                    multi
Example: run_sample                   auto

Parameters for dynesty.

If run_dynamic = False, then dynesty’s static mode will be used. The dynamic parameters (run_dlogz_init, run_nlive_init, run_nlive_batch, run_maxbatch, run_bound, run_sample) are well-explained in the dynesty API and in this tutorial The static parameters (run_dlogz, run_nlive, run_bound, run_sample) are expained in the dynesty API too.

The bound and sample parameters which are used by both the static and the dynamic mode are explained here.

lc_type

Currently not used.