GWFish Tutorial#

This is a brief guide on how to use the GWFish software.

#! pip install -q git+https://github.com/janosch314/GWFish.git
#! pip install -q lalsuite
#! pip install -q corner

Settings for working locally with GWFish#

Install GWFish in a conda environment#

It is advisable to create a dedicated conda environment in which to install the GWFish modules

conda create --name gwfish_env python=3.9
conda activate gwfish_env

To make the GWFish modules available from any location on your machine, first clone the repository

git clone git@github.com:janosch314/GWFish.git

then, from the repository folder and with the conda environment activated, execute the command

pip install .

The following packages need to be installed as well:

pip install lalsuite
pip install corner
pip install tqdm
pip install pandas
pip install tables
pip install sympy

For numpy a version below 2.0 is preferred; since numpy is usually installed automatically with the conda environment, the suggestion is to uninstall it if the version is \(\ge 2\) and downgrade it:

pip uninstall numpy
pip install numpy==1.25
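
A quick check, from within the environment, that the pinned version is in place (a minimal snippet, not part of GWFish):

import numpy
print(numpy.__version__)  # should start with "1.", e.g. 1.25.x
assert numpy.__version__.startswith("1.")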

What is the code about#

GWFish is a Fisher-matrix code: it calculates the covariance matrix for a gravitational-wave (GW) event in a very small amount of computational time (of the order of seconds). Let’s take a step back and briefly recap how parameter estimation (PE) is done in the GW field.

Starter: GW data analysis#

Bayesian gravitational wave data analysis is used to infer the properties of gravitational wave sources and make predictions about their parameters (see below for a detailed list of the parameters in question). It combines the principles of Bayesian statistics with the analysis of gravitational wave signals detected by ground-based observatories like LIGO and Virgo.

The goal of Bayesian gravitational wave data analysis is to extract this information from the noisy gravitational wave signals detected by the observatories:

\[s(t) = h_0(t) + n(t)\]

where \(h_0(t)\) is the true (unknown) signal and \(n(t)\) is the detector noise, assumed to be Gaussian and stationary.

Mathematically, Bayes’ theorem can be expressed as:

\[p(\vec{\theta}|s) \propto \pi(\vec{\theta}){\mathcal{L}}(s|\vec{\theta})\]

where \(p(\vec{\theta}|s)\) is the posterior distribution, \({\mathcal{L}}(s|\vec{\theta})\) is the likelihood function, \(\pi(\vec{\theta})\) is the prior distribution, and we neglected the evidence, or marginal likelihood, in the denominator.

To perform Bayesian gravitational wave data analysis, we use various techniques such as Markov Chain Monte Carlo (MCMC) sampling and nested sampling. The most widely used software is bilby, and full Bayesian parameter estimation is typically computationally expensive.

SNR Calculation#

When a gravitational-wave (GW) signal passes through a detector, the observed strain is

\[ s(t) = h_0(t) + n(t) \]

where \(h_0(t)\) is the true GW signal, and \(n(t)\) is the detector noise.
We assume \(n(t)\) is a zero-mean, stationary, Gaussian random process characterized by its power spectral density (PSD):

\[ \langle \tilde{n}(f) \tilde{n}^*(f') \rangle = \tfrac{1}{2} S_n(f) \delta(f - f') \]

where tildes denote Fourier transforms. The PSD \(S_n(f)\) quantifies the average noise power per unit frequency.
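
To make this concrete, here is a minimal sketch of how a one-sided PSD can be estimated from a noise stream with scipy.signal.welch; the sampling rate, duration and segment length are arbitrary illustrative choices, and the white-noise stand-in is not a real detector noise model:

import numpy as np
from scipy.signal import welch

fs = 4096.0                                                  # sampling rate [Hz]
noise = np.random.default_rng(1).normal(size=int(64 * fs))   # white-noise stand-in for n(t)
freqs, psd = welch(noise, fs=fs, nperseg=int(4 * fs))        # one-sided S_n(f) estimate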


Matched Filtering#

To extract a signal buried in noise, we use matched filtering — a technique that maximizes the signal-to-noise ratio (SNR) by correlating the data with a template waveform \(h(t)\).
The (complex) filter output is defined as

\[ z = \int_{-\infty}^{+\infty} df \, \tilde{s}(f) \tilde{K}^*(f) \]

where \(\tilde{K}(f)\) is the frequency response of the filter.

If only noise is present, \(s(t) = n(t)\), and the variance of the filter output is

\[ \langle |z|^2 \rangle = \int_{-\infty}^{+\infty} df \, \tfrac{1}{2} S_n(f) |\tilde{K}(f)|^2 \]

This defines the noise power in the filtered data.


Defining the SNR#

We can define an inner product between two functions \(a(t)\) and \(b(t)\) as

\[ \langle a | b \rangle \equiv 4 \, \mathrm{Re} \int_{f_{\min}}^{f_{\max}} \frac{\tilde{a}(f) \tilde{b}^*(f)}{S_n(f)} \, df \]

where the integral is restricted to the detector’s sensitive band.
This weighting by \(1/S_n(f)\) gives less importance to noisy frequency regions.

The SNR for a filter \(K\) and template \(h\) can be expressed as

\[ \frac{S}{N} = \frac{\langle u | h \rangle}{\sqrt{\langle u | u \rangle}} \]

where the function \(u\) is related to the filter by \(\tilde{u}(f) = \tfrac{1}{2} S_n(f) \tilde{K}(f)\).


Optimal Filter and Maximum SNR#

Using the Cauchy–Schwarz inequality, the SNR is maximized when \(u\) is proportional to \(h\), which implies the optimal filter is

\[ \tilde{K}(f) \propto \frac{\tilde{h}(f)}{S_n(f)} \]

This filter weights each frequency according to its signal strength relative to the noise.

Substituting this back gives the maximum achievable SNR:

\[ \left(\frac{S}{N}\right)_{\rm opt} = \sqrt{\langle h | h \rangle} = \left[ 4 \int_{f_{\min}}^{f_{\max}} \frac{|\tilde{h}(f)|^2}{S_n(f)} \, df \right]^{1/2} \]
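
In discretized form this integral is a simple quadrature; the following numpy sketch is illustrative (the function name and inputs are assumptions, not part of GWFish):

import numpy as np

def optimal_snr(h_tilde, psd, frequencies):
    # rho_opt = sqrt( 4 * int |h(f)|^2 / S_n(f) df ), via the trapezoidal rule
    integrand = 4.0 * np.abs(h_tilde) ** 2 / psd
    return np.sqrt(np.trapz(integrand, frequencies))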

Detection Criterion#

A GW signal is considered detectable if its optimal SNR exceeds a chosen threshold — typically around

\[ \rho_{\rm det} \simeq 8 \]

which corresponds to a low false-alarm probability in Gaussian noise.

GW parameters#

The parameters describing a GW event are the following (listed with the same nomenclature as used in the code):

  • chirp_mass: chirp mass of the binary in [Msol] (in detector frame): \({\mathcal{M}}_{\rm chirp} = \frac{\left(m_1 m_2\right)^{3/5}}{\left(m_1 + m_2\right)^{1/5}}\)

  • mass_ratio: ratio of the secondary mass over the primary mass, so that it ranges in \([0,1]\)

  • redshift: the redshift of the merger

  • luminosity_distance: the luminosity distance of the merger in [Mpc]: \(d_L = (1+z)\, c \int_0^z \frac{\mathrm{d} z^{\prime}}{H\left(z^{\prime}\right)}\)

  • theta_jn: the angle between the line of sight and the total angular momentum (orbital plus spin) of the binary in [rad] (it reduces to the so-called inclination angle \(\iota\) if the spins are absent); it ranges in \([0, \pi]\)

  • dec: declination angle in [rad], it varies in \([-\pi/2, +\pi/2]\)

  • ra: right ascension in [rad], it varies in \([0, 2\pi]\)

  • psi: the polarization angle in [rad]; it ranges in \([0, \pi]\). It enters the detector response as \(h(t)=F_{+}(\theta, \phi, \psi)\, h_{+}(t)+F_{\times}(\theta, \phi, \psi)\, h_{\times}(t)\) (a sketch of the antenna patterns \(F_{+}\), \(F_{\times}\) follows the parameter list). For an L-shaped detector:

  • psi = 0 → + and × aligned with the detector arms

  • psi = π/4 → rotated by 45°, i.e. equal mix of + and ×

  • phase: the initial phase of the merger in [rad]; it ranges in \([0, 2\pi]\)

  • geocent_time: merger time as GPS time in [s]; the GPS epoch starts on January 6, 1980

  • a_1: dimensionless spin parameter of primary component; it ranges in \([0, 1]\)

  • a_2: dimensionless spin parameter of secondary component; it ranges in \([0, 1]\)

  • tilt_1: zenith angle between the spin and orbital angular momenta for the primary component in [rad]; it ranges in \([0, \pi]\)

  • tilt_2: zenith angle between the spin and orbital angular momenta for the secondary component in [rad]; it ranges in \([0, \pi]\)

  • phi_12: difference between the azimuthal angles of the individual spin vector projections onto the orbital plane in [rad]; it ranges in \([0, 2\pi]\)

  • phi_jl: difference between the total and orbital angular momentum azimuthal angles in [rad]; it ranges in \([0, 2\pi]\)

  • lambda_1: dimensionless tidal polarizability of the primary component

  • lambda_2: dimensionless tidal polarizability of the secondary component

\[\Lambda_i=\frac{2}{3}k_2\left(\frac{R_i c^2}{G m_i}\right)^5\]

Here \(k_2\) is the second tidal Love number and \(R_i\) the stellar radius. The lambda_1 and lambda_2 parameters apply to neutron stars only; their values span from a few hundred to about a thousand.
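
As a rough numerical check of this scaling (the Love number and radius below are assumed typical values, not GWFish defaults):

G = 6.674e-11; c = 2.998e8; Msun = 1.989e30
k2 = 0.1                 # typical second tidal Love number (assumed)
R = 12e3                 # neutron-star radius in [m] (assumed)
m = 1.4 * Msun
Lambda = 2 / 3 * k2 * (R * c**2 / (G * m)) ** 5
print(Lambda)            # ~4e2, i.e. "a few hundred" for a typical neutron star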


See https://arxiv.org/pdf/2404.16103 for an exhaustive description of the parameters.
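
For completeness, here is a minimal sketch of the antenna-pattern functions referenced in the psi entry above, using the textbook expressions for an L-shaped interferometer (this is not GWFish's internal implementation):

import numpy as np

def antenna_patterns(theta, phi, psi):
    # Standard F+ and Fx for an L-shaped detector (source angles theta, phi; polarization psi)
    fplus = 0.5 * (1 + np.cos(theta) ** 2) * np.cos(2 * phi) * np.cos(2 * psi) \
            - np.cos(theta) * np.sin(2 * phi) * np.sin(2 * psi)
    fcross = 0.5 * (1 + np.cos(theta) ** 2) * np.cos(2 * phi) * np.sin(2 * psi) \
             + np.cos(theta) * np.sin(2 * phi) * np.cos(2 * psi)
    return fplus, fcross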

Note on mass conventions#

In GWFish the input masses can be passed in the following different combinations (the combination has to be the same when selecting the fisher_params):

  • chirp_mass (in detector-frame and in [Msol]) and mass_ratio

  • chirp_mass_source (in source-frame and in [Msol]) and mass_ratio

  • mass_1 and mass_2, both in detector-frame and in [Msol]

  • mass_1_source and mass_2_source, both in source-frame and in [Msol]

Note that when masses are passed to lal they are always converted into mass_1 and mass_2 (in [kg]), since lal always takes masses in the detector frame!
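
For reference, a minimal sketch of the conversions involved (illustrative helpers, not GWFish's internal routines):

import numpy as np

def chirp_mass_q_to_m1_m2(chirp_mass, mass_ratio):
    # Invert Mc = (m1 m2)^(3/5) / (m1 + m2)^(1/5) with q = m2/m1 <= 1
    m1 = chirp_mass * (1 + mass_ratio) ** (1 / 5) / mass_ratio ** (3 / 5)
    m2 = m1 * mass_ratio
    return m1, m2

def source_to_detector_frame(mass_source, redshift):
    # Detector-frame masses are redshifted: m_det = (1 + z) * m_source
    return (1 + redshift) * mass_source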

Why Fisher analysis?#

When studying the performance of a new detector such as the Einstein Telescope, which has a much improved sensitivity and is predicted to detect entire populations of events (\(10^6\) events per year against the tens we currently detect), we want a tool that makes forecasts in a reasonable amount of time. Since a fast full parameter estimation is not yet available, the Fisher-matrix approximation is the state of the art for forecasts.

Let’s start!#

The implementation of a Fisher matrix code relies on three main pillars:

  1. Analytic waveform approximation: GWFish uses all the waveforms from lalsimulation in the frequency domain (although it is also possible to work in the time domain)

  2. Derivatives: these are computed numerically at second order (except for some parameters, like distance, phase and time, for which they are straightforward analytically); see the sketch after this list

  3. Matrix inversion: singular value decomposition and normalization is used to safely invert the Fisher matrix
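
As an illustration of the second pillar, a second-order (central) finite difference of the waveform with respect to a single parameter could look like the sketch below; the names are assumptions, and this is not GWFish's actual implementation:

import numpy as np

def waveform_derivative(waveform_func, params, key, step=1e-7):
    # Central difference: dh/dtheta ~ [h(theta + dx) - h(theta - dx)] / (2 dx)
    up, down = dict(params), dict(params)
    up[key] += step
    down[key] -= step
    return (waveform_func(up) - waveform_func(down)) / (2 * step)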

Import packages#

# suppress warning outputs when using lal in a jupyter notebook
import warnings 
warnings.filterwarnings("ignore", "Wswiglal-redir-stdio")

import GWFish.modules as gw
from tqdm import tqdm
import matplotlib
import matplotlib.pyplot as plt
import corner
import numpy as np
import pandas as pd
import json
import os
from astropy.cosmology import Planck18
# plotting settings
matplotlib.rcParams.update({
    'font.size': 10,
    'axes.labelsize': 10,
    'axes.titlesize': 10,
    'xtick.labelsize': 10,
    'ytick.labelsize': 10,
    'legend.fontsize': 10,
    'figure.titlesize': 10
})

Single Event Analysis: GW170817-like#

# Event's parameters should be passed as Pandas dataframe

parameters = {
    'chirp_mass': np.array([1.1858999987203738]) * (1 + 0.00980), 
    'mass_ratio': np.array([0.8308538032620448]), 
    'luminosity_distance': Planck18.luminosity_distance(0.00980).value,
    'theta_jn': np.array([2.545065595974997]),
    'ra': np.array([3.4461599999999994]),
    'dec': np.array([-0.4080839999999999]),
    'psi': np.array([0.]),
    'phase': np.array([0.]),
    'geocent_time': np.array([1187008882.4]),
    'a_1':np.array([0.005136138323169717]), 
    'a_2':np.array([0.003235146993487445]), 
    'lambda_1':np.array([368.17802383555687]), 
    'lambda_2':np.array([586.5487031450857])}
parameters = pd.DataFrame(parameters)
parameters
chirp_mass mass_ratio luminosity_distance theta_jn ra dec psi phase geocent_time a_1 a_2 lambda_1 lambda_2
0 1.197522 0.830854 43.747554 2.545066 3.44616 -0.408084 0.0 0.0 1.187009e+09 0.005136 0.003235 368.178024 586.548703
# We choose a waveform approximant suitable for BNS analysis
# In this case we are taking into account tidal polarizability effects
waveform_model = 'IMRPhenomD_NRTidalv2'
f_ref = 10.

⚠️ Note:#

The default reference frequency f_ref is set to \(50\) Hz; the user can pass a different value together with the waveform_model parameter (see the examples below)

Play with the signal#

# To access the available detectors, use the following command
gw.utilities.get_available_detectors()
dict_keys(['ET', 'VOY', 'CE1', 'CE2', 'LLO', 'LHO', 'VIR', 'VIR_O2', 'KAG', 'LIN', 'LISA', 'LGWA', 'LGWA_Nb', 'LGWA_Soundcheck', 'LLO_#', 'LHO_#', 'VIR_#'])

A quick note on detector setup#

Detectors are all described in the .yaml file. The general settings are as follows (in case you want to customize your own detector):

ET: # name label of the detector
            lat:              (40 + 31. / 60 ) * np.pi / 180.
            lon:              (9 + 25. / 60) * np.pi / 180.
            opening_angle:    np.pi / 3.
            azimuth:          70.5674 * np.pi / 180. #angle between the main arm and the north, clockwise 
            psd_data:         ET_psd.txt # file containing two columns: frequency, psd
            duty_factor:      0.85
            detector_class:   earthDelta # for a triangle-shaped detector, or earthL for an L-shaped detector
            plotrange:        3, 1000, 1e-25, 1e-20
            fmin:             2. # minimum frequency of the detector
            fmax:             2048. # maximum frequency of the detector
            spacing:          geometric
            df:               1./16.
            npoints:          1000
            arm_length:       10000   

The spacing variable can be either geometric (logarithmic spacing of the frequency vector used for waveform evaluation, with the number of points specified by the npoints variable; the faster option) or linear (linear spacing of the frequency vector with step df; the slower option).
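
For intuition, the two options correspond to frequency grids like the following (a minimal numpy illustration using the example values from the .yaml snippet above):

import numpy as np

fmin, fmax = 2.0, 2048.0
geometric = np.geomspace(fmin, fmax, 1000)    # 'geometric': npoints log-spaced samples
linear = np.arange(fmin, fmax, 1. / 16.)      # 'linear': fixed step df (many more points)
print(geometric.size, linear.size)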

Plot the signal#

Let’s start by getting a feeling for how the signal appears

# Choose the detector onto which you want to project the signal
detector = 'ET'
plt.figure(figsize=(4, 3), dpi=200)
# The following function outputs the signal projected onto the chosen detector

waveform_model_1 = 'IMRPhenomD_NRTidalv2'
signal_1, _ = gw.utilities.get_fd_signal(parameters, detector, waveform_model_1, f_ref) # waveform_model and f_ref are passed together

waveform_model_2 = 'TaylorF2'
signal_2, _ = gw.utilities.get_fd_signal(parameters, detector, waveform_model_2, f_ref) # waveform_model and f_ref are passed together

frequency = gw.detection.Detector(detector).frequencyvector[:, 0]
# This signal has three components since ET comprises three detectors, let's plot one of them
plt.loglog(frequency, np.abs(signal_1[:, 0]), label='%s' %waveform_model_1)
plt.loglog(frequency, np.abs(signal_2[:, 0]), label='%s' %waveform_model_2)

plt.legend()
plt.xlabel('Frequency [Hz]')
plt.ylabel('ASD [Hz$^{-1/2}$]')
plt.grid(linestyle='dotted', linewidth='0.6', which='both')
plt.show()
[Figure: projected ET signal amplitude vs frequency for the two waveform models]

Gravitational Wave Waveform Approximants Overview#

This guide provides a comprehensive overview of waveform approximants available in LALSimulation for gravitational wave analysis.

Reference: LALSimulation Documentation


Basic Difference: IMRPhenom vs SEOBNR#

  • IMRPhenom (IMRP)

    • Phenomenological waveform fitted to numerical relativity (NR) + post-Newtonian (PN) results

    • Mainly frequency-domain

    • Fast to compute, covers inspiral-merger-ringdown (IMR)

    • Good for typical spins/mass ratios, less accurate in extreme cases

  • SEOBNR (SEOB)

    • Effective-One-Body (EOB) waveform based on analytical GR, calibrated to NR

    • Mainly time-domain

    • More accurate for high spins, precession, eccentricity, and higher modes

    • Slower to compute than IMRPhenom

Summary Table:

| Feature  | IMRPhenom (IMRP)           | SEOBNR (SEOB)                       |
|----------|----------------------------|-------------------------------------|
| Basis    | Phenomenological fit       | Analytical EOB + NR calibration     |
| Domain   | Frequency-domain           | Time-domain                         |
| Speed    | Fast                       | Slower                              |
| Accuracy | Good for common cases      | High, especially for extreme cases  |
| Use      | Quick parameter estimation | High-accuracy studies / GR tests    |

Time Domain Waveforms#

  • TaylorT1: PN time-domain approximant; the energy-balance equation is integrated numerically as it stands

  • TaylorT2: PN time-domain approximant; the phasing is obtained from perturbatively re-expanded parametric expressions

  • TaylorT3: PN time-domain approximant; the phase is given as an explicit function of time

  • TaylorT4: PN time-domain approximant; the re-expanded energy-balance equation is integrated numerically

  • SEOBNRv4: Effective-one-body waveform with aligned spins (v4)

  • SEOBNRv5: Effective-one-body waveform with aligned spins (v5)

Frequency Domain Waveforms#

  • TaylorF2: PN frequency-domain waveform (most commonly used PN)

  • IMRPhenomA: Phenomenological IMR waveform (non-spinning)

  • IMRPhenomB: Phenomenological IMR waveform (aligned spins)

  • IMRPhenomD: Phenomenological IMR waveform (aligned spins, BBH)

  • IMRPhenomD_NRTidalv2: IMRPhenomD + tidal effects for BNS

Binary Neutron Star (BNS) Specific#

  • IMRPhenomD_NRTidalv2: Includes tidal deformability

  • IMRPhenomPv2_NRTidal: Tides + precession

  • SEOBNRv4_ROM_NRTidalv2: EOB reduced order model with tides

Higher Modes & Precession#

Precession is necessary when the spins are large and misaligned with respect to the orbital angular momentum. Higher modes (beyond the quadrupole approximation) are necessary for large mass ratios and large total masses.

  • IMRPhenomHM: Higher multipole moments (l=2,3,4)

  • IMRPhenomXHM: Next-gen higher modes

  • IMRPhenomXPHM: Precessing + higher modes

  • SEOBNRv4PHM: EOB with precession and higher modes


NSBH (Neutron Star–Black Hole) Systems#

  • Typical properties: Large mass ratio, often misaligned spins, and tidal effects from the neutron star.

  • Recommended waveforms:

    • IMRPhenomNSBH — frequency-domain NSBH model with aligned spins and tidal/disruption effects.

    • IMRPhenomPv2_NRTidal — precessing waveform with tidal deformability (a good general-purpose choice, but not if the BH–NS mass ratio is very large).

    • SEOBNRv4_ROM_NRTidalv2 — EOB reduced-order model with tides, accurate but slower.

  • When to include:

    • Precession: If BH spin is tilted.

    • Higher modes: For large mass ratios (≥ 1:5) or edge-on systems.


Time as a function of frequency#

\[ t(f) = t_c -\frac{5}{256 \pi^{8/3}} \cdot \frac{1}{\mathcal{M}_c^{5/3}} \cdot \frac{1}{f^{8/3}} \]
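The relation above is written in geometric units (\(G = c = 1\)); restoring the factors of \(G\) and \(c\), a quick numerical check for a GW170817-like chirp mass (illustrative values, not GWFish code):

import numpy as np

G = 6.674e-11; c = 2.998e8; Msun = 1.989e30

def time_to_merger(f, chirp_mass_msun):
    # Leading-order t_c - t(f) in seconds; the chirp mass is converted to seconds via G*Mc/c^3
    mc_seconds = chirp_mass_msun * Msun * G / c ** 3
    return 5. / 256. * (np.pi * f) ** (-8. / 3.) * mc_seconds ** (-5. / 3.)

print(time_to_merger(10., 1.2) / 3600)  # ~0.3 hours in band from 10 Hz for a BNS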
# Plot the time before the merger as a function of the frequency
_, t_of_f = gw.utilities.get_fd_signal(parameters, detector, waveform_model, f_ref)
detector
'ET'
plt.figure(figsize=(4, 3), dpi=200)

convert_from_seconds_to_hours = 3600
plt.semilogx(frequency, (t_of_f - parameters['geocent_time'].iloc[0]) / convert_from_seconds_to_hours)
plt.xlabel('Frequency [Hz]')
plt.ylabel('Time to merger [hours]')
plt.grid(linestyle='dotted', linewidth='0.6', which='both')
plt.title('Time to merger as a function of frequency')
plt.show()
[Figure: time to merger as a function of frequency]

Calculate SNR#

# The networks are the combinations of detectors that will be used for the analysis
# The detection_SNR is the minimum SNR for a detection:
#   --> The first entry specifies the minimum SNR for a detection in a single detector
#   --> The second entry specifies the minimum network SNR for a detection
detectors = ['ET', 'CE1', 'LLO', 'LHO', 'VIR']
network = gw.detection.Network(detector_ids = detectors, detection_SNR = (0., 8.))

snr_1 = gw.utilities.get_snr(parameters, network, 'IMRPhenomPv2_NRTidal', f_ref)
snr_2 = gw.utilities.get_snr(parameters, network, 'TaylorF2', f_ref)
snr_1
ET CE1 LLO LHO VIR network
event_0 645.185225 1126.175241 85.506804 101.145441 27.430804 1304.924868
snr_2
ET CE1 LLO LHO VIR network
event_0 654.652687 1139.647308 87.783936 103.842227 28.060927 1321.606133
snr_1['network']/snr_2['network']
event_0    0.987378
Name: network, dtype: float64

Horizon Distance vs. Range: Key Differences#

In gravitational wave astronomy, horizon distance and range are two related but distinct concepts used to characterize detector sensitivity:


Horizon Distance#

The horizon distance is the maximum luminosity distance at which a gravitational wave detector can observe a source with a signal-to-noise ratio (SNR) equal to a specified detection threshold (typically SNR = 8).

Key characteristics:

  • Assumes an optimally oriented and located binary system (face-on, overhead)

  • Represents the best-case scenario for detection

  • Provides an upper limit on detector reach

Range#

The range is the distance at which a detector can observe sources with random orientations and sky locations, averaged over all possible angles, at the detection threshold SNR.

Key characteristics:

  • Accounts for the angular dependence of GW signal strength

  • Represents a more realistic detection capability

  • Typically smaller than the horizon distance by a factor of ~2-3

Relation to horizon:

\[d_{\rm range} \approx \frac{d_{\rm horizon}}{2.26}\]

for a uniformly distributed population in orientation and sky position.
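
The 2.26 factor (often attributed to Finn & Chernoff) can be checked with a quick Monte Carlo over source angles; the sketch below uses the textbook L-shaped antenna patterns and is purely illustrative:

import numpy as np

rng = np.random.default_rng(0)
n = 200_000
theta = np.arccos(rng.uniform(-1, 1, n))  # isotropic sky position
phi = rng.uniform(0, 2 * np.pi, n)
psi = rng.uniform(0, np.pi, n)
cosi = rng.uniform(-1, 1, n)              # isotropic orbital orientation

fp = 0.5 * (1 + np.cos(theta) ** 2) * np.cos(2 * phi) * np.cos(2 * psi) \
     - np.cos(theta) * np.sin(2 * phi) * np.sin(2 * psi)
fx = 0.5 * (1 + np.cos(theta) ** 2) * np.cos(2 * phi) * np.sin(2 * psi) \
     + np.cos(theta) * np.sin(2 * phi) * np.cos(2 * psi)

# SNR relative to the optimal (face-on, overhead) configuration
w = np.sqrt((fp * (1 + cosi ** 2) / 2) ** 2 + (fx * cosi) ** 2)
# Detectable volume scales as <w^3>, so d_horizon / d_range = <w^3>^(-1/3)
print(np.mean(w ** 3) ** (-1. / 3.))  # ~2.26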


# example event
masses = np.logspace(-0.1, 4, 50)
hor_params = {
        'mass_ratio':1.,
        'theta_jn':0.,
        'psi': 0.,
        'phase': 0.,
        'geocent_time': 0.
    }

network = gw.detection.Network(['ET'])
redshifts_snr = []

for mass in tqdm(masses):
    hor_params['chirp_mass_source'] = mass
    opt_parameters = gw.horizon.find_optimal_location(hor_params, network, waveform_model='IMRPhenomD')
    distance, redshift_det = gw.horizon.horizon(opt_parameters, network, target_SNR=8, waveform_model='IMRPhenomD')
    redshifts_snr.append(redshift_det)
100%|██████████| 50/50 [02:17<00:00,  2.75s/it]
# convert from chirp mass and mass ratio to total mass
def chirp_mass_q_to_total_mass(chirp_mass, mass_ratio):
    return chirp_mass * (1 + mass_ratio) ** (6/5) / (mass_ratio ** (3/5))
plt.figure(figsize=(4, 3), dpi=200)

# plot the results: mass vs redshift
plt.loglog(chirp_mass_q_to_total_mass(masses, np.ones_like(masses)), redshifts_snr, color='tab:orange', linewidth=4, alpha=1)
# color the space below the curve
plt.fill_between(chirp_mass_q_to_total_mass(masses, np.ones_like(masses)), 0, redshifts_snr, color='tab:orange', alpha=0.1)

plt.xlabel(r'$M_{\rm tot}$ $[M_{\odot}]$')
plt.ylabel(r'$\rm Redshift$')

plt.grid(True, which="both", linestyle="--", linewidth=0.5, alpha=0.25)
plt.ylim(0.1, 150)
plt.xlim(1.5, 10000)
plt.show()
[Figure: detection horizon, redshift vs total mass for ET]

Parameter estimation#

As recapped in the introduction, Bayesian parameter estimation yields the posterior \(p(\vec{\theta}|s) \propto \pi(\vec{\theta})\mathscr{L}(s|\vec{\theta})\), typically sampled with computationally expensive techniques such as MCMC or nested sampling (e.g. with bilby). The Fisher-matrix approximation described below sidesteps that cost.

Fisher-matrix approximation#

The gravitational-wave likelihood is defined as the probability of the noise realization:

\[ \mathscr{L}(s|\vec{\theta}) \propto \exp\left[-\frac{1}{2}\langle s - h(\vec{\theta})| s - h(\vec{\theta}) \rangle \right] \]

The inner product \(\langle \cdot|\cdot\rangle\) measures the overlap between two signals given the noise characteristics of the detector:

\[ \langle a | b \rangle \equiv 4\operatorname{Re}\int_{f_{\rm min}}^{f_{\rm max}} \frac{\tilde{a}(f)\tilde{b}^*(f)}{S_n(f)}df \]

We can approximate the likelihood by expanding the template around the true signal:

\[\begin{split} \begin{aligned} \vec{\theta} &\sim \vec{\theta}_0 + \Delta \vec{\theta}\\ h(\vec{\theta}) &\sim h(\vec{\theta}_0) + \frac{\partial h}{\partial \theta_i}\Delta \theta^i, \end{aligned} \end{split}\]

so that the likelihood becomes a multivariate Gaussian distribution:

\[\begin{split} \begin{aligned} \mathscr{L} \propto &\exp\left[-\frac{1}{2}\langle n|n \rangle + \langle n| \frac{\partial h}{\partial \theta_i}\rangle \Delta \theta^i -\frac{1}{2}\Delta \theta^i \langle\frac{\partial h}{\partial \theta_i}| \frac{\partial h}{\partial \theta_j} \rangle \Delta \theta^j \right]\\ =&\exp\left[-\frac{1}{2}\langle n|n \rangle\right] \exp\left[ -\frac{1}{2}\left(\Delta \theta^i F_{ij} \Delta \theta^j -2 \langle n| \frac{\partial h}{\partial \theta_i}\rangle \Delta \theta^i \right) \right]\\ =&\exp\left[-\frac{1}{2}\langle n|n \rangle\right]\\ & \times \exp\left[-\frac{1}{2}\left( \Delta \theta^i - (F^{-1})_{ik} \langle n | \frac{\partial h}{\partial \theta_k}\rangle\right) F_{ij} \left( \Delta \theta^j - (F^{-1})_{jl} \langle n | \frac{\partial h}{\partial \theta_l}\rangle\right) \right]\\ &\times \exp\left[\frac{1}{2}\langle n | \frac{\partial h}{\partial \theta_i}\rangle (F^{-1})_{ij} \langle n | \frac{\partial h}{\partial \theta_j}\rangle\right] \end{aligned} \end{split}\]
  1. The truncation of the expansion is at first order in the waveform derivatives; this is known as the linearized-signal approximation (LSA)

  2. The LSA is equivalent to keeping the leading term of the posterior expanded as a series in 1/SNR (this is why the Fisher matrix is said to work in the high-SNR limit) [see Vallisneri 2008]

The dominant term is the Fisher likelihood:

\[ \mathscr{L} \propto \exp\left[-\frac{1}{2} \left(\vec{\theta} - \vec{\theta}_{\rm inj}\right)^{\rm T} F \left(\vec{\theta} - \vec{\theta}_{\rm inj}\right) \right] \]

The inverse of the Fisher matrix gives us the covariance matrix among parameters:

\[ \sigma_i = \sqrt{\Sigma_{ii}} \ \leftrightarrow \Sigma_{ij} = \left[F^{-1}\right]_{ij}\]

This is the basic math behind Fisher matrix codes, like GWFish.
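
Putting the three pillars together, a schematic of how a Fisher matrix could be assembled from waveform derivatives and safely inverted (a sketch with assumed inputs, not GWFish's actual code):

import numpy as np

def inner_product(a, b, psd, freqs):
    # <a|b> = 4 Re int a(f) b*(f) / S_n(f) df, discretized on the frequency grid
    return 4.0 * np.real(np.trapz(a * np.conj(b) / psd, freqs))

def fisher_matrix(derivatives, psd, freqs):
    # derivatives: list of dh/dtheta_i arrays evaluated on the frequency grid
    n = len(derivatives)
    F = np.empty((n, n))
    for i in range(n):
        for j in range(i, n):
            F[i, j] = F[j, i] = inner_product(derivatives[i], derivatives[j], psd, freqs)
    return F

def covariance_from_fisher(F):
    # SVD-based inversion, in the spirit of the "safe inversion" of pillar 3
    U, s, Vt = np.linalg.svd(F)
    return Vt.T @ np.diag(1.0 / s) @ U.T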

Calculate \(1\sigma\) Errors#

For a more realistic analysis we can include the duty cycle of the detectors using use_duty_cycle = True

# The fisher parameters are the parameters that will be used to calculate the Fisher matrix
# and on which we will calculate the errors
fisher_parameters = ['chirp_mass', 'mass_ratio', 'luminosity_distance', 'theta_jn', 'dec','ra',
                     'psi', 'phase', 'geocent_time', 'a_1', 'a_2', 'lambda_1', 'lambda_2']
detected, network_snr, parameter_errors, sky_localization = gw.fishermatrix.compute_network_errors(
        network = gw.detection.Network(detector_ids = ['ET'], detection_SNR = (0., 8.)),
        parameter_values = parameters,
        fisher_parameters=fisher_parameters, 
        waveform_model = waveform_model,
        f_ref = 20.,
        )   
        # use_duty_cycle = False, # default is False anyway
        # save_matrices = False, # default is False anyway, put True if you want Fisher and covariance matrices in the output
        # save_matrices_path = None, # default is None anyway,
                                     # otherwise specify the folder
                                     # where to save the Fisher and
                                     # corresponding covariance matrices
    
100%|██████████| 1/1 [00:01<00:00,  1.21s/it]
print('The network SNR of the event is ', network_snr)
The network SNR of the event is  [645.00540645]
print('The sky localization of the event is ', sky_localization)
The sky localization of the event is  [4.75415021e-05]
# Choose percentile factor of sky localization and pass from rad2 to deg2
percentile = 90.
sky_localization_90cl = sky_localization * gw.fishermatrix.sky_localization_percentile_factor(percentile)
sky_localization_90cl # in deg2!!
array([0.71872681])
# One can create a dictionary with the parameter errors, the order is the same as the one given in fisher_parameters
parameter_errors_dict = {}
for i, parameter in enumerate(fisher_parameters):
    parameter_errors_dict['err_' + parameter] = np.squeeze(parameter_errors)[i]

print('The parameter errors of the event are ')
parameter_errors_dict
The parameter errors of the event are 
{'err_chirp_mass': 7.72772062478848e-07,
 'err_mass_ratio': 0.012457691886787943,
 'err_luminosity_distance': 2.251713928473747,
 'err_theta_jn': 0.07997827997775477,
 'err_dec': 0.0036304744266439223,
 'err_ra': 0.004552362939462272,
 'err_psi': 0.12334740159080766,
 'err_phase': 0.24840265150763793,
 'err_geocent_time': 9.92535556871366e-05,
 'err_a_1': 0.1748839167388211,
 'err_a_2': 0.21749201995398468,
 'err_lambda_1': 2555.01137110076,
 'err_lambda_2': 4606.837234927146}

Save results to file#

There is another function, analyze_and_save_to_txt, that allows one to save the results to a file. The difference with respect to the compute_network_errors function is that one can pass different network combinations and get a results file for each of them. This means that if your detectors list is something like ['LHO', 'LLO', 'VIR', 'CE1', 'ET'] and you want to create 3 different networks out of it, i.e. ['LHO', 'LLO', 'VIR'], ['CE1', 'ET'] and ['ET'] alone, then one should initialize the analyze_and_save_to_txt function as follows:

network = gw.detection.Network(detector_ids = ['LHO', 'LLO', 'VIR', 'CE1', 'ET'], detection_SNR = (0., 8.))

and then specify the different network combinations:

sub_network_ids_list = [[0, 1, 2], [3, 4], [4]]
# create the folder where results will be stored
!mkdir tutorial_results
data_folder = 'tutorial_results' # the one we just created
network = gw.detection.Network(detector_ids = ['ET'], detection_SNR = (0., 8.))
gw.fishermatrix.analyze_and_save_to_txt(network = network,
                                        parameter_values  = parameters,
                                        fisher_parameters = fisher_parameters, 
                                        sub_network_ids_list = [[0]],
                                        population_name = 'BNS',
                                        waveform_model = waveform_model,
                                        f_ref = 20.,
                                        save_path = data_folder,
                                        save_matrices = True)
100%|██████████| 1/1 [00:01<00:00,  1.22s/it]
fisher_matrix = np.load(data_folder + '/' + 'fisher_matrices_ET_BNS_SNR8.npy')
errors = pd.read_csv(data_folder + '/' + 'Errors_ET_BNS_SNR8.txt', delimiter = ' ')
# One can access all the column names of the errors output file:
errors.keys()
Index(['network_SNR', 'chirp_mass', 'mass_ratio', 'luminosity_distance',
       'theta_jn', 'ra', 'dec', 'psi', 'phase', 'geocent_time', 'a_1', 'a_2',
       'lambda_1', 'lambda_2', 'err_chirp_mass', 'err_mass_ratio',
       'err_luminosity_distance', 'err_theta_jn', 'err_dec', 'err_ra',
       'err_psi', 'err_phase', 'err_geocent_time', 'err_a_1', 'err_a_2',
       'err_lambda_1', 'err_lambda_2', 'err_sky_location'],
      dtype='object')

Same errors as before, just saved to a .txt file:

errors
network_SNR chirp_mass mass_ratio luminosity_distance theta_jn ra dec psi phase geocent_time ... err_dec err_ra err_psi err_phase err_geocent_time err_a_1 err_a_2 err_lambda_1 err_lambda_2 err_sky_location
0 645.005406 1.198 0.8309 43.75 2.545 3.446 -0.4081 0.0 0.0 1.187000e+09 ... 0.00363 0.004552 0.1233 0.2484 0.000099 0.1749 0.2175 2555.0 4607.0 0.000048

1 rows × 28 columns


A quick test#

One expects the relative error on the luminosity distance to scale as the inverse of the SNR, i.e. the \(d_L\)–\(d_L\) Fisher-matrix entry to be approximated by the squared ratio of the SNR to the luminosity distance:

\[ \frac{1}{SNR} = \frac{\Delta d_L}{d_L} \]

where \(\Delta d_L = \sqrt{\left[F\right]^{-1}_{d_L,d_L}}\), with \(F\) the Fisher matrix.

This can be derived from the fact that \(\partial_{d_L}h = -\frac{1}{d_L}h\). Indeed

\[ \tilde{h}(f) = \frac{1}{d_L} \tilde{h}_0(f) \]

where \(\tilde{h}_0(f)\) is the intrinsic waveform amplitude independent of distance. Taking the derivative with respect to luminosity distance:

\[ \partial_{d_L}h = -\frac{1}{d_L^2} \tilde{h}_0(f) = -\frac{h}{d_L} \]

Therefore:

\[\frac{1}{\left(\Delta d_L\right)^2} \sim F_{d_L,d_L}=\langle \partial_{d_L}h|\partial_{d_L}h\rangle \sim \frac{SNR^2}{d_L^2}\]

where the inverse of the squared error on the distance is the corresponding diagonal entry of the Fisher matrix \(F_{d_L,d_L}\) (assuming correlations are negligible).

A rough approximation in literature takes: \(\frac{\Delta d_L}{d_L} \sim \frac{2}{SNR}\).

my_fisher = fisher_matrix[0, :, :]
print('We expect Delta dL/dL to scale as 1/SNR')
print('fisher matrix dL-dL: ', my_fisher[2, 2])
print('(SNR/dL)^2: ', (errors['network_SNR'].iloc[0] / errors['luminosity_distance'].iloc[0])**2)
We expect Delta dL/dL to scale as 1/SNR
fisher matrix dL-dL:  217.37978198571082
(SNR/dL)^2:  217.35548047427832

Corner plot#

Using the covariance matrix, one can show all the correlations between pairs of parameters in a corner plot. Taking as inputs the injected values and the covariance matrix, one samples from a multivariate Gaussian distribution and plots the samples.

CORNER_KWARGS = dict(
    bins = 50, # number of bins for histograms
    smooth = 0.99, # smooths out contours. 
    plot_datapoints = True, # choose if you want datapoints
    label_kwargs = dict(fontsize = 12), # font size for labels
    show_titles = True, #choose if you want titles on top of densities.
    title_kwargs = dict(fontsize = 12), # font size for title
    plot_density = False,
    title_quantiles = [0.16, 0.5, 0.84],  # add quantiles to plot densities for 1d hist
    levels = (1 - np.exp(-0.5), 1 - np.exp(-2), 1 - np.exp(-9 / 2.)), # 1, 2 and 3 sigma contours for 2d plots
    fill_contours = True, #decide if you want to fill the contours
    max_n_ticks = 2, # set a limit to ticks in the x-y axes.
    title_fmt=".3f"
    )
corner_lbs = [r'${\mathcal{M}}_{\rm chirp}$ $[M_{\odot}]$', r'$q$', r'$D_{\rm L}$ [Mpc]',
                r'$\iota$ [rad]', r'$DEC$ [rad]', r'$RA$ [rad]', r'$\Psi$ [rad]',
                r'$phase$', r'$t_c$ [s]', r'$a_1$', r'$a_2$', r'$\Lambda_1$', r'$\Lambda_2$']

mean_lbs = ['chirp_mass', 'mass_ratio', 'luminosity_distance', 'theta_jn', 'dec', 'ra', 'psi',
            'phase', 'geocent_time', 'a_1', 'a_2', 'lambda_1', 'lambda_2']
mean_values = parameters[mean_lbs].iloc[0] # mean values of the parameters
cov_matrix = np.load(data_folder + '/' + 'inv_fisher_matrices_ET_BNS_SNR8.npy')[0, :, :]
# Sample from a multi-variate gaussian with the given covariance matrix and injected mean values
samples = np.random.multivariate_normal(mean_values, cov_matrix, int(1e6))
fig = corner.corner(samples, labels = corner_lbs, truths = mean_values, truth_color = 'red',
                    **CORNER_KWARGS)
plt.show()
/var/folders/5q/zf85wh2d0sxgdd55ws_1kk94pzxs0q/T/ipykernel_2579/2349962579.py:2: RuntimeWarning: covariance is not symmetric positive-semidefinite.
  samples = np.random.multivariate_normal(mean_values, cov_matrix, int(1e6))
[Figure: corner plot of the samples drawn from the multivariate Gaussian]

Validity of the Fisher matrix approach#

Check the appendix of https://arxiv.org/pdf/2205.02499

Population Analysis#

The functions are the same as explained above, but applied to a population sample instead of a single event

# Example for BNS population
nev = 50 # number of events
my_dict = {}
my_dict['redshift'] = np.random.uniform(0.01, 0.1, nev)
my_dict['mass_1'] = np.random.uniform(1.1, 2.5, nev)
my_dict['mass_2'] = np.random.uniform(1.1, 2.5, nev)
my_dict['luminosity_distance'] = Planck18.luminosity_distance(my_dict['redshift']).value
my_dict['theta_jn'] = np.arccos(np.random.uniform(-1., 1., nev))
my_dict['ra'] = np.random.uniform(0., 2 * np.pi, nev)
my_dict['dec'] = np.arcsin(np.random.uniform(-1., 1., nev))
my_dict['psi'] = np.random.uniform(0., np.pi, nev)
my_dict['phase'] = np.random.uniform(0., 2 * np.pi, nev)
my_dict['geocent_time'] = np.random.uniform(1577491218, 1609027217, nev)
my_dict['a_1'] = np.random.uniform(0., 0.1, nev)
my_dict['a_2'] = np.random.uniform(0., 0.1, nev)


aux_mass = my_dict['mass_1'] # sort the two masses so that m1>m2
my_dict['mass_1'] = np.maximum(aux_mass, my_dict['mass_2'])
my_dict['mass_2'] = np.minimum(aux_mass, my_dict['mass_2'])
my_pop = pd.DataFrame(my_dict)
my_pop.head(5)
redshift mass_1 mass_2 luminosity_distance theta_jn ra dec psi phase geocent_time a_1 a_2
0 0.088836 1.720496 1.327960 419.563663 1.570763 4.005041 -0.104657 1.998333 1.472574 1.588648e+09 0.095327 0.096206
1 0.091909 2.330022 2.277427 434.970362 2.685763 2.314377 0.098768 0.235053 6.013609 1.607205e+09 0.060711 0.056208
2 0.076305 2.323697 1.440571 357.327209 2.480739 5.318288 0.473725 1.173636 5.593847 1.577508e+09 0.023457 0.050211
3 0.060699 1.776289 1.299748 281.189077 1.848795 3.653918 0.072894 0.201880 0.299531 1.585179e+09 0.040577 0.003360
4 0.090505 2.150098 1.697875 427.924442 1.453009 2.711202 0.842196 2.211414 2.975095 1.585081e+09 0.027579 0.048794
detectors_pop = ['ET']
network_pop = gw.detection.Network(detector_ids = detectors_pop, detection_SNR = (0., 8.))
waveform_model_pop = 'IMRPhenomHM'
fisher_parameters_pop = ['mass_1', 'mass_2', 'luminosity_distance', 'theta_jn', 'dec','ra',
                     'psi', 'phase', 'geocent_time', 'a_1', 'a_2']
!mkdir pop_gwfish_results
mkdir: pop_gwfish_results: File exists
data_folder_pop = 'pop_gwfish_results'
gw.fishermatrix.analyze_and_save_to_txt(network = network_pop,
                                        parameter_values  = my_pop,
                                        fisher_parameters = fisher_parameters_pop, 
                                        sub_network_ids_list = [[0]],
                                        population_name = 'BNS_POP',
                                        waveform_model = waveform_model_pop, # if no f_ref is passed, the code will use the default value of 50 Hz
                                        save_path = data_folder_pop,
                                        save_matrices = False)
100%|██████████| 50/50 [00:33<00:00,  1.47it/s]
pop_errors = pd.read_csv(data_folder_pop + '/' + 'Errors_ET_BNS_POP_SNR8.txt', delimiter = ' ')
pop_errors.head(5)
network_SNR redshift mass_1 mass_2 luminosity_distance theta_jn ra dec psi phase ... err_luminosity_distance err_theta_jn err_dec err_ra err_psi err_phase err_geocent_time err_a_1 err_a_2 err_sky_location
0 35.765094 0.08884 1.720 1.328 419.6 1.571 4.005 -0.10470 1.9980 1.4730 ... 106.00 0.017220 0.32310 0.11850 0.07294 0.07440 0.005964 0.3202 0.4398 0.082180
1 96.657502 0.09191 2.330 2.277 435.0 2.686 2.314 0.09877 0.2351 6.0140 ... 226.70 1.091000 0.01853 0.02774 4.75800 9.54300 0.000577 1.0740 1.1040 0.001492
2 78.204855 0.07631 2.324 1.441 357.3 2.481 5.318 0.47370 1.1740 5.5940 ... 78.20 0.334100 0.01065 0.01559 1.05500 2.10000 0.000354 0.2188 0.3771 0.000442
3 94.342000 0.06070 1.776 1.300 281.2 1.849 3.654 0.07289 0.2019 0.2995 ... 21.48 0.011450 0.11120 0.13060 0.05223 0.04876 0.001242 0.2956 0.4244 0.040370
4 81.174093 0.09050 2.150 1.698 427.9 1.453 2.711 0.84220 2.2110 2.9750 ... 23.00 0.006802 0.25050 0.31100 0.21810 0.04901 0.001057 0.3569 0.4655 0.161000

5 rows × 25 columns

plt.hist(pop_errors['network_SNR'], bins = 10, color = 'purple', alpha = 0.5, linewidth = 2)
plt.xlabel('Network SNR')
plt.ylabel('Number of events')
plt.grid(linestyle='dotted', linewidth='0.6', which='both')
plt.tight_layout()
plt.show()
[Figure: histogram of the network SNR for the population]
percentile_pop = 90
sky_loc_90cl_pop = pop_errors['err_sky_location'] * gw.fishermatrix.sky_localization_percentile_factor(percentile_pop)
sc = plt.scatter(pop_errors['err_luminosity_distance'] / pop_errors['luminosity_distance'],
                 sky_loc_90cl_pop, c = pop_errors['network_SNR'], cmap = 'inferno')
plt.xscale('log')
plt.yscale('log')
plt.xlabel(r'$\Delta d_L/d_L$')
plt.ylabel(r'$\Delta \Omega_{%s}$' % int(percentile_pop)) # use the population percentile defined above
plt.colorbar(sc, label = 'SNR')
plt.grid(which = 'both', color = 'lightgray', alpha = 0.5, linewidth = 0.5)
plt.show()
[Figure: 90% sky localization vs relative distance error, colored by network SNR]

GWFish meets Priors#

In Bayesian analysis, we start with prior beliefs about the parameters of the model, which represent our initial knowledge or assumptions. These priors can be informed by astrophysical models, previous observations, or theoretical predictions. As we observe a gravitational wave signal, we update these beliefs using the likelihood function, which quantifies the probability of observing the data given the parameters. The result is the posterior distribution, which represents our updated knowledge about the parameters after considering the data.

One of the advantages of the Bayesian approach is its ability to incorporate prior information and quantify uncertainties in a rigorous way. By combining prior knowledge with the observed data, we can make more informed inferences about the properties of the gravitational wave source.

\[p(\vec{\theta}|s) \propto \pi(\vec{\theta}){\mathcal{L}}(s|\vec{\theta})\]

The Fisher-matrix analysis works under the assumption that, when the signal-to-noise ratio (SNR) is sufficiently large, the likelihood is a highly peaked function and no priors are needed. However, this is rarely the case, as correlations may spread the likelihood, and sampling from the multivariate Gaussian may leak outside the physical range of the parameters.

In this tutorial, we show how to integrate prior information, \(\pi(\vec{\theta})\), into the Fisher likelihood while maintaining computational efficiency in the sampling process. Additionally, we offer insights into the comparative behavior of different prior choices.

The procedure is addressed in a second GWFish paper available on arXiv. The usage of the code for the paper’s purposes can be found in this repo.

Introducing the new modules#

This part is based on the new module priors.py, and the sampling procedure relies on the minimax_tilting_sampler.py module (see below). It consists of building on top of the standard GWFish output and incorporating prior information in post-processing.

import warnings
warnings.filterwarnings("ignore", "Wswiglal-redir-stdio")
import GWFish.modules as gw
from GWFish.modules.priors import *

import numpy as np
import pandas as pd

import matplotlib
import matplotlib.pyplot as plt

# Event's parameters should be passed as Pandas dataframe
parameters = {
    'chirp_mass': np.array([20]), 
    'mass_ratio': np.array([0.75]), 
    'luminosity_distance': np.array([500.]),
    'theta_jn': np.array([0.06*np.pi]),
    'ra': np.array([np.pi/4]),
    'dec': np.array([np.pi/4]),
    'psi': np.array([np.pi/4]),
    'phase': np.array([np.pi/4]),
    'geocent_time': np.array([1187008882]),
    'a_1':np.array([0.2]), 
    'a_2':np.array([0.2]),
    'tilt_1':np.array([np.pi/4]),
    'tilt_2':np.array([np.pi/4]),
    'phi_12':np.array([np.pi/4]),
    'phi_jl':np.array([np.pi/4])}
parameters = pd.DataFrame(parameters)
parameters
chirp_mass mass_ratio luminosity_distance theta_jn ra dec psi phase geocent_time a_1 a_2 tilt_1 tilt_2 phi_12 phi_jl
0 20 0.75 500.0 0.188496 0.785398 0.785398 0.785398 0.785398 1187008882 0.2 0.2 0.785398 0.785398 0.785398 0.785398
# The fisher parameters are the parameters that will be used to calculate the Fisher matrix
# and on which we will calculate the errors
fisher_parameters = list(parameters.keys())
detected, network_snr, parameter_errors, sky_localization = gw.fishermatrix.compute_network_errors(
        network = gw.detection.Network(detector_ids = ['LLO'], detection_SNR = (0., 8.)),
        parameter_values = parameters,
        fisher_parameters=fisher_parameters, 
        waveform_model = 'IMRPhenomXPHM',
        save_matrices = True # you need to save the covariance matrix for the next steps
        )
100%|██████████| 1/1 [00:04<00:00,  4.76s/it]

Post-processing with priors#

rng = np.random.default_rng(42) # fix the seed for reproducibility (np.random.seed does not affect default_rng)
# Decide the number of samples to draw
num_samples = int(1e6)

param_lbs = {'chirp_mass': r'$\mathcal{M}_c$ $[M_{\odot}]$', 'mass_ratio': r'$q$', 'luminosity_distance': r'$d_L$ [Mpc]',
                'dec': r'$\texttt{DEC}$ [rad]', 'ra': r'$\texttt{RA}$ [rad]', 'theta_jn': r'$\theta_{JN}$ [rad]', 'psi': r'$\Psi$ [rad]',
                'phase': r'$\phi$ [rad]', 'geocent_time': r'$t_c$ [s]', 'a_1': r'$a_1$', 'a_2': r'$a_2$',
                'tilt_1': r'$\theta_1$ [rad]', 'tilt_2': r'$\theta_2$ [rad]', 'phi_12': r'$\phi_{12}$ [rad]',
                'phi_jl': r'$\phi_{JL}$ [rad]'}
 
mean_values = np.array(parameters[fisher_parameters].iloc[0]) # mean values of the parameters
cov_matrix = np.load('inv_fisher_matrices.npy')[0, :, :]
fisher_samples = pd.DataFrame(rng.multivariate_normal(mean_values, cov_matrix, int(1e6)), columns = fisher_parameters)
/var/folders/5q/zf85wh2d0sxgdd55ws_1kk94pzxs0q/T/ipykernel_2579/341241274.py:14: RuntimeWarning: covariance is not symmetric positive-semidefinite.
  fisher_samples = pd.DataFrame(rng.multivariate_normal(mean_values, cov_matrix, int(1e6)), columns = fisher_parameters)
# Calculate and print the eigenvalues of the covariance matrix
eigenvalues = np.linalg.eigvals(cov_matrix)
print("Eigenvalues of the covariance matrix:")
print(eigenvalues)
Eigenvalues of the covariance matrix:
[ 3.58380157e+04  1.96983955e+01  5.18814843e+00  1.84197175e+00
  7.47926969e-01  2.24173241e-01  1.84966754e-02  4.04390188e-03
  1.52665604e-03  1.15612301e-04  3.21511017e-06  9.29855798e-07
  3.70096283e-08  2.72211042e-20 -4.72698808e-16]
cov_matrix = cov_matrix + 1e-3 * np.eye(len(fisher_parameters)) # regularize the near-singular matrix with a small diagonal jitter
# In priors.py the function get_truncated_likelihood_samples takes the following arguments:
# - fisher_parameters: list of parameters
# - mean_values: mean values of the parameters
# - cov_matrix: covariance matrix
# - num_samples: number of samples to draw
# - min_array: minimum values of the parameters
# - max_array: maximum values of the parameters
# The function returns the samples from the truncated likelihood
# You can get this information using the command shown in the cell above

# If min_array and max_array are not provided, the function will use the default range for each parameter
# specified in the default prior dictionary (these ranges are used to truncate the likelihood)

samples_from_truncated_lkh = get_truncated_likelihood_samples(fisher_parameters, mean_values, cov_matrix, num_samples)#, min_array, max_array)
/Users/sjs8171/opt/anaconda3/envs/gwfish_env/lib/python3.9/site-packages/GWFish/modules/minimax_tilting_sampler.py:359: RuntimeWarning: invalid value encountered in multiply
  pr = np.ones_like(z) * np.inf  # compute marginal prob.
Warning: Method may fail as covariance matrix is close to singular!
# Plot the posterior distribution of the parameter of choice
sel_param = 'luminosity_distance'

injections = np.array(mean_values)
fig = plt.figure(figsize=(6, 4))
plt.hist(fisher_samples[sel_param], bins=50, histtype='step', density=False,
            label='Fisher', alpha = 1., color = 'black')
plt.hist(samples_from_truncated_lkh[sel_param], bins=50, histtype='step', density=False,
            label='Fisher + Truncated', alpha = 1., color = 'blue')
#plt.hist(samples_from_posterior[sel_param], bins=50, histtype='step', density=True,
#            label='Fisher + Truncated + Priors', alpha = 1., color = 'red')
plt.axvline(injections[fisher_parameters.index(sel_param)], color='black', linestyle='--', label='True Value')
plt.xlabel(param_lbs[sel_param])
plt.ylabel('Counts') # histograms are drawn with density=False
fig.legend(loc='center', bbox_to_anchor=(0.5, 0.95), ncol=2, frameon=False)
plt.show()
[Figure: luminosity distance, Fisher samples vs truncated-likelihood samples]
# Plot the posterior distribution of the parameter of choice
sel_param = 'theta_jn'

injections = np.array(mean_values)
fig = plt.figure(figsize=(6, 4))
plt.hist(fisher_samples[sel_param], bins=50, histtype='step', density=False,
            label='Fisher', alpha = 1., color = 'black')
plt.hist(samples_from_truncated_lkh[sel_param], bins=50, histtype='step', density=False,
            label='Fisher + Truncated', alpha = 1., color = 'blue')
#plt.hist(samples_from_posterior[sel_param], bins=50, histtype='step', density=True,
#            label='Fisher + Truncated + Priors', alpha = 1., color = 'red')
plt.axvline(injections[fisher_parameters.index(sel_param)], color='black', linestyle='--', label='True Value')
plt.xlabel(param_lbs[sel_param])
plt.ylabel('Counts') # histograms are drawn with density=False
fig.legend(loc='center', bbox_to_anchor=(0.5, 0.95), ncol=2, frameon=False)
plt.show()
[Figure: theta_jn, Fisher samples vs truncated-likelihood samples]

Priors dictionary example#

priors_dict = {}
priors_dict['chirp_mass'] = {
    'prior_type': 'uniform_in_component_masses_chirp_mass',
    'lower_prior_bound': parameters['chirp_mass'].iloc[0] - 5.,
    'upper_prior_bound': parameters['chirp_mass'].iloc[0] + 5.
}
priors_dict['mass_ratio'] = {
    'prior_type': 'uniform_in_component_masses_mass_ratio',
    'lower_prior_bound': 0.05,
    'upper_prior_bound': 0.99
}
priors_dict['luminosity_distance'] = {
    'prior_type': 'uniform_in_distance_squared',
    'lower_prior_bound': 0,
    'upper_prior_bound': 2300
}
priors_dict['theta_jn'] = {
    'prior_type': 'uniform_in_cosine',
    'lower_prior_bound': 0,
    'upper_prior_bound': np.pi/12
}
priors_dict['ra'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': 2*np.pi
}
priors_dict['dec'] = {
    'prior_type': 'uniform_in_cosine',
    'lower_prior_bound': -np.pi/2,
    'upper_prior_bound': np.pi/2
}
priors_dict['psi'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': np.pi
}
priors_dict['phase'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': 2*np.pi
}
priors_dict['geocent_time'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': parameters['geocent_time'].iloc[0] - 3,
    'upper_prior_bound': parameters['geocent_time'].iloc[0] + 3
}
priors_dict['a_1'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': 1.
}
priors_dict['a_2'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': 1.
}
priors_dict['tilt_1'] = {
    'prior_type': 'uniform_in_sine',
    'lower_prior_bound': 0.,
    'upper_prior_bound': np.pi
}
priors_dict['tilt_2'] = {
    'prior_type': 'uniform_in_sine',
    'lower_prior_bound': 0.,
    'upper_prior_bound': np.pi
}
priors_dict['phi_12'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': 2*np.pi
}
priors_dict['phi_jl'] = {
    'prior_type': 'uniform',
    'lower_prior_bound': 0.,
    'upper_prior_bound': 2*np.pi
}
# To avoid having a large number of samples outside the physical range, we apply the prior to the samples from
# the truncated likelihood. The function get_posteriors_samples takes the following arguments:
# - fisher_parameters: list of parameters
# - samples_from_truncated_lkh: samples from the truncated likelihood
# - num_samples: number of samples to draw
# - priors_dict: dictionary with the prior ranges for each parameter (if not provided, the default prior dictionary is used)

samples_from_posterior = get_posteriors_samples(fisher_parameters, samples_from_truncated_lkh, num_samples, priors_dict=priors_dict)#, priors_dict)
# Plot the posterior distribution of the parameter of choice
sel_param = 'luminosity_distance'

injections = np.array(mean_values)
fig = plt.figure(figsize=(12, 4))
#plt.hist(fisher_samples[sel_param], bins=50, histtype='step', density=True,
#            label='Fisher', alpha = 1., color = 'black')
plt.hist(samples_from_truncated_lkh[sel_param], bins=50, histtype='step', density=True,
            label='Fisher + Truncated', alpha = 1., color = 'blue')
plt.hist(samples_from_posterior[sel_param], bins=50, histtype='step', density=True,
            label='Fisher + Truncated + Priors', alpha = 1., color = 'red')
plt.axvline(injections[fisher_parameters.index(sel_param)], color='black', linestyle='--', label='True Value')
plt.xlabel(param_lbs[sel_param])
plt.ylabel('PDF')
fig.legend(loc='center', bbox_to_anchor=(0.5, 0.95), ncol=2, frameon=False)
plt.show()
[Figure: luminosity distance, truncated likelihood vs posterior with priors]
# Plot the posterior distribution of the parameter of choice using corner

sel_params = ['luminosity_distance', 'theta_jn']  # Parameters to plot
# Extract the indices of the selected parameters
sel_indices = [fisher_parameters.index(param) for param in sel_params]
mean_lbs = ['chirp_mass', 'mass_ratio', 'luminosity_distance', 'theta_jn', 'dec', 'ra', 'psi',
            'phase', 'geocent_time', 'a_1', 'a_2']
mean_values = parameters[mean_lbs].iloc[0] # mean values of the parameters

injections = np.array(mean_values)
# Extract the corresponding samples
samples_fisher_truncated = np.column_stack([samples_from_truncated_lkh[param] for param in sel_params])
samples_fisher_truncated_priors = np.column_stack([samples_from_posterior[param] for param in sel_params])

# Extract the true values for the selected parameters
true_values = injections[sel_indices]

# Create the corner plot
fig = corner.corner(samples_fisher_truncated, labels=[param_lbs[param] for param in sel_params],
                    truths=true_values, truth_color='black', color='blue', alpha=0.5,
                    label_kwargs={'fontsize': 14}, show_titles=True, title_kwargs={"fontsize": 12}, dpi = 180)

corner.corner(samples_fisher_truncated_priors, labels=[param_lbs[param] for param in sel_params],
              truths=true_values, truth_color='black', color='red', alpha=0.5, fig=fig)
# Add legend
handles = [plt.Line2D([], [], color='blue', label='Fisher Truncated'),
           plt.Line2D([], [], color='red', label='Posterior with Priors')]
fig.legend(handles=handles, loc='upper right', fontsize=12)

plt.show()
[Figure: corner plot, truncated likelihood (blue) vs posterior with priors (red)]

Parameter estimation for pre-merger#

To perform parameter estimation with only the pre-merger portion of the signal, zero out the waveform polarizations after the chosen cutoff time by modifying this line in janosch314/GWFish:

pre_merger_time = 60.
polarizations[np.where(self.t_of_f > (self.gw_params['geocent_time'] - pre_merger_time)), :] = 0.j

This approach has been used in https://www.aanda.org/articles/aa/pdf/2023/10/aa45850-23.pdf

from IPython.display import Image, display

# Add an image to the notebook

# You can display an image from a file path
display(Image(filename='Screenshot 2025-10-29 at 12.27.24 PM.png'))

# Or from a URL
# display(Image(url='https://example.com/image.png'))

Exercises for the tutorial: try yourself#

  1. Check how parameter estimation improves if you fix any of the GW parameters. Fix everything apart from distance, inclination, and sky position

  2. Compare the sky localization for a 1.4 + 1.4 M_sun BNS @ 200 Mpc, face-on, with LVKI and ET

  3. Repeat the exercise of adding priors, assuming we have a precise measurement of the redshift (e.g. 1% accuracy)