Open Data from LIGO, Virgo, and KAGRA through the First Part of the Fourth Observing Run (2508.18079v1)

Published 25 Aug 2025 in gr-qc and astro-ph.HE

Abstract: LIGO, Virgo, and KAGRA form a network of gravitational-wave observatories. Data and analysis results from this network are made publicly available through the Gravitational Wave Open Science Center. This paper describes open data from this network, including the addition of data from the first part of the fourth observing run (O4a) and selected periods from the preceding engineering run, collected from May 2023 to January 2024. The public data set includes calibrated strain time series for each instrument, data from additional channels used for noise subtraction and detector characterization, and analysis data products from version 4.0 of the Gravitational-Wave Transient Catalog.

Summary

  • The paper presents the first public O4a release of gravitational-wave data from LIGO, Virgo, and KAGRA, detailing calibration methods and auxiliary channel information.
  • Calibration uncertainty is estimated hourly, in near real time, from continuous photon-calibrator injections, supporting parameter estimation and event validation.
  • Multiple strain data formats, detailed data quality flags, and extensive auxiliary channels support robust signal analysis and reproducible research.

Introduction and Scope

This paper presents the public release of gravitational-wave (GW) data from the LIGO, Virgo, and KAGRA observatories, focusing on the first segment of the fourth observing run (O4a), spanning May 2023 to January 2024. The release is facilitated through the Gravitational Wave Open Science Center (GWOSC), providing calibrated strain time series, auxiliary instrumental channels, and analysis products from GWTC-4.0. The dataset is structured to maximize accessibility and scientific utility, supporting a broad range of GW analyses, including compact binary coalescence (CBC), burst, continuous wave (CW), and stochastic background searches.

Observing Runs and Data Products

The GW network operates in discrete observing runs, with O4a representing the latest public release. During O4a, only LIGO Hanford (LHO) and Livingston (LLO) provided data suitable for analysis, as Virgo was offline for commissioning and KAGRA/GEO sensitivity was suboptimal. The primary data product is the calibrated strain $h(t)$, sampled at 16,384 Hz, with a typical annual data rate of $\sim$4 TB per instrument. The strain data is accompanied by hundreds of thousands of auxiliary channels monitoring environmental and instrumental states.
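As a rough consistency check on the quoted data rate, the sketch below estimates the yearly volume of a single strain channel. It assumes 8-byte (double-precision) samples, no compression, and uninterrupted recording for a full year; none of these assumptions is stated in the summary, and the function name is illustrative.

```python
# Back-of-envelope estimate of uncompressed strain volume per instrument.
# Assumptions (not from the summary): 8-byte samples, no compression,
# and continuous recording for one Julian year.

SAMPLE_RATE_HZ = 16_384            # strain sampling rate quoted in the text
BYTES_PER_SAMPLE = 8               # assumed double-precision storage
SECONDS_PER_YEAR = 365.25 * 86_400


def annual_volume_tb(sample_rate_hz=SAMPLE_RATE_HZ,
                     bytes_per_sample=BYTES_PER_SAMPLE,
                     seconds=SECONDS_PER_YEAR):
    """Return the yearly data volume in terabytes (1 TB = 1e12 bytes)."""
    return sample_rate_hz * bytes_per_sample * seconds / 1e12


if __name__ == "__main__":
    print(f"{annual_volume_tb():.2f} TB per year")
```

Under these assumptions the result is close to the quoted figure of roughly 4 TB per instrument per year; actual file sizes depend on duty cycle, format, and compression.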

Instrument sensitivity is characterized by the binary neutron star (BNS) inspiral range, which fluctuates due to noise and operational interruptions. The BNS range during O4a frequently approached 160 Mpc, with periods of reduced sensitivity due to elevated noise or downtime.

Figure 1: BNS inspiral range over time for O4a, illustrating sensitivity fluctuations and operational intervals.

Calibration Methodology and Uncertainties

Calibration reconstructs the differential arm motion $\Delta L(t)$ using the interferometer response function $R$ and the error signal $d_{\rm err}$, yielding the strain $h(t) = \Delta L(t)/L$. For O4a, calibration uncertainties are quantified hourly, incorporating continuous photon-calibrator sinusoidal injections at discrete frequencies. The systematic error and $1\sigma$ uncertainty envelopes are provided for each hour, with direct measurements at calibration frequencies.

Figure 2: Frequency-dependent calibration error for LIGO Hanford and Livingston during a one-hour period in O4a, showing magnitude and phase uncertainties.
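The calibration relation described in this section can be illustrated at a single frequency. The sketch below assumes the frequency-domain form $\Delta L(f) = R(f)\, d_{\rm err}(f)$, a 4 km arm length, and a symmetric fractional magnitude uncertainty; the function names are hypothetical, and this is not the collaboration's calibration pipeline.

```python
# Single-frequency sketch of h = Delta L / L with Delta L = R * d_err.
# Assumptions: frequency-domain product form, 4 km arms, and a symmetric
# fractional 1-sigma uncertainty on the magnitude.

ARM_LENGTH_M = 4000.0  # LIGO arm length; not stated in the summary


def strain_from_error_signal(d_err, response, arm_length_m=ARM_LENGTH_M):
    """Return the complex strain h(f) = R(f) * d_err(f) / L."""
    delta_l = response * d_err          # differential arm motion
    return delta_l / arm_length_m


def magnitude_bounds(h, frac_uncertainty):
    """Return (low, high) bounds on |h| for a fractional uncertainty."""
    magnitude = abs(h)
    return (magnitude * (1.0 - frac_uncertainty),
            magnitude * (1.0 + frac_uncertainty))


if __name__ == "__main__":
    # Placeholder inputs chosen only to exercise the functions.
    h = strain_from_error_signal(d_err=1.0e-6 + 0j, response=2.0e-12 + 0j)
    print(abs(h), magnitude_bounds(h, 0.05))
```

In practice the response function and its uncertainty are frequency dependent and complex valued, so a real analysis would apply the published hourly envelopes across the full band rather than a single scalar.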

The O4a release is the first in which near-real-time calibration uncertainty estimates are provided for use with the final calibrated strain product. The valid frequency range for analysis is $10$–$5000~\mathrm{Hz}$, constrained by calibration and anti-aliasing filter roll-off.
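One practical consequence of the valid band is that frequency-domain quantities outside it should be discarded before analysis. A minimal sketch, with hypothetical helper names and inclusive band edges as an assumption:

```python
# Restrict frequency-domain data to the band quoted in the text.
# Whether the edges are inclusive is an assumption of this sketch.

F_MIN_HZ = 10.0
F_MAX_HZ = 5000.0


def in_valid_band(freq_hz, f_min=F_MIN_HZ, f_max=F_MAX_HZ):
    """Return True if freq_hz lies inside the calibrated band."""
    return f_min <= freq_hz <= f_max


def restrict_to_valid_band(freqs_hz, values):
    """Keep only (frequency, value) pairs inside the calibrated band."""
    return [(f, v) for f, v in zip(freqs_hz, values) if in_valid_band(f)]


if __name__ == "__main__":
    freqs = [5.0, 10.0, 100.0, 5000.0, 6000.0]
    print(restrict_to_valid_band(freqs, ["a", "b", "c", "d", "e"]))
```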

Data Quality, Noise Mitigation, and Hardware Injections

Strain data is affected by non-Gaussian, non-stationary noise artifacts, including glitches and spectral lines. Data-quality flags (CAT1, CAT2, CAT3) are used to exclude compromised segments, with CAT1 marking severe issues. For CBC analyses, the iDQ supervised-learning framework provides statistical flags based on auxiliary channel activity, enabling dynamic vetoing or re-ranking of candidate events.
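Excluding flagged times amounts to subtracting veto intervals from the list of analyzable intervals. The sketch below shows that operation on half-open `[start, end)` GPS intervals; the function name and the example times are illustrative, and real analyses would use the published segment lists.

```python
# Remove veto intervals (e.g. CAT1-flagged times) from science intervals.
# All intervals are half-open (start, end) tuples in GPS seconds.


def subtract_segments(science, vetoes):
    """Return the parts of `science` not covered by any interval in `vetoes`."""
    vetoes = sorted(vetoes)
    result = []
    for start, end in sorted(science):
        cursor = start                      # earliest time not yet handled
        for v_start, v_end in vetoes:
            if v_end <= cursor:
                continue                    # veto lies entirely before cursor
            if v_start >= end:
                break                       # remaining vetoes start too late
            if v_start > cursor:
                result.append((cursor, v_start))
            cursor = max(cursor, v_end)
            if cursor >= end:
                break
        if cursor < end:
            result.append((cursor, end))
    return result


if __name__ == "__main__":
    # Toy GPS times, not taken from any real segment list.
    print(subtract_segments([(0, 100)], [(10, 20), (50, 60)]))
```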

Hardware injections simulate GW signals for detector characterization and safety studies. In O4a, only CW-type injections (simulating spinning neutron stars) were present during observing mode, with minimal impact on transient searches.

Spectral line catalogs are maintained to identify persistent narrowband features, critical for CW and stochastic searches. Glitch subtraction is performed using BayesWave and linear noise subtraction, with 16 O4a candidate events requiring targeted mitigation.
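A line catalog is typically used to drop frequency bins that fall near a known narrowband feature. The sketch below assumes a catalog of `(center_hz, half_width_hz)` pairs; that layout, the function names, and the example entry are illustrative rather than the format of the released catalogs.

```python
# Drop frequency bins that fall within a cataloged narrowband line.
# The catalog layout (center, half-width) is an assumption of this sketch.


def near_cataloged_line(freq_hz, lines):
    """Return True if freq_hz is within half_width of any line center."""
    return any(abs(freq_hz - center) <= half_width
               for center, half_width in lines)


def clean_bins(freqs_hz, lines):
    """Return the frequencies that are not contaminated by a cataloged line."""
    return [f for f in freqs_hz if not near_cataloged_line(f, lines)]


if __name__ == "__main__":
    # Illustrative entry: a line at the US power-mains frequency.
    example_lines = [(60.0, 0.5)]
    print(clean_bins([59.0, 59.8, 60.0, 61.0], example_lines))
```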

Data Structure, Formats, and Access

Calibrated strain data is released in 4096-second files, available in HDF5 and GWF formats, at both 16 kHz and 4 kHz sampling rates. The file structure encodes metadata, strain arrays, data-quality masks, and injection masks. The bitmask structure for data quality and injections is standardized across runs, with additional bits for STOCH and CW searches in O4a.
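Reading a data-quality bitmask comes down to testing individual bits of an integer. The sketch below uses a hypothetical bit assignment and assumes one mask value per second of data; the actual bit definitions for O4a should be taken from the GWOSC documentation.

```python
# Decode a data-quality bitmask. The bit positions below are hypothetical
# and serve only to illustrate the technique.

DQ_BITS = {"DATA": 0, "CBC_CAT1": 1, "CBC_CAT2": 2, "CBC_CAT3": 3}


def flags_set(mask_value, bits=DQ_BITS):
    """Return the names of all flags whose bit is set in mask_value."""
    return {name for name, bit in bits.items() if (mask_value >> bit) & 1}


def passes(mask_value, required, bits=DQ_BITS):
    """Return True if every flag named in `required` is set."""
    return all((mask_value >> bits[name]) & 1 for name in required)


def seconds_passing(mask, required, bits=DQ_BITS):
    """Return indices (seconds from file start) that pass all required flags."""
    return [i for i, value in enumerate(mask) if passes(value, required, bits)]


if __name__ == "__main__":
    mask = [0b1111, 0b0001, 0b0011]   # toy mask, one value per second
    print(seconds_passing(mask, ["DATA", "CBC_CAT1"]))
```

Under the same one-value-per-second assumption, a 4096-second file would carry 4096 mask entries alongside its strain array.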

Alternate strain releases provide multiple versions of the strain channel, including raw, narrowband-subtracted, broadband-subtracted, and glitch-gated data. These are accessible via OSDF and NDS2 interfaces, with AR (Analysis Ready) tags indicating segments suitable for analysis.

Auxiliary Channels and Instrumental Monitoring

Approximately 200,000 auxiliary channels per instrument are recorded, with a curated subset released for noise subtraction and data-quality flagging. These channels include environmental sensors and instrumental diagnostics. Auxiliary data is available for all ANALYSIS_READY times, with documentation specifying channel names and sampling rates. Additional releases support machine learning studies and noise-subtraction research.

Event Portal and Analysis Products

The GWOSC Event Portal provides a database of published GW transients, including strain data, segment lists, detection confidence, source parameters, and posterior samples. The portal supports HTML browsing and REST API queries, with a Python client for automated access.

Figure 3: Event Query Form interface for custom selection of GW events based on multiple attributes.
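A query against the portal ultimately returns structured records that can be filtered on event attributes. The sketch below filters a JSON payload by signal-to-noise ratio without making any network request. The payload shape, the field names (`commonName`, `network_matched_filter_snr`), and all values are placeholders assumed for illustration, not real events or a confirmed schema.

```python
# Filter event records from a JSON payload by a minimum SNR.
# The payload layout, field names, and values are assumed placeholders.
import json

SAMPLE_PAYLOAD = json.dumps({
    "events": {
        "EVENT_A-v1": {"commonName": "EVENT_A", "network_matched_filter_snr": 24.0},
        "EVENT_B-v1": {"commonName": "EVENT_B", "network_matched_filter_snr": 9.5},
        "EVENT_C-v1": {"commonName": "EVENT_C", "network_matched_filter_snr": None},
    }
})


def select_events(payload_text, min_snr):
    """Return sorted event names whose SNR is present and at least min_snr."""
    events = json.loads(payload_text)["events"]
    selected = []
    for record in events.values():
        snr = record.get("network_matched_filter_snr")
        if snr is not None and snr >= min_snr:
            selected.append(record["commonName"])
    return sorted(selected)


if __name__ == "__main__":
    print(select_events(SAMPLE_PAYLOAD, min_snr=10.0))
```

The same filtering logic would apply to a payload obtained from the REST API or the Python client, once the actual field names are confirmed against the portal documentation.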

Parameter-estimation results are provided as credible intervals, with standardized naming conventions. Supplemental data releases include posterior samples, localizations, and versioned snapshots on Zenodo. Community catalogs from external authors are ingested using a standardized JSON schema, expanding the scope of available events.
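A credible interval of this kind can be derived from posterior samples as a median with lower and upper offsets. The sketch below assumes a symmetric 90% interval computed from linearly interpolated quantiles; the interval level, the output keys, and the function name are assumptions, not the catalog's documented convention.

```python
# Summarize posterior samples as a median with credible-interval offsets.
# The 90% symmetric interval and the output key names are assumptions.


def credible_summary(samples, level=0.90):
    """Return median plus signed offsets to the interval edges."""
    if not samples:
        raise ValueError("at least one posterior sample is required")
    ordered = sorted(samples)

    def quantile(q):
        # Linear interpolation between the two nearest order statistics.
        position = q * (len(ordered) - 1)
        low = int(position)
        high = min(low + 1, len(ordered) - 1)
        fraction = position - low
        return ordered[low] * (1.0 - fraction) + ordered[high] * fraction

    tail = (1.0 - level) / 2.0
    median = quantile(0.5)
    return {
        "median": median,
        "lower": quantile(tail) - median,        # negative offset
        "upper": quantile(1.0 - tail) - median,  # positive offset
    }


if __name__ == "__main__":
    print(credible_summary(list(range(101))))
```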

Implications and Future Directions

The public release of O4a data, with comprehensive calibration, data quality, and auxiliary information, enables reproducible GW science and facilitates cross-disciplinary research. The dataset supports advanced noise modeling, machine learning applications, and multi-messenger astrophysics. The inclusion of near-real-time calibration uncertainties and multiple strain channel versions enhances the reliability of parameter estimation and event validation.

The release strategy, with planned future datasets from O4b and O4c, will further increase the volume and diversity of GW events available for analysis. The infrastructure and data standards established here set a precedent for open data practices in GW astronomy, supporting both methodological innovation and broad community engagement.

Conclusion

This paper provides a detailed account of the open data products from LIGO, Virgo, and KAGRA for O4a, including calibration procedures, data quality management, file structures, and event cataloging. The release maximizes scientific utility and transparency, supporting robust GW analyses and fostering future developments in GW data science. The forthcoming releases from subsequent observing segments will continue to expand the scope and impact of open GW data.
