search_query=cat:astro-ph.*+AND+lastUpdatedDate:[202606022000+TO+202606082000]&start=0&max_results=5000

New astro-ph.* submissions cross listed on cs.AI, cs.LG, physics.data-an, stat.* staritng 202606022000 and ending 202606082000

Feed last updated: 2026-06-08T08:43:25Z

A Model Selection Criterion for Multidimensional Gaussian Processes: Application to Radial Velocities

Authors: Barragán Oscar
Comments: Accepted for publication in MNRAS letters
Primary Category: astro-ph.IM
All Categories: astro-ph.IM, astro-ph.EP, astro-ph.SR, physics.data-an, stat.ME

Multidimensional Gaussian Process (multi-GP) regression is widely used to disentangle stellar and planetary signals in radial velocities (RVs) by jointly modelling ancillary activity indicators. However, identifying the combination of indicators that best constrains the stellar signal in the RVs is non-trivial, as classical model comparison methods are not directly applicable when multi-GPs involve different time series combinations. In this work, we present an information criterion to compare multi-GP models based on their ability to explain the RV component, $\mathrm{MGIC}_{\rm rv}$. This metric combines the conditional RV likelihood with an effective parameter count that accounts for the regularisation imposed by the multi-GP model on the RV component. We demonstrate that $\mathrm{MGIC}_{\rm rv}$ provides a quantitative and robust framework for multi-GP model comparison, identifying the activity indicators that most effectively constrain the RV signal. Although developed in the context of RV analysis, the proposed criterion is general and applicable to multi-GP problems in which the inference focuses on a specific observable.


Gaussian Process Latent Factor Regression for Low-Data, High-Dimensional Output Problems

Authors: Edward T. Stevenson, Eric T. Wolf, Mei Ting Mak, N. J. Mayne, Miles Cranmer
Comments: 9 pages content + 22 pages appendix/references. Supporting code at https://github.com/edstevenson/GPLFR
Primary Category: cs.LG
All Categories: cs.LG, astro-ph.EP, astro-ph.IM, stat.ML

In the sciences, regression tasks often require predicting high-dimensional outputs from few training examples. Multi-output Gaussian processes excel in low-data regimes but typically struggle with high-dimensional outputs. Compress-then-predict pipelines such as PCA-GP (principal component analysis plus Gaussian process regression) handle high dimensionality, but rely on bases optimized for reconstruction rather than prediction. To address this gap, we propose a model that represents each output as a linear-Gaussian decoding of a low-dimensional latent state drawn from a Gaussian process prior. By analytically marginalizing the decoder weights, we couple compression and prediction in a single objective that scales to high-dimensional outputs. We refer to this model as Gaussian process latent factor regression (GPLFR). We demonstrate GPLFR by building the first spatially resolved emulator of global climate models for rocky exoplanets.


Identifying Gems from Roman RAPIDly

Authors: Karan Gandhi, Ashish A. Mahabal, Jacob E. Jencson, Russ R. Laher, Ben Rusholme, Lin Yan, Ryan M. Lau, Schuyler D. Van Dyk, Mansi M. Kasliwal
Comments: 15 pages, 10 figures, Submitted to the Publications of the Astronomical Society of the Pacific
Primary Category: cs.LG
All Categories: cs.LG, astro-ph.IM, cs.CV, stat.ML

The Nancy Grace Roman Space Telescope (Roman), set for launch as early as September 2026, will conduct wide-field infrared imaging surveys with unprecedented spatial resolution and cadence, enabling the discovery of millions of astronomical transients. Hence, it is necessary to have automated pipelines for generating alerts in place so that the telescope can begin discovering reliable transients and variable objects soon after it is launched. However, no real Roman data currently exist, making the development of such pipelines difficult. In this work, we present a machine learning model $RuBR$ and a general methodology for distinguishing genuine transient and variable detections from spurious (bogus) detections within the RAPID pipeline. In particular, we present three models using this methodology: $RuBR_{comb}$ trained and tested on combined locally injected and OpenUniverse2024 transients, $RuBR_{loc}$ trained on locally injected transients and tested on OpenUniverse2024 transients, and $RuBR_{DA}$ that combines locally injected transients with a fraction of OpenUniverse2024 transients in domain-adaptation mode for training. This paves the way for strategies to adapt the $RuBR_{comb}$ model to real observations in the absence of any ground-truth labels during the early phases of the Roman mission. While the image differencing pipeline continues to be improved, our experimental results demonstrate the effectiveness of the proposed approach and its promise for robust real-bogus classification in the Roman era.