Javascript must be enabled to continue!
Machine-learning Regression of Stellar Effective Temperatures in the Second Gaia Data Release
View through CrossRef
Abstract
This paper reports on the application of the supervised machine-learning algorithm to the stellar effective temperature regression for the second Gaia data release, based on the combination of the stars in four spectroscopic surveys: the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, Sloan Extension for Galactic Understanding and Exploration, the Apache Point Observatory Galactic Evolution Experiment, and the Radial Velocity Extension. This combination, of about four million stars, enables us to construct one of the largest training samples for the regression and further predict reliable stellar temperatures with a rms error of 191 K. This result is more precise than that given by the Gaia second data release that is based on about sixty thousands stars. After a series of data cleaning processes, the input features that feed the regressor are carefully selected from the Gaia parameters, including the colors, the 3D position, and the proper motion. These Gaia parameters are used to predict effective temperatures for 132,739,323 valid stars in the second Gaia data release. We also present a new method for blind tests and a test for external regression without additional data. The machine-learning algorithm fed with the parameters only in one catalog provides us with an effective approach to maximize the sample size for prediction, and this methodology has a wide application prospect in future studies of astrophysics.
American Astronomical Society
Title: Machine-learning Regression of Stellar Effective Temperatures in the Second Gaia Data Release
Description:
Abstract
This paper reports on the application of the supervised machine-learning algorithm to the stellar effective temperature regression for the second Gaia data release, based on the combination of the stars in four spectroscopic surveys: the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, Sloan Extension for Galactic Understanding and Exploration, the Apache Point Observatory Galactic Evolution Experiment, and the Radial Velocity Extension.
This combination, of about four million stars, enables us to construct one of the largest training samples for the regression and further predict reliable stellar temperatures with a rms error of 191 K.
This result is more precise than that given by the Gaia second data release that is based on about sixty thousands stars.
After a series of data cleaning processes, the input features that feed the regressor are carefully selected from the Gaia parameters, including the colors, the 3D position, and the proper motion.
These Gaia parameters are used to predict effective temperatures for 132,739,323 valid stars in the second Gaia data release.
We also present a new method for blind tests and a test for external regression without additional data.
The machine-learning algorithm fed with the parameters only in one catalog provides us with an effective approach to maximize the sample size for prediction, and this methodology has a wide application prospect in future studies of astrophysics.
Related Results
Stellar occultations by Near Earth Asteroids: challenges and results
Stellar occultations by Near Earth Asteroids: challenges and results
The observation of stellar occultation by asteroids is an intrinsically challenging activity in the case of Near Earth Objects, that produce very short events on narrow occultation...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
paired: A Statistical Framework for Detecting Stellar Binarity with Gaia RVs. I. Sensitivity to Unresolved Binaries
paired: A Statistical Framework for Detecting Stellar Binarity with Gaia RVs. I. Sensitivity to Unresolved Binaries
Abstract
Data Release 3 (DR3) from the Gaia Mission includes radial velocity measurements of over 33 million targets. Among many scientific applications, the overlap...
Asteroids' satellites in Gaia astrometric data
Asteroids' satellites in Gaia astrometric data
It is known to the astronomical community that asteroids with satellites are not uncommon in the Solar System. So far we have more than 500 documented asteroid systems encompassing...
Parameters of 220 million stars from Gaia BP/RP spectra
Parameters of 220 million stars from Gaia BP/RP spectra
ABSTRACT
We develop, validate and apply a forward model to estimate stellar atmospheric parameters (Teff, log g, and [Fe/H]), revised distances and extinctions for 2...
Performance of occultation astrometry with Gaia DR2
Performance of occultation astrometry with Gaia DR2
Stellar occultations provide an instantaneous relative astrometric measurement, as the position of the asteroid center of mass, at the mid-time of the event, is very close to the t...
One star, two star, red star, blue star: an updated planetary nebula central star distance catalogue from Gaia EDR3
One star, two star, red star, blue star: an updated planetary nebula central star distance catalogue from Gaia EDR3
Context. Planetary nebulae (PNe) are a brief but important phase of stellar evolution. The study of Galactic PNe has historically been hampered by uncertain distances, but the para...
Impact of stellar evolution on planetary habitability
Impact of stellar evolution on planetary habitability
With the ever growing number of detected and confirmed exoplanets, the probability to find a planet that looks like the Earth increases continuously. While it is clear that being i...

