Hostname: page-component-cd9895bd7-q99xh Total loading time: 0 Render date: 2024-12-18T15:25:48.030Z Has data issue: false hasContentIssue false

Identification of Interesting Objects in Large Spectral Surveys Using Highly Parallelized Machine Learning

Published online by Cambridge University Press:  30 May 2017

Petr Škoda
Affiliation:
Astronomical Institute of the Czech Academy of Sciences, Fričova 298, 251 65 Ondřejov, Czech Republic email: [email protected]
Andrej Palička
Affiliation:
Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague 6, Czech Republic
Jakub Koza
Affiliation:
Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague 6, Czech Republic
Ksenia Shakurova
Affiliation:
Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague 6, Czech Republic
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

The current archives of LAMOST multi-object spectrograph contain millions of fully reduced spectra, from which the automatic pipelines have produced catalogues of many parameters of individual objects, including their approximate spectral classification. This is, however, mostly based on the global shape of the whole spectrum and on integral properties of spectra in given bandpasses, namely presence and equivalent width of prominent spectral lines, while for identification of some interesting object types (e.g. Be stars or quasars) the detailed shape of only a few lines is crucial. Here the machine learning is bringing a new methodology capable of improving the reliability of classification of such objects even in boundary cases.

We present results of Spark-based semi-supervised machine learning of LAMOST spectra attempting to automatically identify the single and double-peak emission of Hα line typical for Be and B[e] stars. The labelled sample was obtained from archive of 2m Perek telescope at Ondřejov observatory. A simple physical model of spectrograph resolution was used in domain adaptation to LAMOST training domain. The resulting list of candidates contains dozens of Be stars (some are likely yet unknown), but also a bunch of interesting objects resembling spectra of quasars and even blazars, as well as many instrumental artefacts. The verification of a nature of interesting candidates benefited considerably from cross-matching and visualisation in the Virtual Observatory environment.

Type
Contributed Papers
Copyright
Copyright © International Astronomical Union 2017 

References

Chapelle, O., Schölkopf, B., & Zien, A. 2006, Semi-supervised learning, MIT press, Cambridge, Massachusetts Google Scholar
Cui, X. Q., et al. 2012, Research in Astronomy and Astrophysics, 12, 1197 Google Scholar
Luo, A. L., et al. 2015, Research in Astronomy and Astrophysics, 15, 1095 CrossRefGoogle Scholar
Nandrekar-Heinis, D., Michel, L., Louys, M., & Bonnarel, F. 2014, Astronomy and Computing, 7, 37 Google Scholar
Palička, A. 2016, Master Thesis, Czech Technical University in Prague, Faculty of ITGoogle Scholar
Porter, J. M. & Rivinius, T. 2003, PASP, 115, 1153 CrossRefGoogle Scholar
Shakurova, K. 2016, Master Thesis, Czech Technical University in Prague, Faculty of ITGoogle Scholar
Silaj, J., Jones, C. E., Tycner, C., Sigut, T. A. A., & Smith, A. D. 2010, ApJS, 187, 228 Google Scholar
Tody, D. et al. 2012, IVOA Recommendation: Simple Spectral Access Protocol Version 1.1, ArXiv:1203.5725Google Scholar
Zickgraf, F.-J. 2003, A&A, 408, 257 Google Scholar