This list is obviously a work in progress. There are hundreds of fascinating research papers analyzing and mining survey and catalog data, some of the early work dating back more than 10 years. I hope to summarize the best of them in the months to come. -jnr
Books
Chaisson and McMillan, 2007. Astronomy Today. [An outstanding textbook on general astronomy! -jnr]
Fayyad, Grinstein, and Wierse, 2002. Information Visualization in Data Mining and Knowledge Discovery.
Mitchell, 1997. Machine Learning.
Witten and Frank, 2005. Data Mining: Practical Machine Learning Tools and Techniques, 2nd Ed.
Nisbet, Elder, and Miner, 2009. Handbook of Statistical Analysis & Data Mining Applications.
Tan, Steinbach, and Kumar, 2006. Introduction to Data Mining.
Papers
(freely available at arXiv.org)
Accomazzi, A. (2010). “Astronomy 3.0 Style.”
Bailey, S., C. Aragon, et al. (2007). “How to Find More Supernovae with Less Work: Object Classification Techniques for Difference Imaging.”
Ball, N., R. Brunner, et al. (2008). “Robust Machine Learning Applied to Terascale Astronomical Datasets.”
Bamford, S., R. Nichol, et al. (2008). “Galaxy Zoo: the independence of morphology and colour.”
Becla, J., A. Hanushevsky, et al. (2006). “Designing a Multi-petabyte Database for LSST.”
Borne, K. (2000). “Data Mining in Astronomical Databases.”
Borne, K. (2000). “Science User Scenarios for a Virtual Observatory Design Reference Mission: Science Requirements for Data Mining.”
Borne, K. (2009). “Scientific Data Mining in Astronomy.” [Highly recommended summary of the informatics challenges facing astronomers in this new age of survey astronomy.]
Brunner, R., T. Prince, et al. (2000). “The Digital Sky Project: Prototyping Virtual Observatory Technologies.”
Brunner, R. (2001). “Panchromatic Mining for Quasars: An NVO Keystone Science Application.”
Brunner, R., G. Djorgovski, et al. (2001). “Massive Datasets in Astronomy.”
Budavari, T. and A. Szalay. (2008). “Probabilistic Cross-Identification of Astronomical Sources.”
Cantu-Paz, E. and C. Kamath (2000). “Combining evolutionary algorithms with oblique decision trees to detect bent-double galaxies.” Applications and Science of Neural Networks, Fuzzy Systems, and Evolutionary Computation III.
Djorgovski, G. and R. Brunner (2000). “Digital Sky Surveys: Software Tools and Technologies.”
Djorgovski, G., A. Mahabal, et al. (2001). “Searches for Rare and New Types of Objects.” Virtual Observatories of the Future, ASP Conference Series, Brunner, Djorgovski and Szalay, eds. 225.
Djorgovski, S. G., C. Baltay, et al. (2008). “The Palomar-Quest Digital Synoptic Sky Survey.”
Djorgovski, S. G., R. Brunner, et al. (2002). “Challenges for Cluster Analysis in a Virtual Observatory.”
Djorgovski, S. G., R. J. Brunner, et al. (2000). “Exploration of Large Digital Sky Surveys.”
Djorgovski, S. G., C. Donalek, et al. (2006). “Some Pattern Recognition Challenges in Data-Intensive Astronomy.”
Djorgovski, S. G., R. R. Gal, et al. (1998). “The Palomar Digital Sky Survey (DPOSS).”
Eyer, L. (2004). “Variability Analysis: Detection and Classification.”
Fodor, Cantu-Paz, et al. (2000). “Finding Bent-Double Radio Galaxies: A Case Study in Data Mining.” Computing Science and Statistics 33.
Gray, J., A. Szalay, et al. (2002). “Data Mining the SDSS SkyServer Database.”
Ivezic, Z., J. A. Tyson, et al. (2008). “LSST: from Science Drivers to Reference Design and Anticipated Data Products.”
Lamer, G., M. Hoeft, et al. (2008). “2XMM J083026+524133: The most X-ray luminous cluster at redshift 1.”
Land et al. (2008). “Galaxy Zoo: The large-scale spin statistics of spiral galaxies in the Sloan Digital Sky Survey.”
Lawrence, A. (2007). “Wide Field Surveys and Astronomical Discovery Space.”
Longo, M. (2007). “Does the Universe Have a Handedness?”
Mahabal, A. A., S. G. Djorgovski, et al. (2008). “Automated Probabilistic Classification of Transients and Variables.”
Mieske, S. and H. Jerjen (2007). “Near-field cosmology with the VLT.”
Szalay, A., J. Gray, et al. (2002). “The SDSS SkyServer – Public Access to the Sloan Digital Sky Server Data.” ACM SIGMOD 2002 Proceedings.
Szalay, A., J. Gray, et al. (2002). “Petabyte Scale Data Mining: Dream or Reality?”
Szalay, A. S. and R. J. Brunner (1998). “Astronomical Archives of the Future: A Virtual Observatory.”
Tagliaferri, R., G. Longo, et al. (2003). “Neural networks in astronomy.” Neural Networks 16(3-4): 297-319.
Walker, A. (2003). “The Large Synoptic Survey Telescope (LSST) and its Impact on Variable Star Research.”
Watson, M. G., A. C. Schröder, et al. (2008). “The XMM-Newton Serendipitous Survey. VI. The Second XMM-Newton Serendipitous Source Catalogue.”