Galaxy Zoo recently migrated onto Zooniverse’s new architecture, as announced on our blog. For details, see Zooniverse's blog post here.
All of the data which is currently publicly available from any phase of Galaxy Zoo is available on our Data Archive site.
Galaxy Zoo is now arguably the world’s best-known online citizen science project, and is certainly the one with the largest number of publications based on citizen scientists input. Our success inspired the creation of The Zooniverse, hosting project using the same technique across many research areas.
It all started back in July 2007, with a data set made up of a million galaxies imaged by the Sloan Digital Sky Survey, who still provide some of the images in the site today. With so many galaxies, we assumed it would take years for visitors to the site to work through them all, but within 24 hours of launch we were stunned to be receiving almost 70,000 classifications an hour. In the end, more than 50 million classifications were received by the project during its first year, contributed by more than 150,000 people.
That meant that many different participants saw each galaxy. This is deliberate; having multiple independent classifications of the same object is important, as it allows us to assess how reliable our results are. For example, for projects where we may only need a few thousand galaxies but want to be sure they're all spirals before using up valuable telescope time on them, there's no problem — we can just use those that 100% of classifiers agree are spiral. For other projects, we may need to look at the properties of hundreds of thousands of galaxies, and can use those that a majority say are spiral.
The task in that first Galaxy Zoo was slightly simpler than yours; all we asked volunteers to do was to split the galaxies into ellipticals, mergers and spirals and — if the galaxy was spiral — to record the direction of the arms. But it was enough to show that the classifications Galaxy Zoo provides were as good as those from professional astronomers, and were of use to a large number of researchers. You can find out what people have been up to with Galaxy Zoo data in our list of papers published and on the Galaxy Zoo blog, as well as below. Apart from the publications, another tangible sign of success is the fact that we’ve been granted time on some of the largest telescopes in the world to follow up on many Galaxy Zoo discoveries: the list currently includes the Isaac Newton and William Herschel Telescopes on the island of La Palma in the Canaries, Gemini South in Chile, the WIYN telescope on Kitt Peak, Arizona, the IRAM radio telescope in the Sierra Nevada of Spain, The Very Large Array in New Mexico, Swift, GALEX, Chandra, XMM-Newton and Suzaku up in space, and perhaps most excitingly the Hubble Space Telescope.
The second phase of Galaxy Zoo was inspired by our newfound confidence in the ability of you, our volunteer classifiers, and so we asked for a closer look at just over 200,000 of the brightest of the Sloan galaxies. We asked about the number of spiral arms, the size of the galaxies' bulges and much else besides. Once again, we were thrilled with the response (and a little more prepared for it than we were for Zoo 1!) and in the 14 months the site was up we received a little more than 60 million classifications.
Along the way, we added more detailed images from an area of the survey known as 'Stripe 82', but it was soon time to look at the more distant Universe. The site’s third incarnation, Galaxy Zoo: Hubble drew from surveys conducted by the Hubble Space Telescope to view earlier epochs of galaxy formation. In these surveys, which involve many, many days of dedicated observing time, we can see light from galaxies, which has taken billions of years to reach us. The idea behind Galaxy Zoo: Hubble was to be able to compare galaxies then to galaxies now, giving us a clear understanding of what factors influence their growth, whether through mergers, active black holes or simply star formation.
The Galaxy Zoo site relaunched with an updated design in September 2014. In this version we combined new imaging from Sloan, giving us our best ever view of the local Universe, with the most distant images yet from Hubble's CANDELS survey. The CANDELS survey makes use of the new Wide Field Camera 3 — installed during the final shuttle mission to Hubble — to take ultra-deep images of the Universe. Our first surprise in this data was that contrary to predictions, galactic bars are still found in the most distant galaxies. Stay tuned to see what else we’ll discover!
The relaunched site also allowed us more flexibility to include smaller sets of galaxies. We have included images taken with the United Kingdom Infrared Telescope (UKIRT), in Hawaii, for the recently completed UKIDSS project. UKIDSS is the largest, deepest survey of the sky at infrared wavelengths. The UKIDSS infrared-light pictures and the SDSS visible-light pictures are different for two reasons. Firstly, older stars are brighter in infrared wavelengths (while in visible-light we see younger stars), and secondly, infrared light, like a fog light, can penetrate the dust and smoke in the centres of galaxies. Examining the UKIDSS images in Galaxy Zoo will enable us to see how old and young stars are arranged differently in the galaxies, and will reveal more of the structure in the inner parts of galaxies.
The data being put into Galaxy Zoo can now change quickly, thanks to the dedicated work of volunteers and the "retirement" of images that have already been completed. One of the new sets of images to be classified comes from the Dark Energy Camera Legacy Survey (DECaLS), a survey which uses DECam, one of the most sensitive and widest-area cameras ever built and a key tool in our search to measure and understand dark energy. The camera, which is mounted on the 4-m Blanco telescope at CTIO in Chile, images the sky at similar wavelengths to SDSS, but with better resolution and with the ability to detect much fainter galaxies. These images are extending the reach of the science we began with Galaxy Zoo 2 and SDSS. Recently, the site also had Galaxy Zoo's first-ever images of fully simulated galaxies; these were images that were created entirely on a computer, using sophisticated simulations of the Universe starting just after the Big Bang and then watching as dark matter, gas, and stars evolve to form the galaxies we see today. The classifications of these images, which came from the Illustris project, are a vital test of the physics that go into these simulations.
Starting in March 2016 the site began including a new set of DECaLS images, this time taken from the survey's second data release. This new data release covers a larger portion of the sky than the first. Finally, we have included a set of "lost images" from SDSS that should have been included in the Galaxy Zoo 2 sample. These images fill in the hole that currently exists in the SDSS bright galaxy data set.
With increasing number of very distant galaxies entering Galaxy Zoo (like the Hubble and CANDELS sets), it was clear that the GZ team needed to find a way to measure and account for any dependence classifications might have on galaxy distance. So, in 2013 a set of 288 nearby SDSS galaxies were simulated using the FERENGI code to create images of these galaxies as viewed at varying distances and brightnesses. The classifications for these were used with great success in debiasing the Galaxy Zoo: Hubble catalog. To continue this effort, in 2016 a second, larger sample of 936 galaxies were simulated again with FERENGI to create 7,488 images of the same galaxies at eight viewing distances. The new FERENGI classifications will aid in the analysis of GZ Hubble data to be used in some exciting high-redshift science projects currently in prep!
At the start of 2017 we added in images from another recent project that builds upon the legacy of SDSS, the Galaxy And Mass Assembly Survey (GAMA). Specifically, we have included optical images from the Kilo-Degree Survey (KiDS) over the region of sky covered by GAMA's comprehensive multiwavelength dataset. The Galaxy Zoo classifications of these galaxies will help us study how the properties of galaxies are affected by their neighbours and the surrounding large scale structure.
For access to the ever-growing list of team-led publications based on Galaxy Zoo data, please take a look at our publications page. This includes results from multiple phases of the project, including the original Galaxy Zoo, Galaxy Zoo 2, Galaxy Zoo: Hubble, and Galaxy Zoo: CANDELS. Access to all of the Galaxy Zoo data products is available on our archive page, or through links in the published papers.