This project has been built using the Zooniverse Project Builder but is not yet an official Zooniverse project. Queries and issues relating to this project directed at the Zooniverse Team may not receive any response.

AI on Trial: How Well Do LLMs Classify Astronomical Images?

Help judge how well an LLM explains astronomical images.

Learn more
Get Started!
  1. Look at the three images
  2. Read the explanation given by the LLM
  3. Decide if the description is coherent with the images
  4. Decide if the "interest" level assigned by the LLM is coherent with its own description of the image (even if the description of the image is wrong, we want to know if the LLM is self-consistent)

Zooniverse Talk

Chat with the research team and other volunteers!

Join in

AI on Trial: How Well Do LLMs Classify Astronomical Images? Statistics

View more stats

All Time Stats

Volunteers0
Classifications0
Project not launched

Active Stats

Active stats provide information about currently active workflows and subjects.

0%
Percent complete
Classifications0
Subjects0
Completed subjects0

Message from the researcher

Connect with the research team on Talk to learn more about this project!

About AI on Trial: How Well Do LLMs Classify Astronomical Images?

For years, machines have learned from your input. Now, you’re the one testing them. You’ll receive three astronomical images—New, Reference, and Difference—and a prediction generated by an LLM. Judge how coherent and accurate that prediction really is.