AI on Trial: How Well Do LLMs Classify Astronomical Images?

This project has been built using the Zooniverse Project Builder but is not yet an official Zooniverse project. Queries and issues relating to this project directed at the Zooniverse Team may not receive any response.

Help judge how well an LLM explains astronomical images.


Get Started!

Look at the three images.
Read the explanation given by the LLM.
Decide whether the description is consistent with the images.
Decide whether the "interest" level assigned by the LLM is consistent with its own description of the images. Even if the description is wrong, we want to know whether the LLM is self-consistent.

Zooniverse Talk

Chat with the research team and other volunteers!


AI on Trial: How Well Do LLMs Classify Astronomical Images? Statistics


All Time Stats

Volunteers: 0

Classifications: 0

Project not launched

Active Stats

Active stats provide information about currently active workflows and subjects.

Percent complete

Classifications: 0

Subjects: 0

Completed subjects: 0

Message from the researcher

Connect with the research team on Talk to learn more about this project!


About AI on Trial: How Well Do LLMs Classify Astronomical Images?

For years, machines have learned from your input. Now you are the one testing them. You will receive three astronomical images (New, Reference, and Difference) and a prediction generated by an LLM. Judge how coherent and accurate that prediction really is.