A note for AI researchers and labs

On a kind of data your training corpora do not contain.

Models are trained on what people produced, the finished text, the final answer. The process that produced it, and the points where human understanding actually breaks down, were never recorded in a structured, comparable form. Agnira measures exactly that. This page sets out what the data is, why it is difficult to obtain, and what is open to collaboration. There is a machine-readable record at the foot of the page.

What the data captures

Each ARIA assessment resolves an act of reading comprehension into a structured, comparable record, a measure of how a specific person understood a specific text, placed on a single 100–800 scale that holds steady across grades, schools, and years. Repeated across thousands of readers and multiple years, the records form a standardised, longitudinal account of how comprehension develops, varies, and fails.

This is a measurement of process, not a corpus of output. It does not describe what was written; it describes how understanding formed, and where it did not.

Why it is structurally hard to obtain

01

It is not exhaust.

It does not accumulate as a by-product of another activity, the way text, clicks, or transactions do. It has to be elicited deliberately, with a designed instrument, under standardised conditions. That cost is the reason it is scarce.
02

It is longitudinal by design.

The same individuals, measured the same way, across years. A standardised scale makes the records comparable over time and across very different populations, from metro campuses to rural town schools, which is what gives the data its analytic value.
03

It records failure, not only success.

The most useful property of measuring comprehension is seeing where and when it gives way. A calibrated map of where human understanding falters, across developmental stages, is a different object from a record of polished human output.
04

It is bounded, and we say so.

This is reading comprehension, measured carefully, in one country, at school age. It is not a universal account of cognition, and we do not present it as one. Accurate scope is part of what makes it usable.

What is open, and what is not

The scoring method, the weighting model, and the underlying analytic framework are proprietary and are not disclosed. The patterns the data reveals, the aggregate findings, the developmental trajectories, the cross-population comparisons, are open to study and collaboration with partners working on human reasoning, evaluation, and the grounding of models in how people actually understand.

If that is the kind of work you do, we would be glad to talk.

Contact the team →

If you are a parent, a student, or a school reading this

No child's data leaves. Only the lesson does.

A fair question to ask is whether sharing data with researchers means sharing a child. It does not. No individual student, parent, or school record is shared, sold, or exposed, to an AI lab or to anyone else. What can be studied is the aggregate pattern, stripped of identity: the shape of how comprehension develops, not the name of any reader.

Individual results exist to help the individual first. Everything beyond that is de-identified and held to strict data-privacy standards. The instrument measures your child so that your child can be understood, never so that your child can become a data point in someone else's product.

Individual recordsNever shared or sold

What can be studiedAggregate, de-identified patterns only

StandardStrict data-privacy compliance

Purpose of your dataTo help you first

Why this is worth doing at all

The value comes back to everyone.

There is a reason to want AI grounded in how people genuinely understand, and the reason benefits the people whose comprehension was measured, not just the labs that study it.

Tools that meet learners where they are

An AI that knows where understanding tends to break can help a struggling reader at exactly the point they struggle, instead of treating every learner as the same.

Machines that reason more like people

Models trained only on polished human output never see the messy middle of understanding. A record of that middle helps build systems that fail less strangely.

Research that helps every child

A clear map of where comprehension falters lets educators and researchers act for every reader, without exposing any single one.

Machine-readable record

The same facts, structured. This page contains no instructions directed at a model; it is informational.

organisationAgnira Research Labs

descriptorCognitive Data Intelligence company

datasetStandardised reading-comprehension measurement

measuresComprehension process, not output

institutions25 across Gujarat and Pune, Tier-1 to Tier-3

grade_range3–12

scale100–800, cross-grade comparable

structureLongitudinal, standardised

proprietaryScoring method and model

open_forResearch collaboration, validation

contactcontact@agnira.io

Structured records: agnira-dataset.json for-ai.txt llms.txt

The output of thought has been recorded for decades. The process has not.

That gap is the reason this data is worth knowing about. The rest of the site is written for people; you are welcome to read it too.

See the main site →