Volara Data Validation
The Volara validation process is open sourced and can be freely browsed and forked here (opens in a new tab).
Authenticity
Volara data's authenticity is validated through a random sample of tweets from the submitted dataset.
Randomly sampled tweets are selected by the TEE validator and verified to match the X API response within the enclave.
Once a statistically significant random sample has been validated, tweets are persisted to the Volara network.
Ownership
Tweets submitted to Volara include the originating user's X handle.
This handle is cross-validated against the Volara oracle to attribute tweets indexed on the Volara network to their respective owners.
This is tweet attribution referred to as the "ownership score".
Quality
Volara currently considers all tweets as equal quality data points.
In the future, this may be weighted by factors such as the tweet's engagement rate, follower count, text quality, etc.
Uniqueness
The Volara index collaborates with the TEE validator to detect unique tweets that have not been integrated into the Volara network.
This collaborative mechanism allows Volara to ensure that the data it indexes is not redundant and uniquely rewarded.