Athena Methodology

How Athena evaluates credibility. Scores reflect observed behavior, not opinions.

Version 1.0.0 • Last updated: June 2025

6
Athena Dimensions
100+
Domain-Specific Signals
v1.1.0
Score Version
What Athena Measures

Athena Index exists to document credibility in public claims.

Across markets, politics, technology, and media, influence often travels faster than accountability.

Athena records what was claimed, what occurred, and how behavior evolved - transparently, historically, and without persuasion.

Credibility here is not a verdict.
It is a living record.

What credibility means:

  • Consistency - Behavior that remains stable over time
  • Transparency - Open disclosure of conflicts and limitations
  • Accountability - Willingness to acknowledge errors and revise positions
  • Explainability - Ability to articulate reasoning and evidence
  • Alignment with evidence - Claims that correspond to observable outcomes

Athena does not determine who should be trusted. It provides the record so others can decide.

The 6 Credibility Pillars

Every credibility score is built on these six pillars - stable across all domains. The underlying factors vary depending on subject area (crypto, finance, politics) and data availability, but the pillars remain constant.

Same Pillars, Different Signals

The six pillars adapt to each domain without changing their core meaning. Here's how the same behavioral framework applies across different fields:

PillarCryptoStocks/FinancePolitics
Outcome AlignmentPrice targets, market callsEarnings forecasts, sector picksPolicy outcomes, election results
CalibrationConfidence vs. ROI accuracyGuidance accuracy, risk ratingsCertainty vs. outcomes
ConsistencyThesis stability, flip-flopsNarrative stability, revisionsPosition consistency over time
SpecificityTargets, timeframes, conditionsData citations, quantified claimsPolicy specificity, measurability
TransparencyWallets, sponsors, affiliationsHoldings, conflicts of interestFunding, affiliations, donors
AccountabilityCorrections, error acknowledgmentRetractions, methodology updatesVoting record changes, flip-flops

Note: Athena currently evaluates crypto influencers. Finance and politics are planned expansions using this same pillar framework.

Each pillar is weighted by importance in overall scoring:

1. Outcome Alignment (22%)Highest Weight

Do their claims match reality?

Measures how often predictions align with actual market outcomes. We check:

  • Direction accuracy (50%): Did the price go up/down as predicted?
  • Target accuracy (30%): Did it reach the specific price target?
  • Timeframe accuracy (20%): Did it happen in the predicted window?
✓ Good Example:

"Bitcoin will reach $50K by end of Q1" → BTC hits $51K on March 28th

✗ Poor Example:

"Ethereum going to $10K this month" → ETH drops 15% instead

2. Calibration (13%)Lowest Weight

Does their confidence match their accuracy?

Well-calibrated influencers are highly confident when they're right, less confident when uncertain. We penalize overconfidence--making bold claims that don't pan out.

✓ Good Example:

High confidence predictions: 85% accuracy • Medium confidence: 60% accuracy

✗ Poor Example:

"I'm 100% certain SOL will 10x" → SOL drops 30% (overconfidence penalty)

3. Consistency (15%)Medium Weight

Do they maintain stable positions over time?

Detects flip-flopping--making opposite calls on the same asset within short timeframes without acknowledging the change. Consistency builds trust.

✓ Good Example:

Maintains bullish stance on BTC for 6 months, even during dips

✗ Poor Example:

"BTC to $100K!" (Monday) → "BTC will crash, sell now!" (Friday) → No explanation

4. Specificity Quality (20%)High Weight

Are their claims precise and falsifiable?

Vague claims like "crypto will moon soon" are unfalsifiable. We reward specific targets and timeframes that can be objectively verified.

✓ Good Example:

"ETH will reach $4,200 by March 31, 2025"

✗ Poor Example:

"Altcoins looking good, major moves coming" (no asset, target, or timeframe)

5. Transparency (15%)Medium Weight

Do they disclose conflicts of interest?

Measures disclosure of sponsorships, paid promotions, and asset holdings. Transparency prevents pump-and-dump schemes and builds audience trust.

✓ Good Example:

"Full disclosure: I hold SOL and was paid by Solana Foundation for this video"

✗ Poor Example:

Promotes token heavily, later revealed they dumped before audience could buy

6. Accountability (15%)Medium Weight

Do they acknowledge mistakes and update positions?

Great influencers admit when they were wrong and explain why their thesis changed. This shows intellectual honesty and builds long-term trust.

✓ Good Example:

"My BTC prediction was wrong. Here's what I missed: [detailed explanation]"

✗ Poor Example:

Never mentions past failed predictions; deletes old videos with wrong calls

How We Calculate Overall Scores
Formula:
Score = (Outcome × 22%) + (Calibration × 13%) + (Consistency × 15%) +
(Specificity × 20%) + (Transparency × 15%) + (Accountability × 15%)

Each pillar is scored 0-1, then weighted and combined into a final 0-100 score. The weights reflect importance: track record (outcome) and specificity matter most, with balanced emphasis on transparency, accountability, and consistency.

Confidence Levels:

  • HIGH: 50+ evaluated claims (statistically significant)
  • MEDIUM: 20-49 claims (meaningful sample)
  • LOW: 5-19 claims (preliminary assessment)
  • MINIMAL: <5 claims (insufficient data)

Note on Methodology: Some integrity and manipulation-resistance signals are intentionally not disclosed to preserve system fairness and prevent gaming. What matters is whether the behavioral pillars are sound and the results are verifiable.

Gaming Resistance

Our scoring system is designed to prevent manipulation:

  • No single factor dominates: Even perfect outcome alignment only gets you to 22/100. You need balance across all dimensions.
  • Verifiable outcomes: We check claims against real market data, not sentiment or likes.
  • Time-weighted: Recent behavior matters more, but flip-flopping is penalized.
  • Transparency required: Hidden affiliations hurt your score even if predictions are good.
  • Accountability matters: Deleting failed predictions or ignoring mistakes reduces trust.

Bottom line: The only way to consistently maintain a high score is to make accurate, specific, well-calibrated claims while being transparent about conflicts and accountable for mistakes.

Trust Badges

Badges describe what influencers do, not what score they have. They're earned through consistent behavior and can be lost if performance drops.

🔍 Discloses Conflicts Consistently

Earned by: Transparency score >0.8
Means: Consistently discloses sponsorships, holdings, and conflicts of interest

📊 Consistent Track Record (30+ claims)

Earned by: Consistency >0.75 + 30+ evaluated claims
Means: Maintains stable positions across market cycles, doesn't flip-flop

✅ Demonstrated Accuracy (50+ claims)

Earned by: Outcome alignment >0.7 + 50+ evaluated claims
Means: Proven accuracy over statistically significant sample size

🔄 Admits & Corrects Errors

Earned by: Accountability >0.7
Means: Acknowledges mistakes publicly and updates positions when wrong

🎯 Confidence Matches Reality

Earned by: Calibration >0.75
Means: When confident, they're right; when uncertain, they say so

Questions or Concerns?

We're committed to transparency and fairness. If you:

  • Believe your score is incorrect
  • Have evidence of misclassified predictions
  • Want to understand specific driver explanations
  • See suspicious scores or gaming attempts

Athena Index is credibility infrastructure for the internet - starting where claims are measurable, expanding where trust matters.

Formal Dispute & Correction Process

We have a transparent 4-step process for challenging scores and correcting factual errors. All corrections are logged and visible.

View Dispute Process