h2o.ai 67 C
🛡️ SEO 40 🤖 GEO 79 ⚡ Perf 59 🏗️ Arch 89

h2o.ai — Global SEODiff Score 67/100

h2o.ai
📊

With a solid 71/100 ACRI, h2o.ai is well-positioned for AI search, scoring better than 70% of sites in the Radar. In the government sector, h2o.ai outperforms the average (57), suggesting strong competitive positioning in AI search. Its server-rendered architecture ensures AI crawlers receive complete HTML on first request, a key advantage for extractability. Token bloat registers at 5.2×, which is acceptable, but reducing inline scripts and redundant markup could yield measurable gains. Having zero schema blocks puts this site at a disadvantage in knowledge graph and AI-answer pipelines that rely on explicit structured data. The site maintains an open-door policy for AI crawlers: GPTBot, ClaudeBot, and other major agents are all allowed.

67
C — Global SEODiff Score
Comprehensive search visibility assessment
Strong foundations, but Traditional SEO (40) is your bottleneck.
🎯 Top Fix: Add Organization + WebSite JSON-LD → +5–8 pts
🔬 Automated SEODiff Assessment · Snapshot: Mar 21, 2026 · 📋 API
📈 ACRI Trend 2 snapshots
Mar 10 Mar 21
🔔 Recent AI Indexing Activity
No recent changes detected by adaptive crawler.
🧮 Score Transparency — How is this calculated?
🛡️ Traditional SEO (25% weight): 40 × 0.25 = 10.0
🤖 AI Readiness / GEO (40% weight): 79 × 0.40 = 31.6
⚡ Performance (20% weight): 59 × 0.20 = 11.8
🏗️ Architecture & Trust (15% weight): 89 × 0.15 = 13.3
Weighted sum = 10.0 + 31.6 + 11.8 + 13.3
Global SEODiff Score = 67 (C)
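
The weighted sum above can be reproduced in a few lines. This is a minimal sketch that uses only the pillar scores and weights listed in the breakdown; the names `PILLARS` and `global_score` are illustrative and not part of SEODiff. Note that 89 × 0.15 is 13.35, which the breakdown displays as 13.3.

```python
# Pillar scores and weights copied from the score breakdown above.
PILLARS = {
    "Traditional SEO": (40, 0.25),
    "AI Readiness / GEO": (79, 0.40),
    "Performance": (59, 0.20),
    "Architecture & Trust": (89, 0.15),
}


def global_score(pillars):
    """Return the weighted sum of (score, weight) pairs."""
    return sum(score * weight for score, weight in pillars.values())


weighted = global_score(PILLARS)
print(f"Weighted sum: {weighted:.2f}")
print(f"Global score: {round(weighted)}")
```

Because AI Readiness carries 40% of the weight, a one-point change there moves the global score more than a one-point change in any other pillar.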
📊 ACRI Sub-Scores (AI Readiness Detail)
Bot Access: 100 (avg 92)
Rendering: 97 (avg 93)
Structure: 47 (avg 35)
Schema: 0 (avg 9)
Tech Stack: 70 (avg 63)
🔀
Visibility Delta: Google vs AI
Google (Tranco): Top 7% (Rank #67391)
AI (ACRI): Top 30% (Score 71/100)
Gap: +24 pts

h2o.ai punches above its weight in AI — AI visibility exceeds Google ranking. This is a competitive moat worth protecting. ACRI measures technical crawler readiness. Read the methodology →

Why h2o.ai ranks here

Tech stack: HubSpot CMS
Industry: government
Rendering: Hybrid
Schema coverage: 0 blocks
Token bloat: 5.2×

Fastest improvements

  • Add basic Organization and WebSite JSON-LD to fix “0 schema blocks” (see Schema Coverage).
  • Reduce token bloat (navigation/footer/code) so agents reach your main content faster (see Token Bloat).
  • Create an llms.txt file so AI crawlers can discover your content structure without heavy crawling. Generate llms.txt →
  • Run a full entropy audit to find which DOM regions waste the most tokens. Run Entropy Audit →
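
For the llms.txt suggestion above, here is a rough sketch of generating such a file. It follows the layout of the public llms.txt proposal (an H1 title, a blockquote summary, then H2 sections of Markdown links). The helper name `build_llms_txt`, the summary text, and the link descriptions are placeholders; only the `https://h2o.ai/blog/` path and the `docs.h2o.ai` domain come from this report.

```python
def build_llms_txt(title, summary, sections):
    """Render an llms.txt document from a title, summary, and link sections.

    sections maps a heading to a list of (name, url, note) tuples.
    """
    lines = [f"# {title}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for name, url, note in links:
            lines.append(f"- [{name}]({url}): {note}")
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


# Placeholder content: replace the summary and notes with real descriptions.
EXAMPLE = build_llms_txt(
    "H2O.ai",
    "Replace with a one-sentence description of the site.",
    {
        "Docs": [("Documentation", "https://docs.h2o.ai", "Product documentation")],
        "Blog": [("Blog", "https://h2o.ai/blog/", "Articles and tutorials")],
    },
)
print(EXAMPLE)
```

The output would be served as a plain-text file at the site root, by the proposal's convention.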
🧪

JavaScript Rendering Check

We check what AI crawlers miss when they skip JavaScript execution.

🛡️

Traditional SEO

40/100 25 % of Global Score 🟢 High Confidence

📝 Title Tag

99 chars
Too long

Optimal range: 30–60 characters for SERP display.

📋 Meta Description

147 chars
Good length

Optimal range: 120–160 characters for snippet control.

🔤 Heading Hierarchy

  • ✗ Exactly 1 <h1> tag — found 4
  • ✓ Has <h2> headings — found 8
  • ✓ <h2> not before <h1>
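
The three heading checks above can be approximated with the standard-library HTML parser. This is a sketch of the idea, not SEODiff's implementation; `HeadingCollector` and `heading_checks` are illustrative names.

```python
from html.parser import HTMLParser


class HeadingCollector(HTMLParser):
    """Record h1-h6 start tags in document order."""

    def __init__(self):
        super().__init__()
        self.headings = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so "h1".."h6" is enough.
        if len(tag) == 2 and tag[0] == "h" and tag[1] in "123456":
            self.headings.append(tag)


def heading_checks(html):
    """Return the three heading-hierarchy checks for an HTML string."""
    collector = HeadingCollector()
    collector.feed(html)
    tags = collector.headings
    h1_count = tags.count("h1")
    h2_count = tags.count("h2")
    first_h1 = tags.index("h1") if h1_count else None
    first_h2 = tags.index("h2") if h2_count else None
    return {
        "h1_count": h1_count,
        "single_h1": h1_count == 1,
        "has_h2": h2_count > 0,
        # Passes when there is no h2, or when an h1 appears before the first h2.
        "h2_not_before_h1": first_h2 is None
        or (first_h1 is not None and first_h1 < first_h2),
    }


sample = "<h1>Home</h1><h2>Products</h2><h1>Second title</h1>"
print(heading_checks(sample))
```

A page with four `<h1>` tags, as reported here, would fail `single_h1` while still passing the other two checks.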

🔍 Indexability

  • ✓ Canonical tag present → https://h2o.ai/
  • ✓ No noindex directive
  • ✓ Meta viewport set
  • ✓ HTML lang attribute → en
  • ➖ Hreflang tags — N/A (single language site)
  • ✓ Googlebot allowed by robots.txt

🌐 Social / OpenGraph

  • ✗ og:title
  • ✗ og:description
  • ✗ og:image
  • ✗ twitter:card
📐 How the SEO Pillar score is calculated

SEO Pillar = Title (20 pts) + Meta Desc (20 pts) + Heading Hierarchy (20 pts) + Indexability (20 pts) + Social/OG (20 pts)

Each sub-score is derived from the checks above. Canonical tag, lang attribute, og:image, and a single H1 are the highest-impact items.

🤖

AI Readiness / GEO

79/100 40 % of Global Score 🟢 High Confidence

This pillar aggregates citation share, hallucination risk, bot access, schema health, and content extractability. The individual diagnostic sections below contribute to this score.

🔗

Citation Alternatives

Research
💡
Insight: In the government sector, gep.com (ACRI: 83) currently has stronger AI extractability. AI models tend to prefer sources with higher semantic structure and schema coverage. Domains with ACRI < 40 see 3.5× more hallucinations. Read the research →
h2o.ai
55
Your ACRI Score
83
Industry Peer ACRI
AI models prioritize pages with strong semantic structure and schema coverage. gep.com has schema coverage of 3 blocks and uses Drupal. Improve your score by implementing the remediation patches below.
📊 Side-by-Side Comparison →
🚨

Hallucination Risk

Research

Is AI lying about your brand? This panel measures how likely LLMs are to hallucinate facts when extracting information from your page.


🤖 Bot Access Matrix

GPTBot (OpenAI)
Allowed
ClaudeBot (Anthropic)
Allowed
CCBot (Common Crawl)
Allowed
Google-Extended
Allowed
Googlebot
Allowed
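
A matrix like the one above can be rebuilt from any robots.txt with Python's `urllib.robotparser`. The robots.txt content below is a made-up sample for illustration, not h2o.ai's actual file, and `bot_access_matrix` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# User agents listed in the Bot Access Matrix above.
AI_AGENTS = ["GPTBot", "ClaudeBot", "CCBot", "Google-Extended", "Googlebot"]


def bot_access_matrix(robots_txt, url, agents=AI_AGENTS):
    """Return {agent: allowed} for a robots.txt body and a target URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


# Hypothetical robots.txt that blocks only GPTBot.
SAMPLE_ROBOTS = """User-agent: *
Disallow: /private/

User-agent: GPTBot
Disallow: /
"""

for agent, allowed in bot_access_matrix(SAMPLE_ROBOTS, "https://example.com/").items():
    print(f"{agent}: {'Allowed' if allowed else 'Blocked'}")
```

For a live check, the parser's `set_url` and `read` methods can load the file from the site instead of a string.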

👻 Rendering (Ghost Ratio) Docs

Ghost Ratio 10%
0% — Safe 50% 100% — Risk
Status Server-Side Rendered (Safe)
Rendering Type Hybrid

📊 Structure & Information Density Docs

Structure Grade 47/100 — Fair
Structured Elements 77 elements (77 lists, 0 rows, 0 headers)
Total Words: 1145
Raw Density: 6.7%

🏷️ Schema Health Docs

Organization Schema ❌ Missing
Product / Service Schema ⚠️ Not Found
Total Schema Blocks: 0 (no JSON-LD detected)

Schema Coverage Map

0/7 schema types detected
❌ Organization
❌ Product/Service
❌ Breadcrumb
❌ FAQ
❌ Article
❌ WebSite
💡Organization schema missing. AI models cannot identify your brand entity. Without it, your brand won't appear in Knowledge Panels or be associated with your content.
💡Product / Service schema missing. AI models don't know this is a SaaS product. Add Product or SoftwareApplication schema so AI understands what you offer and can surface pricing/features.
💡BreadcrumbList schema missing. AI cannot understand your site hierarchy or how pages relate to each other.
💡FAQ schema missing. Adding FAQPage schema lets AI models directly extract Q&A pairs for Featured Snippets and chatbot answers.
💡WebSite schema missing. Add WebSite + SearchAction so Google can generate a Sitelinks Search Box for your brand in AI results.

📐 AI Efficiency Metrics Docs

Extractability: 58/100. AI models can partially extract answers from this page
Crawl Cost: Medium (50/100), moderate for AI crawlers to process
Blocklist Risk: None, 0 of 5 AI crawlers blocked

Token Bloat Research

Useful Content: 19% (40.1 KB)
🗑️ Bloat: 81% (170.4 KB)
Token Bloat Ratio: 5.2× (Normal)
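
The figures above are consistent with a simple size ratio. The sketch below assumes the ratio is raw HTML size divided by visible text size; that definition is inferred from the numbers in this report (210.5 KB raw HTML, 40.1 KB visible text), not taken from SEODiff documentation, and `token_bloat` is an illustrative name.

```python
def token_bloat(raw_kb, visible_kb):
    """Compute bloat metrics from raw HTML size and visible text size (KB)."""
    return {
        "ratio": raw_kb / visible_kb,          # assumed definition of the bloat ratio
        "bloat_kb": raw_kb - visible_kb,       # bytes that carry no visible text
        "useful_share": visible_kb / raw_kb,   # fraction of the payload that is content
    }


metrics = token_bloat(raw_kb=210.5, visible_kb=40.1)
print(f"Token bloat ratio: {metrics['ratio']:.1f}x")
print(f"Bloat: {metrics['bloat_kb']:.1f} KB")
print(f"Useful content: {metrics['useful_share']:.0%}")
```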

Multimodal Readiness

Visual Context: 76% Optimized for Vision
Image Alt Coverage: 170 / 225 images have alt text

TDM Rights

TDM-Reservation header: Not set
X-Robots-Tag: noai header: Not set

🔥 Structural Entropy Check Research

Entropy score: 0 (Poor)
Token Bloat: High
Noise Ratio: 81.0% · SNR: 0.24 · Signal: 10263 / Noise: 43615 tokens
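
The noise ratio and SNR above follow from the two token counts. The formulas in this sketch (noise share of all tokens, and signal divided by noise) are assumptions that happen to reproduce the reported values; the function name is illustrative.

```python
def entropy_metrics(signal_tokens, noise_tokens):
    """Return (noise_ratio, snr) from signal and noise token counts."""
    total = signal_tokens + noise_tokens
    noise_ratio = noise_tokens / total   # share of tokens that are noise
    snr = signal_tokens / noise_tokens   # signal-to-noise ratio
    return noise_ratio, snr


noise_ratio, snr = entropy_metrics(signal_tokens=10263, noise_tokens=43615)
print(f"Noise ratio: {noise_ratio:.1%}")
print(f"SNR: {snr:.2f}")
```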

🔬 AI-Crawler Simulation

See your website the way AI crawlers do. CSS stripped, structure labeled, content chunked.

🤖

AI Answer Preview

NEW

See how AI models summarize your site. Left: your actual content. Right: what the LLM extracts and says about you.

🧠

The LLM Interpretation

AI-VERIFIED

SEODiff AI analyzed the extracted content of h2o.ai and produced this structured business intelligence. Fields marked SEMANTIC VOID indicate information the AI could not find — a critical gap in your site’s machine-readability.

Core Offering
AI-powered platform for identifying individual animals from images, enabling wildlife conservation and tracking by analyzing vast amounts of image data from various sources.
Target Audience
Conservationists, researchers, wildlife biologists, NGOs, government agencies (e.g., US Department of Interior), and citizen scientists involved in wildlife monitoring and biodiversity research.
Pricing Model
Platform is currently non-profit and relies on donations and grants. There is a potential for future premium features or services, but currently, access is free.
🔗 Integration Partners
iNaturalist, Tweet-a-Whale
🏆 Competitive Moat
Unique combination of AI-powered individual animal identification, a large and growing database of image data (Flukebook), and a human-in-the-loop approach for ensuring accuracy and minimizing noise.
📊 Content Depth
8/10
🔄 Programmatic SEO Signals
Integration directory pages, Template comparison pages
⚡ Key Pain Points
• Lack of structured FAQ schema
• Thin landing pages for features
Analyzed by SEODiff AI · 2026-03-03

🔧 Tech Stack

Framework: HubSpot CMS
AI-Readiness Score: 70/100
Server
CDN
HTTP Status: 200
Load Time: 580 ms
Raw HTML Size: 210.5 KB
Visible Text Size: 40.1 KB

Performance & Speed

59/100 20 % of Global Score 🟢 High Confidence

⏱️ Time to First Byte

580 ms
Acceptable — room for improvement

Google considers <200 ms "good". AI crawlers may have even shorter timeouts.

📦 Page Weight

DOM nodes: 1494
HTML payload: 210 KB
Moderate weight, acceptable for most scenarios

🗄️ Cache & CDN

  • ✓ Cache-Control header → max-age=120,s-maxage=600,stale-while-revalidate=43200,stale-if-error=43200
  • ✗ CDN cache status
  • ✗ CDN detected
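
To read the Cache-Control value above, it helps to split it into directives. This is a small sketch with an illustrative helper name; it handles the simple `name=value` and bare-flag forms seen here, not every edge case of the header grammar.

```python
def parse_cache_control(header):
    """Split a Cache-Control header into a {directive: value} dict.

    Numeric values become ints; bare directives such as no-store map to True.
    """
    directives = {}
    for part in header.split(","):
        part = part.strip()
        if not part:
            continue
        name, sep, value = part.partition("=")
        name = name.strip().lower()
        value = value.strip().strip('"')
        if value.isdigit():
            directives[name] = int(value)
        else:
            directives[name] = value if sep else True
    return directives


HEADER = "max-age=120,s-maxage=600,stale-while-revalidate=43200,stale-if-error=43200"
for name, value in parse_cache_control(HEADER).items():
    print(f"{name}: {value}")
```

Read this way, the header lets browsers cache the page for 120 seconds and shared caches for 600 seconds, with a 12-hour window (43200 seconds) for serving stale content during revalidation or on errors.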

🔬 Tracker Tax

Tracker scripts: 1
Third-party domains: 1 (js.hs-scripts.com)
Token overhead: 0.0%
Minimal tracker load, a clean signal for bots
📐 How the Performance Pillar score is calculated

Perf Pillar = TTFB (35 pts) + Page Weight (25 pts) + Cache/CDN (20 pts) + Tracker Tax (20 pts)

TTFB <200 ms = full marks. DOM >3000 or payload >300 KB incurs heavy penalties. Tracker scripts beyond 5 reduce score.

🏗️

Architecture & Trust

89/100 15 % of Global Score 🟢 High Confidence

🗺️ Sitemap & Robots

  • ✓ Sitemap declared in robots.txt → https://www.h2o.ai/sitemap.xml
  • ✓ Googlebot allowed
  • ✓ GPTBot allowed
  • ✓ ClaudeBot allowed

🔗 Linking

Internal links: 122
External links: 5
Good internal linking, which helps crawlers discover content

🔒 Security & Trust

  • ✓ HSTS header (Strict-Transport-Security)
  • ✗ Content-Security-Policy header
  • ✓ HTTP status 200 OK (got 200)

♿ Accessibility Signals

  • ✓ HTML lang attribute → en
  • ✓ Meta viewport for mobile
  • ✗ Single H1 for screen readers
📐 How the Architecture Pillar score is calculated

Arch Pillar = Sitemap & Robots (30 pts) + Linking (25 pts) + Security (25 pts) + Accessibility (20 pts)

Having a valid sitemap, allowing AI bots, HSTS, and a good internal link count are the highest-impact items.

🏅 AI-Verified Trust Badge

Your site scores 55/100. Reach 80+ to unlock the green "AI-Verified" badge. Fix the issues below to improve your score.

AI-Verified badge for h2o.ai
Pending Audit — score below 80 threshold
<a href="https://seodiff.io/radar/domains/h2o.ai" rel="noopener"><img src="https://seodiff.io/api/v1/badge?domain=h2o.ai" alt="AI-Verified by SEODiff" width="280" height="52"></a>

💡 Paste in your site footer, GitHub README, or email signature. Badge updates automatically as your score changes.

Deep Crawl Analysis: 58 pages · Deep-10

Homepage ACRI: 55 (single-page score)
Δ delta: +11 (subpages outperform homepage)
Site-Wide ACRI: 66 (avg across 58 pages · range 0–90)
Topical Cohesion: 15% (Topical Drift; TF-IDF cosine similarity)
Total Words: 85999
Avg Bloat: 30.3×
RAG Fractures: 1
⚠️ 1 RAG-Chunking Fracture Detected

Poorly formatted tables or pricing grids on 1 page will be split incorrectly during RAG chunking, causing AI models to hallucinate prices and features.
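
The Topical Cohesion figure in this section is labeled as TF-IDF cosine similarity. How SEODiff aggregates it is not stated, so the sketch below makes an assumption: cohesion is the mean pairwise cosine similarity between TF-IDF vectors of the crawled pages. Tokenization is a plain whitespace split and all function names are illustrative.

```python
import math
from collections import Counter
from itertools import combinations


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (dict) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)  # smoothed IDF
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def topical_cohesion(docs):
    """Mean pairwise cosine similarity (assumed aggregation)."""
    vectors = tfidf_vectors(docs)
    pairs = list(combinations(vectors, 2))
    if not pairs:
        return 1.0
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


pages = [
    "machine learning model training",
    "machine learning model deployment",
    "horse racing insurance predictions",
]
print(f"Cohesion: {topical_cohesion(pages):.0%}")
```

Under this reading, a low cohesion value means pages share few distinctive terms, which fits a blog that covers many unrelated topics.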

Page Type ACRI Token Bloat Words Status
https://h2o.ai/blog/2023/ai-and-humans-combating-extinction-together-with-dr-tanya-berger-wolf/
AI and Humans Combating Extinction Together with Dr. Tanya Berger-Wolf
pricing 90 7.2× 4037 💰 Pricing
https://h2o.ai/blog/2023/building-a-fraud-detection-model-with-h2o-ai-cloud/
Building a Fraud Detection Model with H2O AI Cloud | H2O.ai
pricing 90 8.3× 3643 💰 Pricing
https://h2o.ai/blog/2023/entrenando-tu-propio-llm-sin-programacion/
Entrenando Tu Propio LLM Sin Programación
pricing 90 9.3× 3426 💰 Pricing
https://h2o.ai/blog/2023/introduction-to-h2o-document-ai/
Introduction to H2O Document AI | H2O.ai
pricing 90 5.4× 5727 💰 Pricing
https://h2o.ai/blog/2023/att-panel-ai-as-a-service-aiaas/
AT&T panel: AI as a Service (AIaaS)
pricing 90 5.8× 5278 💰 Pricing
https://h2o.ai/blog/2024/building-your-first-agent-step-by-step-with-h2ogpte---llm-chains/
Building your first Agent step-by-step with h2oGPTe & LLM Chains
pricing 85 9.2× 3338 💰 Pricing
https://h2o.ai/blog/2023/ai-in-insurance-resolution-lifes-ai-journey-with-rajesh-malla/
AI in Insurance: Resolution Life's AI Journey with Rajesh Malla | H2O.ai
pricing 80 13.1× 2059 💰 Pricing
https://h2o.ai/blog/2023/genai-app-store/
Introducing the H2O GenAI App Store: A Playground of Generative AI Innovation
pricing 80 12.5× 2178 💰 Pricing
https://h2o.ai/blog/2023/h2o-world-sydney-tim-fountaine/
H2O World Sydney With Dr. Tim Fountaine
pricing 80 10.6× 2517 💰 Pricing
https://h2o.ai/blog/2023/testing-large-language-model-llm-vulnerabilities-using-adversarial-attacks/
Testing Large Language Model (LLM) Vulnerabilities Using Adversarial Attacks | H2O.ai
pricing 80 12.6× 2223 💰 Pricing
https://h2o.ai/blog/2023/deploy-a-wave-app-on-an-aws-ec2-instance/
Deploy a WAVE app on an AWS EC2 instance | H2O.ai
pricing 80 14.9× 1911 💰 Pricing
https://h2o.ai/blog/2023/effortless-fine-tuning-of-large-language-models-with-open-source-h2o-llm-studio/
Effortless Fine-Tuning of Large Language Models with Open-Source H2O LLM Studio | H2O.ai
pricing 80 14.1× 2303 💰 Pricing
https://h2o.ai/blog/2023/navigating-the-challenges-of-time-series-forecasting/
Navigating the challenges of time series forecasting | H2O.ai
pricing 80 11.0× 2678 💰 Pricing
https://h2o.ai/blog/2023/streamlining-data-preparation-for-fine-tuning-of-large-language-models/
H2O LLM DataStudio: Streamlining Data Curation and Data Preparation for LLMs related tasks | H2O.ai
pricing 80 16.1× 1748 💰 Pricing
https://h2o.ai/blog/2023/h2o-releases-3-40-0-1-and-3-42-0-1/
H2O Releases 3.40.0.1 and 3.42.0.1 | H2O.ai
pricing 80 13.6× 2161 💰 Pricing
https://h2o.ai/blog/2023/how-horse-racing-predictions-with-h2o-ai-saved-a-local-insurance-company-8m-a-year/
How Horse Racing Predictions with H2O.ai Saved a Local Insurance Company $8M a Year
pricing 80 10.8× 2512 💰 Pricing
https://h2o.ai/blog/2023/introducing-h2o-llm-app-studio-sketch2app-part-1/
Generating LLM Powered Apps using H2O LLM AppStudio – Part1: Sketch2App
pricing 80 15.0× 1878 💰 Pricing
https://h2o.ai/blog/2023/how-commonwealth-bank-is-transforming-operations-with-document-ai/
How Commonwealth Bank is transforming operations with Document AI | H2O.ai
pricing 77 19.6× 1297 💰 Pricing
https://h2o.ai/blog/2023/building-a-manufacturing-product-defect-classification-model-and-application-using-h2o-hydrogen-torch-h2o-mlops-and-h2o-wave/
Building a Manufacturing Product Defect Classification Model and Application using H2O Hydrogen Torch, H2O MLOps, and H2O Wave | H2O.ai
pricing 77 19.8× 1377 💰 Pricing
https://h2o.ai/blog/2016/fast-csv-writing-for-r/
Fast csv writing for R | H2O.ai
pricing 75 12.0× 2217 💰 Pricing
Showing 20 of 58 pages. Unlock full subpage table →
📂
Health by Sub-Directory
Average ACRI and top issues aggregated by URL path prefix
Path Pages Avg ACRI Ghost % Bloat Top Issue
/blog/ 50 74 0% 27.5× High JS Bloat
/contact/ 1 49 0% 121.0× High JS Bloat
/about/ 1 0 0% 0.0× Low AI Readiness
/case-studies/ 1 36 0% 182.7× High JS Bloat
/pricing/ 1 0 0% 0.0× Low AI Readiness
/features/ 1 0 0% 0.0× Low AI Readiness
/faq/ 1 0 0% 0.0× Low AI Readiness
/integrations/ 1 0 0% 0.0× Low AI Readiness
/products/ 1 59 0% 79.1× High JS Bloat
🔗
Outbound External Citations
0 unique external domains cited across 58 pages
genai.h2o.ai ×53
twitter.com ×53
youtube.com ×53
gitter.im ×53
github.com ×53
support.h2o.ai ×53
id.cloud.h2o.ai ×53
docs.h2o.ai ×53
🔄 Re-Crawl & Update 📡 Track this Domain

Scores update automatically each month. Create a free account for on-demand re-crawls (3/month free).

🔌 API Access

Pull this data programmatically. All sub-page metrics are available via our public API.

curl https://seodiff.io/api/v1/deep10/domain/h2o.ai

Get your free API key — 100 requests/month included.
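
The same request can be made from Python with the standard library. This sketch assumes the endpoint returns JSON and that no authentication header is needed for this call; the report does not say how an API key is passed, so none is sent. The function names are illustrative.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Endpoint prefix taken from the curl example above.
API_BASE = "https://seodiff.io/api/v1/deep10/domain/"


def deep10_url(domain):
    """Build the Deep-10 endpoint URL for a domain."""
    return API_BASE + quote(domain, safe="")


def fetch_deep10(domain, timeout=10):
    """Fetch and decode the Deep-10 report (assumes a JSON response body)."""
    with urlopen(deep10_url(domain), timeout=timeout) as response:
        return json.load(response)


print(deep10_url("h2o.ai"))
```

Calling `fetch_deep10("h2o.ai")` performs the network request; the shape of the returned data is not documented in this report, so inspect it before relying on specific keys.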

🔗 Similar government Sites

Domains with a similar tech stack, industry, and AI readiness profile to h2o.ai. Compare side-by-side.

Domain ACRI AI Score Tech Stack Token Bloat Schema
h2o.ai (this site) 55 71 HubSpot CMS 5.2× 0
muskegon-mi.gov 78 88 WordPress 5.2× 1 Compare →
adaptiveplanning.com 78 91 Adobe Experience Manager 5.0× 1 Compare →
adaptiveinsights.com 78 91 Adobe Experience Manager 5.0× 1 Compare →
saskatoon.ca 78 81 Drupal 1.5× 0 Compare →
allinternationalconference.com 78 80 Custom / Proprietary 1.5× 0 Compare →
Compare All 5 Similar Sites →

📊 Semantic Share of Voice

How often would an AI cite h2o.ai when users ask about topics in this domain's niche? We run entity queries through our 188k-page search index and measure citation probability.


🩹

Remediation Patches

COPY-PASTE

Auto-generated code fixes tailored to h2o.ai. Copy and paste these into your codebase to improve AI visibility. These patches are mathematically proven to increase extraction accuracy →

Add Organization JSON-LD
High Impact ⏱ 5 min
AI models cannot identify your brand entity without Organization schema. This is the #1 fix for AI visibility.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "H2O.ai",
  "url": "https://h2o.ai",
  "logo": "https://h2o.ai/etc.clientlibs/h2o/clientlibs/clientlib-site/resources/images/favicon.ico",
  "sameAs": []
}
</script>
Add WebSite + SearchAction JSON-LD
High Impact ⏱ 5 min
Enables the Sitelinks Search Box in Google and allows AI to understand your site structure.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "WebSite",
  "name": "H2O.ai",
  "url": "https://h2o.ai",
  "potentialAction": {
    "@type": "SearchAction",
    "target": "https://h2o.ai/search?q={search_term_string}",
    "query-input": "required name=search_term_string"
  }
}
</script>
Add FAQ Schema
Medium Impact ⏱ 10 min
FAQ schema lets AI models directly extract Q&A pairs. This is the easiest way to get featured in AI responses.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is H2O.ai?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Add your answer here: describe what H2O.ai does in 1-2 sentences."
      }
    },
    {
      "@type": "Question",
      "name": "How does H2O.ai work?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Explain the key features and how users interact with H2O.ai."
      }
    }
  ]
}
</script>
📈

Projected Impact

ROI EST.

If you apply the patches above, here's the estimated improvement for h2o.ai:

Current Score
71
Projected Score
87
Improvement
+16 pts
Add Organization schema +6 pts
Add WebSite schema +4 pts
Reduce token bloat +3 pts
Add FAQ schema +3 pts

*Estimates based on SEODiff's scoring model. Actual results depend on implementation quality.
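
The projection above is a straight sum of the per-fix estimates. A short sketch using only the numbers listed in this section (variable names are illustrative):

```python
# Estimated gains per fix, as listed in the Projected Impact section.
CURRENT_SCORE = 71
ESTIMATED_GAINS = {
    "Add Organization schema": 6,
    "Add WebSite schema": 4,
    "Reduce token bloat": 3,
    "Add FAQ schema": 3,
}

improvement = sum(ESTIMATED_GAINS.values())
projected = CURRENT_SCORE + improvement
print(f"Improvement: +{improvement} pts")
print(f"Projected score: {projected}")
```

The two schema patches account for 10 of the 16 projected points, so they are the ones to apply first.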

📋 Data Export

Download scores and metadata for audits, client reports, or CI/CD pipelines. Exports contain computed metrics only (no copyrighted content).

All data is generated automatically and updated with each crawl.


🧭 Self-Diffing (Private Layer)

For owned domains, combine this world snapshot with private drift + regression history.
Template Drift
Track in My Site
Drift → Traffic Impact
In development, coming soon
Regression Incidents
Track in My Site
Internal Linking
Deep Audit graph
Semantic Structure
GEO view in Deep Audit
Content Quality
Thin/duplicate tracking

🕒 History

Score over time: Available in My Site history
Drift events: Template timeline + incidents
Drift → Revenue Attribution: Coming soon
Schema/rendering/extractability changes: Tracked per scan in project history