pdfroom.com 50 D
🛡️ SEO 46 🤖 GEO 49 ⚡ Perf 42 🏗️ Arch 70

pdfroom.com — Global SEODiff Score 50/100

pdfroom.com
📊

pdfroom.com scores 63/100 on the AI-Readiness Index, placing it in the middle tier of indexed domains — there is clear room for improvement. Within the infrastructure vertical, this places pdfroom.com above the industry average of 57 —, suggesting strong competitive positioning in AI search. The rendering approach is hybrid, with a moderate ghost ratio of 30% — most content is accessible without JS, though some elements are client-rendered. A 21.7× token bloat ratio means crawlers must process significantly more tokens to reach the actual content — a drag on extraction efficiency. Zero schema blocks puts this site at a disadvantage in knowledge graph and AI-answer pipelines that rely on explicit structured data. Robots.txt grants unrestricted access to the key AI user-agents, which is the strongest starting position for AI visibility.

50
D — Global SEODiff Score
Comprehensive search visibility assessment
Below average — Performance (42) has the most room for improvement.
🎯 Top Fix: Reduce token bloat (22×) → +5–10 pts
🔬 Automated SEODiff Assessment · Snapshot: Mar 18, 2026 · 📋 API
📈 ACRI Trend 6 snapshots
Feb 23 Mar 18
🔔 Recent AI Indexing Activity
📉 Mar 18 ACRI -1 (37→36)
Does your site score higher than pdfroom.com?
Run the same 40-signal audit on your own domain — free, instant results.
Scan Your Site Free →
🧮 Score Transparency — How is this calculated?
🛡️ Traditional SEO (25% weight)46 × 0.25 = 11.5
🤖 AI Readiness / GEO (40% weight)49 × 0.40 = 19.6
⚡ Performance (20% weight)42 × 0.20 = 8.4
🏗️ Architecture & Trust (15% weight)70 × 0.15 = 10.5
Weighted sum = 11.5 + 19.6 + 8.4 + 10.5
Global SEODiff Score = 50 (D)
📊 ACRI Sub-Scores (AI Readiness Detail)
100
Bot Access
avg 92
84
Rendering
avg 93
20
Structure
avg 35
0
Schema
avg 9
75
Tech Stack
avg 63
🔀
Visibility Delta: Google vs AI
Google (Tranco)
Top 7%
Rank #71156
+41 pts
Gap
AI (ACRI)
Top 48%
Score 63/100

pdfroom.com punches above its weight in AI — AI visibility exceeds Google ranking. This is a competitive moat worth protecting. ACRI measures technical crawler readiness. Read the methodology →

Why pdfroom.com ranks here

Tech stackCloudflare Pages
RenderingHybrid
Schema coverage0 blocks
Token bloat21.7×

Fastest improvements

  • Add basic Organization and WebSite JSON-LD to fix “0 schema blocks” (see Schema Coverage).
  • Reduce token bloat (navigation/footer/code) so agents reach your main content faster (see Token Bloat).
  • Create an llms.txt file so AI crawlers can discover your content structure without heavy crawling. Generate llms.txt →
  • Run a full entropy audit to find which DOM regions waste the most tokens. Run Entropy Audit →
🧪

JavaScript Rendering Check

We check what AI crawlers miss when they skip JavaScript execution.

Running headless browser to simulate AI extraction…
🛡️

Traditional SEO

46/100 25 % of Global Score 🟢 High Confidence

📝 Title Tag

43 chars
Good length

Optimal range: 30–60 characters for SERP display.

📋 Meta Description

151 chars
Good length

Optimal range: 120–160 characters for snippet control.

🔤 Heading Hierarchy

  • ✓ Exactly 1 <h1> tag — found 1
  • ✓ Has <h2> headings — found 6
  • ✗ <h2> not before <h1>

🔍 Indexability

  • ✓ Canonical tag present → https://pdfroom.com
  • ✓ No noindex directive
  • ✓ Meta viewport set
  • ✓ HTML lang attribute → en
  • ✅ Hreflang tags
  • ✓ Googlebot allowed by robots.txt

🌐 Social / OpenGraph

  • ✓ og:title — PDF Room - Your Search Engine For Free PDFs
  • ✓ og:description — The PDF Room search engine allows you to find the best educational and recreational PDFs online. Browse through high-quality PDFs from trusted sources.
  • ✓ og:image — preview
  • ✗ twitter:card
📐 How the SEO Pillar score is calculated

SEO Pillar = Title (20 pts) + Meta Desc (20 pts) + Heading Hierarchy (20 pts) + Indexability (20 pts) + Social/OG (20 pts)

Each sub-score is derived from the checks above. Canonical tag, lang attribute, og:image, and a single H1 are the highest-impact items.

🤖

AI Readiness / GEO

49/100 40 % of Global Score 🟢 High Confidence

This pillar aggregates citation share, hallucination risk, bot access, schema health, and content extractability. The individual diagnostic sections below contribute to this score.

🔗

Citation Alternatives

Research
💡
Insight: In the infrastructure sector, safely.co.jp (ACRI: 90) currently has stronger AI extractability. AI models tend to prefer sources with higher semantic structure and schema coverage. Domains with ACRI < 40 see 3.5× more hallucinations. Read the research →
pdfroom.com
36
Your ACRI Score
90
Industry Peer ACRI
AI models prioritize pages with strong semantic structure and schema coverage. safely.co.jp has schema coverage of 3 blocks and uses WordPress. Improve your score by implementing the remediation patches below.
📊 Side-by-Side Comparison →
🚨

Hallucination Risk

Research

Is AI lying about your brand? This panel measures how likely LLMs are to hallucinate facts when extracting information from your page.

👻
Shadow Content Detected: 30% of your page token budget is in non-rendered regions (JavaScript-dependent content that AI crawlers may not process). Combined with 21.7x token bloat, AI models are using most of their context window on noise instead of your real content. This dramatically increases hallucination probability — models fill the gap with made-up facts.
Analyzing hallucination risk…

🤖 Bot Access Matrix

GPTBot (OpenAI)
Allowed
ClaudeBot (Anthropic)
Allowed
CCBot (Common Crawl)
Allowed
Google-Extended
Allowed
Googlebot
Allowed

👻 Rendering (Ghost Ratio) Docs

Ghost Ratio 30%
0% — Safe 50% 100% — Risk
Status Server-Side Rendered (Safe)
Rendering Type Hybrid

📊 Structure & Information Density Docs

Structure Grade 20/100 — Low
Structured Elements 17 elements (17 lists, 0 rows, 0 headers)
Total Words1445
Raw Density1.2%
💡Low structure score (20/100). Your content appears as a wall of text with few structured HTML elements. You have 17 list items, 0 table rows, 0 table headers. Convert features into <ul> lists and data into <table> elements to help AI models extract structured information.

🏷️ Schema Health Docs

Organization Schema ❌ Missing
Product / Service Schema ⚠️ Not Found
Total Schema Blocks0 — No JSON-LD detected

Schema Coverage Map

0/7 schema types detected
❌ Organization
❌ Product/Service
❌ Breadcrumb
❌ FAQ
❌ Article
❌ WebSite
💡Organization schema missing. AI models cannot identify your brand entity. Without it, your brand won't appear in Knowledge Panels or be associated with your content.
💡Product / Service schema missing. AI models don't know this is a SaaS product. Add Product or SoftwareApplication schema so AI understands what you offer and can surface pricing/features.
💡BreadcrumbList schema missing. AI cannot understand your site hierarchy or how pages relate to each other.
💡FAQ schema missing. Adding FAQPage schema lets AI models directly extract Q&A pairs for Featured Snippets and chatbot answers.
💡WebSite schema missing. Add WebSite + SearchAction so Google can generate a Sitelinks Search Box for your brand in AI results.

📐 AI Efficiency Metrics Docs

32
AI Extractability
High
Crawl Cost
None
Blocklist Risk
Extractability32/100 — AI models can barely extract answers from this page
Crawl CostHigh (95/100) — expensive for AI crawlers to process
Blocklist RiskNone — 0 of 5 AI crawlers blocked

Token Bloat Research

4%
🗑️ 96%
Useful Content (34.5 KB)Bloat (714.9 KB)
Token Bloat Ratio21.7× — Heavy

Multimodal Readiness

Visual Context100% Optimized for Vision
Image Alt Coverage44 / 44 images have alt text

TDM Rights

TDM-Reservation HeaderNot set
X-Robots-Tag: noaiNot set
💡Your HTML is 749.4 KB, but only 34.5 KB is text. 4% useful / 96% bloat. AI crawlers have limited context windows (e.g. 128k tokens). This level of bloat (21.7×) risks context-window truncation by ChatGPT, Claude, and Gemini. Reduce inline scripts, CSS, hydration payloads, and tracking code.

🔥 Structural Entropy Check Research

0 Entropy
Poor Token Bloat: High
Noise Ratio: 95.4% · SNR: 0.05 · Signal: 8824 / Noise: 183011 tokens

🔬 AI-Crawler Simulation

See your website the way AI crawlers do. CSS stripped, structure labeled, content chunked.

🌐
This is what humans see — styled, branded, visual.
Toggle to "AI Agent View" to see what GPTBot, ClaudeBot, and other AI crawlers actually extract from this page.
🤖

AI Answer Preview

NEW

See how AI models summarize your site. Left: your actual content. Right: what the LLM extracts and says about you.

Simulating AI extraction…

🔧 Tech Stack

AI-Readiness Score75/100
Servercloudflare
CDNcloudflare
HTTP Status200
Load Time1480 ms
Raw HTML Size749.4 KB
Visible Text Size34.5 KB

Performance & Speed

42/100 20 % of Global Score 🟢 High Confidence

⏱️ Time to First Byte

1480 ms
Slow — bots may time out or deprioritise

Google considers <200 ms "good". AI crawlers may have even shorter timeouts.

📦 Page Weight

1048
DOM nodes
749 KB
HTML payload
Heavy page — consider reducing DOM complexity

🗄️ Cache & CDN

  • ✓ Cache-Control header → no-cache, private
  • ✓ CDN cache status → DYNAMIC
  • ✓ CDN detected → cloudflare

🔬 Tracker Tax

1
tracker scripts
1
third-party domains
0.0%
token overhead
Minimal tracker load — clean signal for bots
googletagmanager.com
📐 How the Performance Pillar score is calculated

Perf Pillar = TTFB (35 pts) + Page Weight (25 pts) + Cache/CDN (20 pts) + Tracker Tax (20 pts)

TTFB <200 ms = full marks. DOM >3000 or payload >300 KB incurs heavy penalties. Tracker scripts beyond 5 reduce score.

🏗️

Architecture & Trust

70/100 15 % of Global Score 🟡 Medium Confidence

🗺️ Sitemap & Robots

  • ✗ Sitemap declared in robots.txt
  • ✓ Googlebot allowed
  • ✓ GPTBot allowed
  • ✓ ClaudeBot allowed

🔗 Linking

230
internal links
4
external links
Good internal linking — helps crawlers discover content

🔒 Security & Trust

  • ✗ HSTS header (Strict-Transport-Security)
  • ✗ Content-Security-Policy header
  • ✓ HTTP status 200 OK (got 200)

♿ Accessibility Signals

  • ✓ HTML lang attribute → en
  • ✓ Meta viewport for mobile
  • ✓ Single H1 for screen readers
📐 How the Architecture Pillar score is calculated

Arch Pillar = Sitemap & Robots (30 pts) + Linking (25 pts) + Security (25 pts) + Accessibility (20 pts)

Having a valid sitemap, allowing AI bots, HSTS, and a good internal link count are the highest-impact items.

🏅 AI-Verified Trust Badge

Your site scores 36/100. Reach 80+ to unlock the green "AI-Verified" badge. Fix the issues below to improve your score.

AI-Verified badge for pdfroom.com
Pending Audit — score below 80 threshold
<a href="https://seodiff.io/radar/domains/pdfroom.com" rel="noopener"><img src="https://seodiff.io/api/v1/badge?domain=pdfroom.com" alt="AI-Verified by SEODiff" width="280" height="52"></a>

💡 Paste in your site footer, GitHub README, or email signature. Badge updates automatically as your score changes.

� Deep Crawl Analysis 57 pages · Deep-10

Homepage ACRI
36
Single-page score
+14
Subpages outperform homepage
Δ delta
Site-Wide ACRI
51
Avg across 57 pages · Range 0–60
Topical Cohesion
6%
Topical Drift
TF-IDF cosine similarity
Total Words
106863
Avg Bloat
162.3×
RAG Fractures [?]
1
⚠️
1 RAG-Chunking Fracture Detected

Poorly formatted tables or pricing grids on 1 page will be split incorrectly during RAG chunking, causing AI models to hallucinate prices and features.

Page Type ACRI Token Bloat Words Status
https://pdfroom.com/books/insights/QpdMNXAxgaX
Insights (PDF) - 1.96 MB @ PDF Room
pricing 60 58.7× 2454 💰 Pricing
https://pdfroom.com/books/insights/LbXgPXx4gev
Insights (PDF) - 1.47 MB @ PDF Room
pricing 60 54.9× 2623 💰 Pricing
https://pdfroom.com/books/insights/j9ZdYZWZgV4
Insights (PDF) - 3.2 MB @ PDF Room
pricing 60 44.8× 3240 💰 Pricing
https://pdfroom.com/books/insights/bLvgBAOYdDw
Insights (PDF) - 2.4 MB @ PDF Room
pricing 60 44.1× 3286 💰 Pricing
https://pdfroom.com/books/insights/9qlgykz45MG
Insights (PDF) - 4.08 MB @ PDF Room
pricing 60 64.3× 2232 💰 Pricing
https://pdfroom.com/books/core-light-healing-my-personal-journey-and-advanced-healing-concepts-for-creating-the-life-you/0andLVZ2e3N
Core Light Healing: My Personal Journey and Advanced... (PDF)
pricing 60 93.8× 1533 💰 Pricing
https://pdfroom.com/books/dietary-reference-intakes/n0YpgQV2Nzo
Dietary Reference Intakes (PDF) - 886 KB @ PDF Room
docs 60 70.3× 2052
https://pdfroom.com/books/improvement-of-buildings-structural-quality-by-new-technologies-proceedings-of-the-final-conference-of-cost-action-c12-20-22-january-2005-innsbruck-austria/vRPkdNvdXrq
Improvement of buildings' structural quality by new... (PDF)
pricing 60 68.2× 2125
https://pdfroom.com/books/job-hazard-analysis-second-edition-a-guide-for-voluntary-compliance-and-beyond/oEjndOGdRqL
Job Hazard Analysis, Second Edition: A Guide for... (PDF)
pricing 60 47.2× 3083 💰 Pricing
https://pdfroom.com/books/best-practices-in-state-and-regional-innovation-initiatives-competing-in-the-21st-century/OKkM5rLgE3n
Best Practices in State and Regional Innovation... (PDF)
other 60 71.5× 2020
https://pdfroom.com/books/the-mathematical-sciences-in-2025/KjGk207gpmy
The Mathematical Sciences in 2025 (PDF) - 5.54 MB @ PDF Room
other 60 60.4× 2404
https://pdfroom.com/books/forex-the-ultimate-guide-to-price-action-trading-pdf/L3jN2RLdvW9
Forex : The Ultimate Guide To Price Action Trading √PDF (PDF)
blog 60 74.7× 1938
https://pdfroom.com/books/rising-above-the-gathering-storm-energizing-and-employing-america-for-a-brighter-economic-future/1v0K2lzdape
Rising Above the Gathering Storm: Energizing and... (PDF)
other 60 64.8× 2232
https://pdfroom.com/books/protecting-our-forces/yxQpdMydaXZ
Protecting Our Forces (PDF) - 3.25 MB @ PDF Room
other 60 55.1× 2634
https://pdfroom.com/books/combatting-cybercrime/bOor5WqgqDv
Combatting Cybercrime (PDF) - 11.39 MB @ PDF Room
other 60 50.3× 2889
https://pdfroom.com/books/developing-capacities-for-teaching-responsible-science-in-the-mena-region-refashioning-scientific-dialogue/3wW5mwZgYoD
Developing Capacities for Teaching Responsible Science... (PDF)
pricing 60 64.6× 2246 💰 Pricing
https://pdfroom.com/books/how-people-learn-brain-mind-experience-and-school-expanded-edition/NPXn2GrgxV8
How People Learn: Brain, Mind, Experience, and School:... (PDF)
blog 60 69.6× 2075
https://pdfroom.com/books/a-new-biology-for-the-21st-century/XeKRd682ZpL
A New Biology for the 21st Century (PDF) @ PDF Room
pricing 60 64.4× 2242 💰 Pricing
https://pdfroom.com/books/living-in-the-light-a-guide-to-personal-and-planetary-transformation/J9qlgy4dMG0
Living in the Light: A Guide to Personal and Planetary... (PDF)
pricing 60 149.2× 956 💰 Pricing
https://pdfroom.com/books/materials-for-high-temperature-power-generation-and-process-plant-applications/MkLg8p7gZBm
Materials for High Temperature Power Generation and... (PDF)
pricing 60 80.5× 1790 💰 Pricing
Showing 20 of 57 pages. Unlock full subpage table →
📂
Health by Sub-Directory
Average ACRI and top issues aggregated by URL path prefix
Path Pages Avg ACRI Ghost % Bloat Top Issue
/books/ 46 60 1% 82.1× Bot Blocked
/about/ 1 32 1% 1211.7× Bot Blocked
/contact/ 1 22 1% 4050.7× Bot Blocked
/products/ 1 0 0% 0.0× Low AI Readiness
/pricing/ 1 0 0% 0.0× Low AI Readiness
/integrations/ 1 0 0% 0.0× Low AI Readiness
/faq/ 1 0 0% 0.0× Low AI Readiness
/case-studies/ 1 0 0% 0.0× Low AI Readiness
/features/ 1 0 0% 0.0× Low AI Readiness
/docs/ 1 0 0% 0.0× Low AI Readiness
/blog/ 1 52 1% 113.3× Bot Blocked
/dmca/ 1 52 1% 95.5× Bot Blocked
🔗
Outbound External Citations
0 unique external domains cited across 57 pages
x.com ×50
reddit.com ×50
forum.pdfroom.com ×50
pinterest.com ×46
twitter.com ×46
facebook.com ×46
copyright.gov ×1
lumendatabase.org ×1
🔄 Re-Crawl & Update 📡 Track this Domain

Scores update automatically each month. Create a free account for on-demand re-crawls (3/month free).

🔌 API Access

Pull this data programmatically. All sub-page metrics are available via our public API.

curl https://seodiff.io/api/v1/deep10/domain/pdfroom.com

Get your free API key — 100 requests/month included.

🔗 Similar infrastructure Sites

Domains with a similar tech stack, industry, and AI readiness profile to pdfroom.com. Compare side-by-side.

Domain ACRI AI Score Tech Stack Token Bloat Schema
pdfroom.com (this site) 36 63 Cloudflare Pages 21.7× 0
bluebaytravel.co.uk 61 75 Cloudflare Pages 15.8× 2 Compare →
kaliscan.com 61 82 Cloudflare Pages 6.1× 0 Compare →
xnxxporno.cc 61 76 Cloudflare Pages 4.2× 0 Compare →
qplushost.com 61 74 Cloudflare Pages 2.0× 0 Compare →
onemillionpredictions.com 61 79 WordPress 20.9× 0 Compare →
Compare All 5 Similar Sites →

📊 Semantic Share of Voice

How often would an AI cite pdfroom.com when users ask about topics in this domain's niche? We run entity queries through our 188k-page search index and measure citation probability.

Analyzing citation landscape…

🩹

Remediation Patches

COPY-PASTE

Auto-generated code fixes tailored to pdfroom.com. Copy and paste these into your codebase to improve AI visibility. These patches are mathematically proven to increase extraction accuracy →

Add Organization JSON-LD
High Impact ⏱ 5 min
AI models cannot identify your brand entity without Organization schema. This is the #1 fix for AI visibility.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "Pdfroom",
  "url": "https://pdfroom.com",
  "logo": "https://pdfroom.com/apple-touch-icon.png",
  "sameAs": []
}
</script>
Add WebSite + SearchAction JSON-LD
High Impact ⏱ 5 min
Enables the Sitelinks Search Box in Google and allows AI to understand your site structure.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "WebSite",
  "name": "Pdfroom",
  "url": "https://pdfroom.com",
  "potentialAction": {
    "@type": "SearchAction",
    "target": "https://pdfroom.com/search?q={search_term_string}",
    "query-input": "required name=search_term_string"
  }
}
</script>
Reduce Token Bloat
Medium Impact ⏱ 1–2 hrs
Only 4% of your HTML is useful content. AI crawlers waste context window tokens on bloat.
html
<!-- Move inline CSS to external stylesheets -->
<link rel="stylesheet" href="/css/main.css">

<!-- Move inline scripts to external files with defer -->
<script src="/js/app.js" defer></script>

<!-- Remove duplicate navigation blocks -->
<!-- Keep only ONE <nav> in the <header> -->

<!-- Ensure <main> wraps your primary content -->
<main>
  <!-- Your content here — this is what AI sees first -->
</main>
Add FAQ Schema
Medium Impact ⏱ 10 min
FAQ schema lets AI models directly extract Q&A pairs. This is the easiest way to get featured in AI responses.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is Pdfroom?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Add your answer here — describe what Pdfroom does in 1-2 sentences."
      }
    },
    {
      "@type": "Question",
      "name": "How does Pdfroom work?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Explain the key features and how users interact with Pdfroom."
      }
    }
  ]
}
</script>
📈

Projected Impact

ROI EST.

If you apply the patches above, here's the estimated improvement for pdfroom.com:

Current Score
63
Projected Score
81
Improvement
+18 pts
Add Organization schema +6 pts
Add WebSite schema +4 pts
Reduce token bloat +5 pts
Add FAQ schema +3 pts

*Estimates based on SEODiff's scoring model. Actual results depend on implementation quality.

📋 Data Export

Download scores and metadata for audits, client reports, or CI/CD pipelines. Exports contain computed metrics only (no copyrighted content).

All data is generated automatically and updated with each crawl. JSON exports contain scores and metadata only (no copyrighted content).

Is this your company?

Monitor your AI visibility score weekly and get alerted when changes happen.

Start Free →

🧭 Self-Diffing (Private Layer)

For owned domains, combine this world snapshot with private drift + regression history.
Template Drift
Track in My Site
Drift → Traffic Impact
In development coming soon
Regression Incidents
Track in My Site
Internal Linking
Deep Audit graph
Semantic Structure
GEO view in Deep Audit
Content Quality
Thin/duplicate tracking

🕒 History

Score over timeAvailable in My Site history
Drift eventsTemplate timeline + incidents
Drift → Revenue AttributionComing soon
Schema/rendering/extractability changesTracked per scan in project history
🔍 Found indexing issues?
Run a free deep audit to diagnose crawled-not-indexed, soft 404s, redirect errors, and more.
Free Deep Audit → GSC Error Guide →