penn.museum 45 D
🛡️ SEO 42 🤖 GEO 80 ⚡ Perf 65 🏗️ Arch 62

penn.museum — Global SEODiff Score 45/100

penn.museum
📊

penn.museum registers just 15/100 on the ACRI scale — a bottom-tier result indicating systemic AI visibility problems. Within the education vertical, this places penn.museum below the industry average of 57 —, indicating room for competitive improvement. Server-side rendering keeps the ghost ratio near zero, giving AI systems direct access to all visible content. The token bloat ratio sits at a lean 3.4×, meaning the ratio of code to visible content is efficient — crawlers spend their token budget on actual information. Only 1 schema block is present — adding Organization, WebSite, and Breadcrumb schemas would significantly improve structured data coverage. Most AI crawlers are restricted by robots.txt, limiting how AI-powered search engines can index and surface this content.

45
D — Global SEODiff Score
Comprehensive search visibility assessment
Below average — Traditional SEO (42) has the most room for improvement.
🎯 Top Fix: Allow GPTBot + ClaudeBot in robots.txt → lift the score cap
🔬 Automated SEODiff Assessment · Snapshot: Mar 21, 2026 · 📋 API
📈 ACRI Trend 4 snapshots
Mar 5 Mar 21
🔔 Recent AI Indexing Activity
🔄 Mar 21 Content change detected
🔄 Mar 16 Content change detected
Does your site score higher than penn.museum?
Run the same 40-signal audit on your own domain — free, instant results.
Scan Your Site Free →
🧮 Score Transparency — How is this calculated?
🛡️ Traditional SEO (25% weight)42 × 0.25 = 10.5
🤖 AI Readiness / GEO (40% weight)80 × 0.40 = 32.0
⚡ Performance (20% weight)65 × 0.20 = 13.0
🏗️ Architecture & Trust (15% weight)62 × 0.15 = 9.3
Weighted sum = 10.5 + 32.0 + 13.0 + 9.3
⚠️ Fatal multiplier: All major AI bots blocked → ×0.5
Global SEODiff Score = 45 (D)
🚫
Gatekeeper Rule: Score cannot exceed 15. Both GPTBot and ClaudeBot are blocked in robots.txt. No major AI assistant can cite this site. Allow at least one major AI crawler to lift the cap. See Bot Access →
📊 ACRI Sub-Scores (AI Readiness Detail)
40
Bot Access
avg 92
97
Rendering
avg 93
53
Structure
avg 35
42
Schema
avg 9
80
Tech Stack
avg 63
🔀
Visibility Delta: Google vs AI
Google (Tranco)
Top 5%
Rank #54389
+62 pts
Gap
AI (ACRI)
Top 67%
Score 15/100

penn.museum punches above its weight in AI — AI visibility exceeds Google ranking. This is a competitive moat worth protecting. ACRI measures technical crawler readiness. Read the methodology →

Why penn.museum ranks here

Tech stackShopify
Industryeducation
RenderingHybrid
Schema coverage1 blocks
Token bloat3.4×

Fastest improvements

  • Allow GPTBot in robots.txt so AI crawlers can access your pages (see Crawl Access).
  • Allow ClaudeBot (many assistants rely on it) — blocking it often correlates with “AI invisibility.”
  • Create an llms.txt file so AI crawlers can discover your content structure without heavy crawling. Generate llms.txt →
  • Run a full entropy audit to find which DOM regions waste the most tokens. Run Entropy Audit →
🧪

JavaScript Rendering Check

We check what AI crawlers miss when they skip JavaScript execution.

Running headless browser to simulate AI extraction…
🛡️

Traditional SEO

42/100 25 % of Global Score 🟡 Medium Confidence

📝 Title Tag

4 chars
Too short

Optimal range: 30–60 characters for SERP display.

📋 Meta Description

0 chars
Missing

Optimal range: 120–160 characters for snippet control.

🔤 Heading Hierarchy

  • ✓ Exactly 1 <h1> tag — found 1
  • ✓ Has <h2> headings — found 16
  • ✗ <h2> not before <h1>

🔍 Indexability

  • ✗ Canonical tag present
  • ✓ No noindex directive
  • ✓ Meta viewport set
  • ✓ HTML lang attribute → en-gb
  • ➖ Hreflang tags — N/A (single language site)
  • ✓ Googlebot allowed by robots.txt

🌐 Social / OpenGraph

  • ✗ og:title
  • ✗ og:description
  • ✗ og:image
  • ✗ twitter:card
📐 How the SEO Pillar score is calculated

SEO Pillar = Title (20 pts) + Meta Desc (20 pts) + Heading Hierarchy (20 pts) + Indexability (20 pts) + Social/OG (20 pts)

Each sub-score is derived from the checks above. Canonical tag, lang attribute, og:image, and a single H1 are the highest-impact items.

🤖

AI Readiness / GEO

80/100 40 % of Global Score 🟢 High Confidence

This pillar aggregates citation share, hallucination risk, bot access, schema health, and content extractability. The individual diagnostic sections below contribute to this score.

🔗

Citation Alternatives

Research
💡
Insight: In the education sector, educationdirectory.com.au (ACRI: 86) currently has stronger AI extractability. AI models tend to prefer sources with higher semantic structure and schema coverage. Domains with ACRI < 40 see 3.5× more hallucinations. Read the research →
penn.museum
56
Your ACRI Score
86
Industry Peer ACRI
AI models prioritize pages with strong semantic structure and schema coverage. educationdirectory.com.au has schema coverage of 4 blocks and uses Custom / Proprietary. Improve your score by implementing the remediation patches below.
📊 Side-by-Side Comparison →
🚨

Hallucination Risk

Research

Is AI lying about your brand? This panel measures how likely LLMs are to hallucinate facts when extracting information from your page.

Analyzing hallucination risk…

🤖 Bot Access Matrix

GPTBot (OpenAI)
Blocked
ClaudeBot (Anthropic)
Blocked
CCBot (Common Crawl)
Blocked
Google-Extended
Allowed
Googlebot
Allowed
💡GPTBot is blocked. To appear in ChatGPT citations, add Allow: / under User-agent: GPTBot in your robots.txt.
💡ClaudeBot is blocked. To be cited by Claude, allow ClaudeBot in robots.txt.

👻 Rendering (Ghost Ratio) Docs

Ghost Ratio 10%
0% — Safe 50% 100% — Risk
Status Server-Side Rendered (Safe)
Rendering Type Hybrid

📊 Structure & Information Density Docs

Structure Grade 53/100 — Fair
Structured Elements 96 elements (96 lists, 0 rows, 0 headers)
Total Words1145
Raw Density8.4%

🏷️ Schema Health Docs

Organization Schema ✅ Present
Product / Service Schema ⚠️ Not Found
Total Schema Blocks1 block(s) — Basic (low value for AI)

Schema Coverage Map

2/7 schema types detected
✅ Organization
❌ Product/Service
❌ Breadcrumb
❌ FAQ
❌ Article
✅ WebSite
💡Product / Service schema missing. AI models don't know this is a SaaS product. Add Product or SoftwareApplication schema so AI understands what you offer and can surface pricing/features.
💡BreadcrumbList schema missing. AI cannot understand your site hierarchy or how pages relate to each other.
💡FAQ schema missing. Adding FAQPage schema lets AI models directly extract Q&A pairs for Featured Snippets and chatbot answers.

📐 AI Efficiency Metrics Docs

71
AI Extractability
Low
Crawl Cost
High
Blocklist Risk
Extractability71/100 — AI models can easily extract answers from this page
Crawl CostLow (30/100) — efficient for AI crawlers to process
Blocklist RiskHigh — 3 of 5 AI crawlers blocked

Token Bloat Research

29%
🗑️ 71%
Useful Content (17.2 KB)Bloat (41.7 KB)
Token Bloat Ratio3.4× — Lean

Multimodal Readiness

Visual Context52% Optimized for Vision
Image Alt Coverage23 / 44 images have alt text

TDM Rights

TDM-Reservation HeaderNot set
X-Robots-Tag: noaiNot set

🔥 Structural Entropy Check Research

40 Entropy
Poor Token Bloat: High
Noise Ratio: 70.8% · SNR: 0.41 · Signal: 4393 / Noise: 10665 tokens

🔬 AI-Crawler Simulation

See your website the way AI crawlers do. CSS stripped, structure labeled, content chunked.

🌐
This is what humans see — styled, branded, visual.
Toggle to "AI Agent View" to see what GPTBot, ClaudeBot, and other AI crawlers actually extract from this page.
🤖

AI Answer Preview

NEW

See how AI models summarize your site. Left: your actual content. Right: what the LLM extracts and says about you.

Simulating AI extraction…
🧠

The LLM Interpretation

AI-VERIFIED

SEODiff AI analyzed the extracted content of penn.museum and produced this structured business intelligence. Fields marked SEMANTIC VOID indicate information the AI could not find — a critical gap in your site’s machine-readability.

Core Offering
The Penn Museum utilizes its collections to inspire creative projects and research.
Target Audience
Researchers, artists, filmmakers, designers, and students interested in cultural heritage.
Pricing Model
⚠ SEMANTIC VOID
🏆 Competitive Moat
Unique access to a vast and diverse collection of historical and cultural artifacts.
📊 Content Depth
5/10
Analyzed by SEODiff AI · 2026-03-03

🔧 Tech Stack

FrameworkShopify
AI-Readiness Score80/100
ServerApache
CDN
HTTP Status200
Load Time656 ms
Raw HTML Size58.8 KB
Visible Text Size17.2 KB

Performance & Speed

65/100 20 % of Global Score 🟢 High Confidence

⏱️ Time to First Byte

656 ms
Slow — bots may time out or deprioritise

Google considers <200 ms "good". AI crawlers may have even shorter timeouts.

📦 Page Weight

822
DOM nodes
59 KB
HTML payload
Lean page — fast for bots and users

🗄️ Cache & CDN

  • ✓ Cache-Control header → no-cache
  • ✗ CDN cache status
  • ✗ CDN detected

🔬 Tracker Tax

0
tracker scripts
0
third-party domains
0.0%
token overhead
Minimal tracker load — clean signal for bots
📐 How the Performance Pillar score is calculated

Perf Pillar = TTFB (35 pts) + Page Weight (25 pts) + Cache/CDN (20 pts) + Tracker Tax (20 pts)

TTFB <200 ms = full marks. DOM >3000 or payload >300 KB incurs heavy penalties. Tracker scripts beyond 5 reduce score.

🏗️

Architecture & Trust

62/100 15 % of Global Score 🟡 Medium Confidence

🗺️ Sitemap & Robots

  • ✗ Sitemap declared in robots.txt
  • ✓ Googlebot allowed
  • ✗ GPTBot allowed
  • ✗ ClaudeBot allowed

🔗 Linking

169
internal links
15
external links
Good internal linking — helps crawlers discover content

🔒 Security & Trust

  • ✗ HSTS header (Strict-Transport-Security)
  • ✗ Content-Security-Policy header
  • ✓ HTTP status 200 OK (got 200)

♿ Accessibility Signals

  • ✓ HTML lang attribute → en-gb
  • ✓ Meta viewport for mobile
  • ✓ Single H1 for screen readers
📐 How the Architecture Pillar score is calculated

Arch Pillar = Sitemap & Robots (30 pts) + Linking (25 pts) + Security (25 pts) + Accessibility (20 pts)

Having a valid sitemap, allowing AI bots, HSTS, and a good internal link count are the highest-impact items.

🏅 AI-Verified Trust Badge

Your site scores 56/100. Reach 80+ to unlock the green "AI-Verified" badge. Fix the issues below to improve your score.

AI-Verified badge for penn.museum
Pending Audit — score below 80 threshold
<a href="https://seodiff.io/radar/domains/penn.museum" rel="noopener"><img src="https://seodiff.io/api/v1/badge?domain=penn.museum" alt="AI-Verified by SEODiff" width="280" height="52"></a>

💡 Paste in your site footer, GitHub README, or email signature. Badge updates automatically as your score changes.

� Deep Crawl Analysis 21 pages · Deep-10

Homepage ACRI
56
Single-page score
-53
Severe hidden bloat
Δ delta
Site-Wide ACRI
3
Avg across 21 pages · Range 0–49
🔍
Hidden Bloat Detected

Homepage scores 56, but internal pages average only 3 — a -53-point gap. Blogs, docs, and legacy content are dragging down AI readability site-wide.

Topical Cohesion
1%
Topical Drift
TF-IDF cosine similarity
Total Words
276
Avg Bloat
5.4×
Page Type ACRI Token Bloat Words Status
https://penn.museum/blog
Voices
blog 49 20.1× 273
https://penn.museum/support
Giving to Penn
support 23 93.3× 3
https://penn.museum/pricing pricing 0 0.0× 0
https://penn.museum/api docs 0 0.0× 0
https://penn.museum/guides blog 0 0.0× 0
https://penn.museum/help support 0 0.0× 0
https://penn.museum/about about 0 0.0× 0
https://penn.museum/resources blog 0 0.0× 0
https://penn.museum/features product 0 0.0× 0
https://penn.museum/products product 0 0.0× 0
https://penn.museum/solutions product 0 0.0× 0
https://penn.museum/docs docs 0 0.0× 0
https://penn.museum/get-started conversion 0 0.0× 0
https://penn.museum/demo conversion 0 0.0× 0
https://penn.museum/case-studies social-proof 0 0.0× 0
https://penn.museum/integrations integrations 0 0.0× 0
https://penn.museum/contact support 0 0.0× 0
https://penn.museum/faq support 0 0.0× 0
https://penn.museum/security trust 0 0.0× 0
https://penn.museum/trust trust 0 0.0× 0
Showing 20 of 21 pages. Unlock full subpage table →
📂
Health by Sub-Directory
Average ACRI and top issues aggregated by URL path prefix
Path Pages Avg ACRI Ghost % Bloat Top Issue
/blog/ 1 49 0% 20.1× High JS Bloat
/solutions/ 1 0 0% 0.0× Low AI Readiness
/features/ 1 0 0% 0.0× Low AI Readiness
/case-studies/ 1 0 0% 0.0× Low AI Readiness
/contact/ 1 0 0% 0.0× Low AI Readiness
/integrations/ 1 0 0% 0.0× Low AI Readiness
/support/ 1 23 1% 93.3× Bot Blocked
/pricing/ 1 0 0% 0.0× Low AI Readiness
/security/ 1 0 0% 0.0× Low AI Readiness
/help/ 1 0 0% 0.0× Low AI Readiness
/resources/ 1 0 0% 0.0× Low AI Readiness
/about/ 1 0 0% 0.0× Low AI Readiness
/trust/ 1 0 0% 0.0× Low AI Readiness
/careers/ 1 0 0% 0.0× Low AI Readiness
/get-started/ 1 0 0% 0.0× Low AI Readiness
🔄 Re-Crawl & Update 📡 Track this Domain

Scores update automatically each month. Create a free account for on-demand re-crawls (3/month free).

🔌 API Access

Pull this data programmatically. All sub-page metrics are available via our public API.

curl https://seodiff.io/api/v1/deep10/domain/penn.museum

Get your free API key — 100 requests/month included.

🔗 Similar education Sites

Domains with a similar tech stack, industry, and AI readiness profile to penn.museum. Compare side-by-side.

Domain ACRI AI Score Tech Stack Token Bloat Schema
penn.museum (this site) 56 15 Shopify 3.4× 1
jdsports.ca 79 89 Shopify 5.3× 2 Compare →
edutap.in 79 86 WordPress 3.2× 1 Compare →
geidai.ac.jp 80 90 WordPress 3.5× 1 Compare →
lsusports.net 79 84 WordPress 2.8× 1 Compare →
ouhs.jp 80 84 WordPress 3.8× 1 Compare →
Compare All 5 Similar Sites →

📊 Semantic Share of Voice

How often would an AI cite penn.museum when users ask about topics in this domain's niche? We run entity queries through our 188k-page search index and measure citation probability.

Analyzing citation landscape…

🩹

Remediation Patches

COPY-PASTE

Auto-generated code fixes tailored to penn.museum. Copy and paste these into your codebase to improve AI visibility. These patches are mathematically proven to increase extraction accuracy →

Allow GPTBot in robots.txt
High Impact ⏱ 2 min
GPTBot is blocked — your content cannot appear in ChatGPT citations. Add this to your robots.txt:
text
User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User
Allow: /
Allow ClaudeBot in robots.txt
Medium Impact ⏱ 2 min
ClaudeBot is restricted — Claude may not be able to cite your content. Many AI assistants rely on it.
text
User-agent: ClaudeBot
Allow: /

User-agent: anthropic-ai
Allow: /
Add FAQ Schema
Medium Impact ⏱ 10 min
FAQ schema lets AI models directly extract Q&A pairs. This is the easiest way to get featured in AI responses.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is Museum?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Add your answer here — describe what Museum does in 1-2 sentences."
      }
    },
    {
      "@type": "Question",
      "name": "How does Museum work?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Explain the key features and how users interact with Museum."
      }
    }
  ]
}
</script>
📈

Projected Impact

ROI EST.

If you apply the patches above, here's the estimated improvement for penn.museum:

Current Score
15
Projected Score
31
Improvement
+16 pts
Allow GPTBot +8 pts
Allow ClaudeBot +5 pts
Add FAQ schema +3 pts

*Estimates based on SEODiff's scoring model. Actual results depend on implementation quality.

📋 Data Export

Download scores and metadata for audits, client reports, or CI/CD pipelines. Exports contain computed metrics only (no copyrighted content).

All data is generated automatically and updated with each crawl. JSON exports contain scores and metadata only (no copyrighted content).

Is this your company?

Monitor your AI visibility score weekly and get alerted when changes happen.

Start Free →

🧭 Self-Diffing (Private Layer)

For owned domains, combine this world snapshot with private drift + regression history.
Template Drift
Track in My Site
Drift → Traffic Impact
In development coming soon
Regression Incidents
Track in My Site
Internal Linking
Deep Audit graph
Semantic Structure
GEO view in Deep Audit
Content Quality
Thin/duplicate tracking

🕒 History

Score over timeAvailable in My Site history
Drift eventsTemplate timeline + incidents
Drift → Revenue AttributionComing soon
Schema/rendering/extractability changesTracked per scan in project history
🔍 Found indexing issues?
Run a free deep audit to diagnose crawled-not-indexed, soft 404s, redirect errors, and more.
Free Deep Audit → GSC Error Guide →