Technical SEO
Chapter 08 / 09
Website architecture
How URLs are organised, how internal links flow, and how depth-from-homepage affects what gets crawled and ranked. The structure that compounds vs the structure that fragments.

Website architecture is what decides which pages get crawled, which pages accumulate authority, and how clearly the site signals topical expertise to search engines. It’s the combination of URL structure, content hierarchy, and internal linking topology — and it’s the single most under-invested area of technical SEO.
This article covers the four architecture decisions that compound for years: URL structure, click depth, hub-and-spoke clustering, and internal linking topology.
“Architecture is the multiplier on every other piece of SEO work. A great article in a bad architecture ranks worse than a mediocre article in a great architecture. The structure decides how much of every effort actually compounds.”
Decision 1 — URL structure
URL structure should be (1) human-readable, (2) topically descriptive, (3) stable over time, and (4) consistent across the site. The four common patterns:
| Pattern | Example | When to use |
|---|---|---|
| Flat | /article-slug/ | Editorial content, academy, blog — when migration safety matters more than nesting |
| Single-level hierarchy | /category/article-slug/ | Topical clusters where the category adds meaning |
| Deep hierarchy | /category/subcategory/sub-subcategory/page/ | E-commerce / marketplace catalogs where the hierarchy IS the content |
| Date-based | /2026/05/article-slug/ | News-style sites with timeliness as a signal — generally avoid for evergreen |
The default for editorial content (articles, academy, knowledge base): flat. The cost of nesting (longer URLs, harder migrations, weaker link equity flow) outweighs the benefit (slightly better topical signal). For e-commerce / catalog content the hierarchy is the data — there’s no avoiding it, the question is making sure the hierarchy maps to user mental models.
Decision 2 — Click depth
Click depth = the minimum number of clicks from the homepage to reach a given page. Critical principle: every meaningful page should be reachable in 3 clicks or fewer.
- Click depth 1: homepage, primary navigation pages — pillar pages, top categories, key landing pages.
- Click depth 2: cluster gateway pages, category pages, hub pages.
- Click depth 3: individual articles, products, specific landing pages — the bulk of indexable content.
- Click depth 4+: long-tail content, archive pages, deep filter combinations. Reachable but lower-priority.
Pages at depth 4+ get crawled less often, accumulate authority more slowly, and lose ranking compounding. Sites with traffic plateaus often have a click-depth problem buried somewhere — articles with potential trapped at depth 5 because the navigation structure didn’t evolve as the content scaled.
Decision 3 — Hub-and-spoke clustering
The dominant pattern for editorial / knowledge content in 2026 is hub-and-spoke (also called pillar-and-cluster, topic-cluster):
- Pillar page (the hub). Comprehensive overview of a major topic. Links out to all supporting articles. Targets the high-volume head term.
- Cluster articles (the spokes). Each one covers a specific sub-topic in depth. All link back up to the pillar; many cross-link to relevant siblings.
- Cluster gateway pages. Optional intermediate level for very large clusters — the gateway lists all articles in a topic.
The structural argument: search engines reward sites that demonstrate topical authority on a theme. A pillar with 8–15 well-linked supporting articles signals depth on a topic in a way that 8 isolated articles cannot.
This very academy is built on hub-and-spoke: /seo/ is the pillar, the eight cluster gateways are intermediate hubs, and individual articles are spokes that link back to the gateway and across to siblings.
Decision 4 — Internal linking topology
Internal links are how PageRank-equivalent authority flows through the site. The four patterns to deploy in combination:
| Pattern | Where | Job |
|---|---|---|
| Top-down | Pillar → cluster gateway → article | Distributes authority from highest-traffic pages downward |
| Bottom-up | Article → its cluster gateway → pillar | Concentrates authority back to topical hubs; signals cluster cohesion |
| Sibling cross-links | Article → related article in same cluster | Builds topical density; passes intent context |
| Cross-cluster bridges | Article in cluster A → relevant article in cluster B | Models the real-world topical relationships |
All four patterns should appear; the absence of any one is a topology gap. Pillar-only linking creates lonely articles; sibling-only linking misses authority concentration; cross-cluster bridges only across unrelated topics dilute focus.
Anchor-text discipline
Internal anchor text matters more than most teams treat it. The principles:
- Descriptive, varied anchor text. “Click here” and “learn more” pass context-free signals; topic-relevant anchor text passes topical signals.
- No exact-match keyword spam. Linking to your “best CRM software” page from 80 internal pages with the exact anchor “best CRM software” is the kind of pattern that historically triggered manipulation flags.
- Natural variation. Mix exact match, partial match, branded, and contextual phrases — that’s how organic linking looks.
- Don’t link the same anchor to multiple destinations. Confuses signals; pick one destination per concept and link consistently.
Special cases
Faceted navigation (e-commerce)
Faceted nav (filter by colour, size, price, brand, etc) is a major source of architecture problems. Every filter combination is a URL Google might try to crawl. For a catalog with 5 filters of 10 values each, that’s 100,000+ URL combinations from a few hundred actual products.
Standard mitigations:
- Decide which filter combinations have search demand and let those be crawlable / indexable (e.g. “red Nike running shoes”).
- Canonical all other filter combinations to the parent category.
- Add
noindex, followon combinations that should be reachable but not indexed. - Block low-value parameter combinations via robots.txt if crawl budget is tight.
Multi-locale architecture
Three URL patterns for multi-language sites, in decreasing order of common preference:
- Subdirectories —
/en/,/es-mx/. Single domain consolidates authority; cleanest for SEO; easiest to maintain. - Subdomains —
en.example.com,es.example.com. Each subdomain accumulates separate authority — slower to scale across markets. - ccTLDs —
example.com,example.mx. Strongest geo signal per market but each TLD is a separate site, separate authority, separate crawl budget. Justifies the complexity only at large scale.
For most sites, subdirectories with hreflang are the right answer.
Architecture audit checklist
- Crawl the site with a tool that reports click depth (Screaming Frog, Sitebulb).
- Plot click-depth distribution. Identify pages at depth 4+; verify they belong there.
- Identify orphan pages — indexable URLs with zero internal links pointing at them.
- Verify pillar / cluster structure — every cluster has a gateway page; gateway links to all articles; articles link back to gateway.
- Audit internal anchor text for diversity and topical relevance.
- Check faceted nav — confirm crawlable / indexable URL combinations match search demand.
- Verify URL stability — no recent template changes that broke historical URLs without 301s.
- Check breadcrumbs and structured data — every page should have
BreadcrumbListschema reflecting the actual hierarchy.
The bottom line
Architecture is the multiplier on every other piece of SEO work. URL structure that’s readable, stable, and consistent. Click depth that keeps important pages within 3 clicks of the homepage. Hub-and-spoke clustering that signals topical authority. Internal linking topology that flows authority in all four directions (top-down, bottom-up, sibling, cross-cluster). Get those four right and every article that ships compounds against the existing structure; get any of them wrong and content investment leaks authority every quarter.
Common questions
Common questions
Quick answers to what we get asked before every trial signup.
The combination of URL structure, hierarchical organisation of content, and internal linking topology. Architecture controls (1) how easily crawlers discover pages, (2) how authority flows between pages via internal links, (3) how clearly the site signals topical authority, and (4) how predictable the URLs are for users and engines. Good architecture compounds — every new page benefits from the structure already in place. Bad architecture fragments — pages compete with each other and authority dilutes.
In this cluster