Skip to content

docs(seo): SPARROW structured-data + social + self-hosted assets parity#62

Open
rain-Brian wants to merge 3 commits into
mainfrom
seo/sparrow-infra-structured-data
Open

docs(seo): SPARROW structured-data + social + self-hosted assets parity#62
rain-Brian wants to merge 3 commits into
mainfrom
seo/sparrow-infra-structured-data

Conversation

@rain-Brian
Copy link
Copy Markdown
Collaborator

Replicates the merged microsoft/Biodiversity#653 SEO infrastructure pattern for the SPARROW docs site. Part of ADO Epic 506340 (Phase 1 repo on-page SEO).

What changed

  • Structured data (overrides/main.html, new theme.custom_dir: overrides): SoftwareSourceCode JSON-LD on the homepage and a BreadcrumbList on every other page. Both are gated on page.file.src_path, which is unique per page, so a schema block cannot be copied between pages by accident.
  • Social cards: Open Graph and Twitter Card meta on every page, using an absolute share-image URL ({site_url}assets/cat.png).
  • Self-hosted assets: favicon and logo now point at docs/assets/cat.png instead of the Zenodo URL, removing a cross-origin dependency.
  • Plugins: added callouts (first plugin) and git-revision-date-localized (with enable_creation_date and fallback_to_build_date); listed both in docs-requirements.txt. The docs-deploy actions/checkout@v4 step now sets fetch-depth: 0 so the revision-date plugin can read full git history.

Keywords in the JSON-LD target SPARROW's lane in the cluster keyword-ownership matrix (solar-powered edge AI, NVIDIA Jetson camera trap, Starlink wildlife, bioacoustic edge device), avoiding overlap with the MegaDetector and PyTorch-Wildlife repos.

Notes

  • SPARROW has no FAQ page, so no FAQPage schema is included.
  • mkdocs build --strict passes cleanly. Verified the built site/index.html: homepage title renders, both JSON-LD blocks parse as valid JSON (SoftwareSourceCode on index, BreadcrumbList on sub-pages), zero remaining Zenodo references for theme assets, and OG/Twitter meta carry the absolute image URL.

Add SEO infrastructure to the SPARROW docs site, mirroring the merged
microsoft/Biodiversity pattern:

- overrides/main.html: SoftwareSourceCode JSON-LD on the homepage and a
  BreadcrumbList on every other page (gated on page.file.src_path so the
  schema cannot leak between pages), plus Open Graph and Twitter Card meta
  with an absolute share-image URL.
- Self-host the favicon and logo at docs/assets/cat.png, dropping the
  Zenodo cross-origin references.
- Add the callouts and git-revision-date-localized plugins (with
  fetch-depth: 0 on the docs-deploy checkout so commit dates resolve).
@rain-Brian rain-Brian requested a review from zhmiao June 3, 2026 22:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant