Sitemap

Maintaining the HTML sitemap is critical to ensuring that the right content appears on the search engine results page when a user performs a search. Only the important pages that we want to drive traffic to should be listed on the sitemap. Just as importantly, pages that we don’t want to drive traffic to should not appear on the sitemap.

AEM automatically adds every page to the sitemap, except pages in the /campaign/ directory. Any page that should not appear must be manually "excluded" by a content author before the page launches.

Follow these steps to exclude a page from indexing:

  1. Manually exclude the page from the sitemap. On the /site-map.html page, click the wrench option, click the “Add field” button, and navigate to the page to auto-populate its path. Click the checkmark and publish the sitemap.

  2. From the page that should be excluded, open the page properties and check “No Index”, “No-Follow” and “Hide in Navigation”.
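
After publishing, it can be useful to confirm that step 2 took effect on the live page. The sketch below is one possible check, not part of the AEM workflow itself. It assumes that the “No Index” and “No-Follow” page properties are rendered as a standard robots meta tag (for example, <meta name="robots" content="noindex, nofollow">); whether they are depends on the site's templates, so verify that against a real page first. The function names are hypothetical.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. a bare attribute), so default to "".
        attr_map = {name.lower(): (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() == "robots":
            for part in attr_map.get("content", "").split(","):
                directive = part.strip().lower()
                if directive:
                    self.directives.add(directive)


def robots_directives(html_text):
    """Return the set of robots directives found in the page HTML."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    return parser.directives


def is_excluded_from_indexing(html_text):
    """True only if the page carries both noindex and nofollow (assumed rendering)."""
    return {"noindex", "nofollow"} <= robots_directives(html_text)
```

To use it, pass in the HTML source of the published page (for example, copied from the browser's "view source"). A result of False means at least one of the two directives is missing from the rendered page.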

Pages that should NOT be listed on the sitemap:

  • Pages that we don't want to drive search engine traffic to
  • Pages for outdated events
  • Pages in the /campaign/ directory (this happens automatically)
  • Redirected pages
  • Archived pages (content authors need to mark them as “No Index”, “No-Follow” and “Hide in Navigation”)
  • Thank you pages for gated content
  • Content pages for gated experiences (pages that contain a download or the actual gated content)
  • Registration confirmation pages
  • Search engine results page
  • 404 page
  • Pages built to test a layout or functionality
  • Pages for an A/B test
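
A periodic audit of the sitemap against this list can be partly scripted. The following is a minimal sketch under stated assumptions: the list of sitemap paths and the list of pages that authors have decided to exclude are gathered by hand (or by some other means not covered here), and all example paths are hypothetical. It only flags paths in the /campaign/ directory and paths on the exclusion list; the other categories above still need a manual review.

```python
def normalize(path):
    """Lowercase a path and drop any trailing slash so comparisons are consistent."""
    cleaned = path.strip().lower()
    if len(cleaned) > 1 and cleaned.endswith("/"):
        cleaned = cleaned[:-1]
    return cleaned


def find_unwanted_sitemap_paths(sitemap_paths, excluded_paths):
    """Return sitemap paths that should not be listed.

    A path is flagged if it sits in a /campaign/ directory or if it
    matches an entry in the hand-maintained exclusion list.
    """
    excluded = {normalize(p) for p in excluded_paths}
    unwanted = []
    for path in sitemap_paths:
        candidate = normalize(path)
        if "/campaign/" in candidate or candidate in excluded:
            unwanted.append(path)
    return unwanted


# Hypothetical example data, for illustration only.
sitemap = [
    "/products/overview.html",
    "/campaign/spring-offer.html",
    "/events/2019-summit.html",
    "/resources/thank-you.html",
]
exclusions = ["/events/2019-summit.html", "/resources/thank-you.html"]

flagged = find_unwanted_sitemap_paths(sitemap, exclusions)
```

Any path that ends up in flagged is a candidate for the two exclusion steps described earlier.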