FAQ - What is indexing?

The Google Search Appliance (GSA) will crawl all the domains and subdomains added to its' URL list. It processes each of the pages as it crawls in order to compile a massive index of all the words it sees and their location on each page. It may take 100 or even 1000 jumps for the crawler to find a page, but if the page is linked from another page it will be indexed.

The GSA will also process additional information including meta element, key content tags (title tags) and attributes (ALT attributes). The GSA will follow links contained in PDF documents and Flash files. It will not follow links in other formats, such as Microsoft Office documents.

Exceptions:

  • The GSA honours robots.txt files, META robot entries and googleoon/googleoff tags.
  • Service Nova Scotia may have blocked either the directory or web server that contains the page your are looking for. We have blocked some pages at the request of the Government Web Coordinator due to inappropriateness to Government of Nova Scotia Standards (personal pages on EDnet servers) or redundancy with other pages.
  • We have allowed only certain file formats from be indexed in order to maintain a high level of integrity and quality user experience with search results.

Related Links