A search engine cache is a cache of web pages that shows the page as it was when it was indexed by a web crawler. Cached versions of web pages can be used to view the contents of a page when the live version cannot be reached, has been altered or taken down.[1]
A web crawler collects the contents of a web page, which is then indexed by a web search engine. The search engine might make the copy accessible to users. Web crawlers that obey restrictions in robots.txt[2] or meta tags[3] by the site webmaster may not make a cached copy available to search engine users if instructed not to.
Search engine cache can be used for crime investigation,[4] legal proceedings[5] and journalism.[6][1] Examples of search engines that offer their users cached versions of web pages are Bing, Yandex Search, and Baidu.
Search engine cache may not be fully protected by the usual laws that protect technology providers from copyright infringement claims.[7]
When a page becomes controversial, the publishers may take it down or alter it without acknowledgment. If you suspect you're running into the problem, the first place to turn is Google's cache of the page as it was when it did its last crawl.
noarchive: Do not show a cached link in search results.
noarchive - Don't show a Cached link for a page in search results.
For the investigator this can be a valuable piece of information. Depending on when Google crawled the site, the last page may contain information different from the current page. Documenting and capturing Google's cached page of a webpage can therefore be important step to ensure this time snapshot is preserved.
The World Wide Web is not as ephemeral as one might think. An increasing number of older web pages are available online through such services as the Wayback Machine, Yahoo Cache, or Bing Cache. Some plaintiffs' lawyers and corporate gadflies use these services as a matter of routine.
Journalists can also access delisted content via the Google cache.
For the cache function, the Court rejected the exception of a "technically necessary copy". This exception exempts temporary reproduction which is a necessary part of a technical process applied by an intermediary for transmission in a network between third parties. According to the Court, the cache copy that Google stores on its server is not technically necessary for efficient transmission.