archive.today

From Infogalactic: the planetary knowledge core


archive.today
Screenshot of the archive.today home page
Web address
Type of site
Web archiving
Registration No
Available in Multilingual
Launched May 16, 2012[2][3]

archive.today (or archive.is) is a web archiving site, founded in 2012, that saves snapshots on demand and supports JavaScript-heavy sites such as Google Maps and progressive web apps such as Twitter.[4] It records two snapshots of each page: one replicates the original webpage, including any functional live links; the other is a screenshot of the page.[5]

History

Archive.today was founded in 2012. The site originally branded itself as archive.today, but in May 2015, changed the primary mirror to archive.is.[6]

In January 2019, it began to deprecate the archive.is domain in favor of the archive.today mirror.[7]

Features


Functionality

Archive.today can capture individual pages in response to explicit user requests.[8][9][10] Since its beginning, it has supported crawling pages with URLs containing the now-deprecated hash-bang fragment (#!).[11]
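A hash-bang URL keeps the page state in the fragment after `#!`. Because the fragment is not sent to the server in an HTTP request, a crawler has to interpret it on the client side. The following minimal sketch only shows how such a fragment can be recognised; the function name `hashbang_state` and the example URLs are illustrative assumptions, not part of archive.today.

```python
from typing import Optional
from urllib.parse import urlsplit


def hashbang_state(url: str) -> Optional[str]:
    """Return the application state carried after '#!', or None if absent."""
    fragment = urlsplit(url).fragment
    # A hash-bang fragment is an ordinary fragment whose first character is '!'.
    if fragment.startswith("!"):
        return fragment[1:]
    return None


print(hashbang_state("https://example.com/#!/user/42"))
print(hashbang_state("https://example.com/page#top"))
```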

Archive.today records only text and images, excluding XML, RTF, spreadsheet (xls or ods) and other non-static content. However, videos for certain sites, like Twitter, are saved.[12] It keeps track of the history of snapshots saved, requesting confirmation before adding a new snapshot of an already saved page.[13][14]

Pages are captured at a browser width of 1,024 pixels. CSS is converted to inline CSS, removing responsive web design and selectors such as :hover and :active. Content generated using JavaScript during the crawling process appears in a frozen state.[15] HTML class names are preserved inside the old-class attribute. When text is selected, a JavaScript applet generates a URL fragment seen in the browser's address bar that automatically highlights that portion of the text when visited again.
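The effect of inlining CSS can be illustrated with a toy example. The sketch below is not archive.today's implementation; it assumes a simplified stylesheet given as a dictionary of class selectors, and the helper name `inline_element` is hypothetical. It shows why rules such as `:hover` disappear (they have no inline equivalent) and how the original class names can be kept in an `old-class` attribute.

```python
from html import escape


def inline_element(tag, classes, rules, text=""):
    """Render one element with its class-based rules folded into a style attribute."""
    declarations = []
    for selector, body in rules.items():
        # Pseudo-class selectors such as ".btn:hover" cannot be expressed
        # in a style attribute, so they are skipped.
        if ":" in selector:
            continue
        if selector.startswith(".") and selector[1:] in classes:
            declarations.append(body.strip().rstrip(";"))
    style = "; ".join(declarations)
    old_class = " ".join(classes)
    return (
        f'<{tag} style="{escape(style)}" old-class="{escape(old_class)}">'
        f"{escape(text)}</{tag}>"
    )


# Hypothetical stylesheet: one plain rule and one :hover rule.
RULES = {".btn": "color: red;", ".btn:hover": "color: blue;"}
print(inline_element("a", ["btn"], RULES, "Go"))
```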

Web pages cannot be copied from archive.today to web.archive.org as a second-level backup, because archive.today excludes the Wayback Machine and does not save its snapshots in WARC format. The reverse, from web.archive.org to archive.today, is possible,[16] but the copy usually takes more time than a direct capture. Some websites are retroactively removed from the Internet Archive's listings, or blocked from being saved, because of their robots.txt file; archive.today does not honor robots.txt.[10]
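For comparison, a crawler that does honor robots.txt checks the file before fetching a page. The sketch below uses Python's standard `urllib.robotparser` to show that check; the crawler names and the robots.txt content are made up for illustration and do not describe any real site or archive.

```python
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_lines, user_agent, url):
    """Return True if the given robots.txt lines permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt that shuts out a single crawler named "examplebot".
ROBOTS = ["User-agent: examplebot", "Disallow: /"]

print(allowed_by_robots(ROBOTS, "examplebot", "https://example.com/page"))
print(allowed_by_robots(ROBOTS, "otherbot", "https://example.com/page"))
```

An archiver that ignores robots.txt simply skips this check and fetches the page regardless of the result.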

The search toolbar supports advanced keyword operators, using * as the wildcard character. A pair of quotation marks restricts the search to an exact sequence of keywords present in the title or body of the webpage, whereas the insite operator restricts it to a specific Internet domain.[17]

Once a web page is archived, it cannot be deleted directly by any Internet user.[18] Advertisements and popups can be removed from archived pages, and links expanded, by asking the owner on his blog.[19]

While saving a dynamic list, archive.today's search box shows only a result that links to the previous and following sections of the list (e.g. 20 links per page).[20] The other saved web pages are filtered, and can sometimes be found through one of their occurrences.[21][clarification needed]

The search feature is backed by Google CustomSearch. If it delivers no results, archive.today attempts to utilize Yandex Search.[22]

While saving a page, a list of URLs for individual page elements and their content sizes, HTTP statuses and MIME types is shown. This list can only be viewed during the crawling process.

Archived pages can be downloaded as a ZIP file, except for pages archived since 29 November 2019, when archive.today changed its browser engine from PhantomJS to Chromium.[23]

In July 2013, Archive.today began supporting the API of the Memento Project.[24][25]
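In the Memento protocol (RFC 7089), a client asks a TimeGate for the snapshot closest to a given moment by sending an `Accept-Datetime` header in HTTP-date format. The sketch below only builds such a request and performs no network access; the TimeGate base URL `https://archive.example/timegate` is a placeholder assumption, not a confirmed archive.today endpoint, and the function name is hypothetical.

```python
from datetime import datetime, timezone
from email.utils import format_datetime


def memento_request(timegate_base, target_url, when):
    """Return (url, headers) for a Memento TimeGate request.

    `when` should be a timezone-aware datetime; it is converted to UTC.
    """
    when_utc = when.astimezone(timezone.utc)
    # Accept-Datetime uses the HTTP-date format, always expressed in GMT.
    headers = {"Accept-Datetime": format_datetime(when_utc, usegmt=True)}
    url = timegate_base.rstrip("/") + "/" + target_url
    return url, headers


url, headers = memento_request(
    "https://archive.example/timegate",  # placeholder TimeGate, an assumption
    "https://example.com/",
    datetime(2012, 5, 16, tzinfo=timezone.utc),
)
print(url)
print(headers)
```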

Worldwide availability

Australia


In March 2019, the site was blocked for six months by several Australian internet providers in the aftermath of the Christchurch mosque shootings in an attempt to limit distribution of the footage of the attack.[26][27] It has since been unblocked.

China


According to GreatFire.org, archive.today has been blocked in China since March 2016,[28] archive.li since September 2017,[29] archive.fo since July 2018,[30] and archive.ph since December 2019.[31]

Finland

On 21 July 2015, the operators blocked access to the service from all Finnish IP addresses, stating on Twitter that they did this in order to avoid escalating a dispute they allegedly had with the Finnish government.[32]

Russia


In Russia, only HTTP access is possible; HTTPS connections are blocked.[33][34] Unlike HTTPS, HTTP is not encrypted, so parties listening on the network can read and modify the whole communication in transit, including the URL of the requested page, the returned content, and strings that identify the sending device (such as the User-Agent header and cookies).

Cloudflare DNS availability

Between May 2018[35] and May 2022,[36] Cloudflare's 1.1.1.1 DNS service would not resolve archive.today's web addresses, making the site inaccessible to users of that service. Each party claimed the other was responsible. Cloudflare staff stated that the problem lay in archive.today's DNS infrastructure, as its authoritative nameservers returned invalid records when Cloudflare's systems made requests to archive.today. archive.today countered that Cloudflare's requests were not compliant with DNS standards, as Cloudflare does not send EDNS Client Subnet information in its DNS requests.[37][38] The issue was subsequently resolved.
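EDNS Client Subnet (RFC 7871) is an optional field that a resolver can attach to a DNS query to tell the authoritative server which network the end user is on. As a rough illustration of what that information looks like on the wire, the sketch below encodes the option for a documentation-range subnet; the function name is hypothetical and the code is not taken from either party's software.

```python
import ipaddress
import struct

ECS_OPTION_CODE = 8  # EDNS option code assigned to Client Subnet


def build_ecs_option(subnet: str) -> bytes:
    """Encode an EDNS Client Subnet option for a subnet such as '203.0.113.0/24'."""
    net = ipaddress.ip_network(subnet, strict=False)
    family = 1 if net.version == 4 else 2  # 1 = IPv4, 2 = IPv6
    source_prefix = net.prefixlen
    # Only the bytes needed to cover the prefix are sent; host bits are zero.
    address = net.network_address.packed[: (source_prefix + 7) // 8]
    # Scope prefix length is 0 in queries; the server fills it in responses.
    payload = struct.pack("!HBB", family, source_prefix, 0) + address
    return struct.pack("!HH", ECS_OPTION_CODE, len(payload)) + payload


print(build_ecs_option("203.0.113.0/24").hex())
```

A resolver that omits this option, as Cloudflare is described as doing, reveals only its own address to the authoritative server.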

See also


References

  1. Lua error in package.lua at line 80: module 'strict' not found.
  2. Lua error in package.lua at line 80: module 'strict' not found.
  3. Lua error in package.lua at line 80: module 'strict' not found.
  4. Lua error in package.lua at line 80: module 'strict' not found.
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. Lua error in package.lua at line 80: module 'strict' not found.
  7. Lua error in package.lua at line 80: module 'strict' not found.
  8. Lua error in package.lua at line 80: module 'strict' not found.
  9. Lua error in package.lua at line 80: module 'strict' not found.
  10. 10.0 10.1 Lua error in package.lua at line 80: module 'strict' not found.
  11. Lua error in package.lua at line 80: module 'strict' not found.
  12. Lua error in package.lua at line 80: module 'strict' not found.
  13. Lua error in package.lua at line 80: module 'strict' not found.
  14. Lua error in package.lua at line 80: module 'strict' not found.
  15. JavaScript-generated loading animation of Dailymotion video appearing in a frozen state
  16. Lua error in package.lua at line 80: module 'strict' not found.
  17. For example, the query insite: https://en.wikipedia.org "World Cup" returns the snapshots from that site related to "World Cup".
  18. Lua error in package.lua at line 80: module 'strict' not found.
  19. Lua error in package.lua at line 80: module 'strict' not found.
  20. Lua error in package.lua at line 80: module 'strict' not found.
  21. Lua error in package.lua at line 80: module 'strict' not found.
  22. Lua error in package.lua at line 80: module 'strict' not found.
  23. Lua error in package.lua at line 80: module 'strict' not found.
  24. Lua error in package.lua at line 80: module 'strict' not found.
  25. Lua error in package.lua at line 80: module 'strict' not found.
  26. Lua error in package.lua at line 80: module 'strict' not found.
  27. Lua error in package.lua at line 80: module 'strict' not found.
  28. Lua error in package.lua at line 80: module 'strict' not found.
  29. Lua error in package.lua at line 80: module 'strict' not found.
  30. Lua error in package.lua at line 80: module 'strict' not found.
  31. Lua error in package.lua at line 80: module 'strict' not found.
  32. Lua error in package.lua at line 80: module 'strict' not found.
  33. Lua error in package.lua at line 80: module 'strict' not found.
  34. Lua error in package.lua at line 80: module 'strict' not found.
  35. Lua error in package.lua at line 80: module 'strict' not found.
  36. Lua error in package.lua at line 80: module 'strict' not found.
  37. Lua error in package.lua at line 80: module 'strict' not found.
  38. Lua error in package.lua at line 80: module 'strict' not found.

External links