Starting in 1996, Alexa Internet has been donating their crawl data to the Internet Archive. Flowing in every day, these data are added to the Wayback Machine after an embargo period.
The HTML tag is the outermost tag. It is not required and may safely
be omitted. It indicates that the text is HTML (the version can be
indicated with the optional VERSION attribute), but this information
is almost never used by servers or browsers.
Notes:
If used, the HTML tags should go around the entire
document, but directly after the DOCTYPE declaration.