The National Archives
Search our website
  • Search our website
  • Search our records
   
 
Image of software box and CD PRONOM
Welcome (PRONOM  home page) About PRONOM Add an entry
Search Help - opens in a new window Information resources - opens in a new window
 
 
 

*Details: File format summary

   
 

 

Search by keyword Search by file format Search by PUID Search by software Search by vendor Search by lifecycles Search by Migration Pathway

Details for:

Save as... XML | CSV Printer friendly version
 
 

Summary

Name CDX Internet Archive Index
Version  
Other names  
Identifiers PUID:  fmt/869
Family  
Classification Text (Structured)
Disclosure  
Description A CDX file consists of individual lines of text, each of which summarizes a single web document. The first line in the file is a legend for interpreting the data, and the following lines contain the data for referencing the corresponding pages within the host. The CDX signatures use a field delimiter character which is assumed to be a space character, however please contact the PRONOM team should you encounter CDX index files where the delimiter is different.
Orientation  
Byte order  
Related file formats None.
Technical Environment  
Released  
Supported until  
Format Risk  
Developed by None.
Supported by None.
Source Digital Preservation Department / The National Archives
Source date 14 Jan 2016
Source description  
Last updated 22 Sep 2020
Note  
Top of page Top of page
 
         
The National Archives Newsletter Icon

Send me The National Archives’ newsletter

A monthly round-up of news, blogs, offers and events.