The National Archives
Search our website
  • Search our website
  • Search our records
Image of software box and CD PRONOM
Welcome (PRONOM  home page) About PRONOM Add an entry
Search Help - opens in a new window Information resources - opens in a new window

*Details: File format summary



Search by keyword Search by file format Search by PUID Search by software Search by vendor Search by lifecycles Search by Migration Pathway

Details for:

Save as... XML | CSV Printer friendly version


Name TEI P4 XML - Corpus File
Version P4
Other names  
Identifiers MIME:  application/tei+xml
PUID:  fmt/1475
Classification Text (Mark-up)
Description The Text Encoding Initiative Guidelines provide a methodology for encoding textual content for a wide variety of academic and publishing purposes and repurposes. P4 is serialised as XML. Conceptually, TEI is a sibling of HTML which focuses on textual semantics rather than display. Note that TEI permits customisations and some may mean that the file no longer matches the attached signatures. This is especially the case where TEI is fragmented, embedded within other XML or when other XML namespaces are used. Information on how to convert TEI P4 to TEI P5 can be found at and a script at Such automated conversion is not guaranteed to be lossless, particularly when the TEI has been customised. P4 differs from the proceeding P3 version by being in XML rather than in SGML
Byte order  
Related file formats Has priority over Extensible Markup Language (1.0)
Technical Environment  
Supported until  
Format Risk  
Developed by None.
Supported by None.
Source Victoria University of Wellington / Victoria University of Wellington
Source date 19 Oct 2021
Source description  
Last updated 19 Oct 2021
Top of page Top of page
The National Archives Newsletter Icon

Send me The National Archives’ newsletter

A monthly round-up of news, blogs, offers and events.