Details: File format summary

Details for:

Go to: Summary | Documentation

| Signatures

| Compression

| Rights

Properties

Summary
Name	Apache ORC
Version	V0 and V1
Other names
Identifiers	PUID: fmt/2030
Family
Classification	Text (Structured)
Disclosure
Description	The Apache ORC (Optimized Row Columnar) format is an open-source, column-oriented data storage format which is utilized by prominent data processing frameworks including Apache Spark, Apache Hive, Apache Flink and Apache Hadoop.The format was annouced by Hortonworks (in collaboration with Facebook) in 2013.
Orientation
Byte order
Related file formats	None.
Technical Environment
Released
Supported until
Format Risk
Developed by	The Apache Software Foundation / The Apache Software Foundation
Supported by	The Apache Software Foundation / The Apache Software Foundation
Source	Digital Preservation Department / The National Archives
Source date	28 Jan 2025
Source description
Last updated	28 Jan 2025
Note	Specifications: https://orc.apache.org/specification/ORCv0/ https://orc.apache.org/specification/ORCv1/ Note: https://github.com/apache/orc/tree/main/examples https://en.wikipedia.org/wiki/Apache_ORC