Summary
|
| Name |
Apache ORC |
| Version |
V0 and V1 |
| Other names |
|
| Identifiers |
PUID:
fmt/2030
|
| Family |
|
| Classification |
Text (Structured) |
| Disclosure |
|
| Description |
The Apache ORC (Optimized Row Columnar) format is an open-source, column-oriented data storage format used by prominent data processing frameworks including Apache Spark, Apache Hive, Apache Flink and Apache Hadoop. The format was announced by Hortonworks (in collaboration with Facebook) in 2013. |
| Orientation |
|
| Byte order |
|
| Related file formats |
None.
|
| Technical Environment |
|
| Released |
|
| Supported until |
|
| Format Risk |
|
| Developed by |
The Apache Software Foundation / The Apache Software Foundation
|
| Supported by |
The Apache Software Foundation / The Apache Software Foundation
|
| Source |
Digital Preservation Department / The National Archives
|
| Source date |
28 Jan 2025 |
| Source description |
|
| Last updated |
28 Jan 2025 |
| Note |
Specifications:
https://orc.apache.org/specification/ORCv0/
https://orc.apache.org/specification/ORCv1/
Note:
https://github.com/apache/orc/tree/main/examples
https://en.wikipedia.org/wiki/Apache_ORC |