A -  B -  C -  D -  E -  F -  G -  H -  I -  J -  K -  L -  M -  N -  O -  P -  Q -  R -  S -  T -  U -  V -  W -  X -  Y -  Z

Find the APPSI quality score key (* - *****) at the bottom of the page.

# = Defined in legislation

A

Aggregated data
A combination of unit records created with the objective that individual details are not disclosed. ***
Source: APPSI Panel
Anonymisation
The process of adapting data so that individual people or businesses cannot be identified. ****
Source: APPSI Panel
Application Programming Interface (API)
A specification used as an interface by software components to communicate with each other. An API may include specifications for routines, data structures, object classes, and variables.****
Source: Wikipedia (modified by APPSI)
Attribution licence
A licence that requires that the original source of the licensed material is cited (attributed).***
Source: http://opendatahandbook.org/en/glossary.html
Authoritative
Able to be trusted as being accurate or true; reliable: e.g. "clear, authoritative information". ****
Source: Wikipedia
Authoritative data source
A recognised or official data production source with a designated mission statement or source/product to publish reliable and accurate data for subsequent use by customers. An authoritative data source may be the functional combination of multiple, separate data sources.****
Source: https://metadata.ces.mil/dse-help/en/Authoritative_Data_Sources

B

Big data
A loose term, not formally defined, for high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing, that can give enhanced insight and decision making.****
Source: http://www.gartner.com/resId=2057415
Big data analytics
The process of examining and interrogating big data assets to derive insights of value for decision making.****
Source: "The Big Data Opportunity, Policy Exchange, ISBN: 978-1-907689-22-2, 2012"

C

Click-use
The online licensing system for Crown and Parliamentary copyright information developed by the Office of Public Sector Information in 2001. Click-Use was replaced by the Open Government Licence and the Open Parliament Licence but remains historically significant.****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Commercial use/re-use
Use that is intended for or directed toward commercial advantage or private monetary compensation. For the purposes of the UK Government Licensing Framework, 'private monetary compensation' does not include the exchange of the Information for other copyrighted works by means of digital file-sharing or otherwise provided there is no payment of any monetary compensation in connection with the exchange of the Information.***
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Compiled database right
The legal protection provided by EC and UK law to a collection of databases (which have been compiled from a number of different sources and normalised to facilitate cross searching).****
Source: APPSI Panel
Content
The collection of information stored for a purpose in a file, folder or electronic message***
Source: APPSI Panel
Copyright
The protection given to literary works (including books and articles, and also databases and computer programs) which are recorded in writing. Electronic works may be recorded in analogue or digital form. Databases are protected by copyright where the selection and arrangement of the contents of the database are the author’s own intellectual creation. Other aspects of a database may be protected by database right.****
Source: APPSI Panel
Core-reference data
Authoritative or definitive data necessary to use other information, produced by the public sector as a service in itself due to its high importance and value. Usually including a field that may be used as a database key, or locational coordinates that may not be changed.**
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf
Costs - Fixed
Costs which do not vary with the level of activity in the short run.****
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/
Costs - Marginal
The incremental cost of providing one further unit of a good or service.****
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/
Costs - Full
The total cost of all the resources used in providing a good or service in any accounting period (usually one year). This will include all direct and indirect costs of producing the output (both cash and non-cash costs), including a full proportional share of overhead costs and any selling and distribution costs, insurance, depreciation, and the cost of capital, and any selling and distribution costs, insurance, depreciation, and the cost of capital, including any appropriate adjustment for expected cost increases.*****
Source: Managing Public Money
Creative Commons
A non-profit US organisation that enables the sharing and use of creativity and knowledge through free legal tools.****
Source: http://creativecommons.org/about
Crown Copyright
Crown copyright covers material created by civil servants, ministers and government departments and agencies. It is legally defined under section 163 of the Copyright, Designs and Patents Act 1988 as works made by officers or services of the Crown in the course of their duties. Copyright can also come into Crown ownership by means of an assignment or transfer of the copyright from the legal owner of the copyright to the Crown.****
Source: Defined by The National Archives

D

Data (can be singular or plural in common usage)
The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media. (The terms data, information and knowledge are frequently used for overlapping concepts. The main difference is in the level of abstraction being considered. Data is a broad term, embracing others, but is often the lowest level of abstraction, information is the next level and, finally, knowledge is the highest level.) See Raw data, Derived data, Metadata.***
Source: http://www.oxforddictionaries.com/definition/english/data
Data discovery
The process of finding out what data exists and how it can be accessed.****
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/
Data sharing
The transfer, by agreement, of data collected for a specific purpose between two or more parties***
Source: APPSI Panel
Database rights
An intellectual property right which applies to databases defined by the Copyright and Rights in Databases Regulations 1997 as 'a collection of independent works or materials arranged in a systematic or methodical way and that are individually accessible by electronic or other means'. Database rights apply only to the collection of works, not to the individual works contained within it. Database right protection lasts for 15 years from when the database was completed but the 15 year period will restart if the database is altered significantly.***
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Dataset
A collection of data, published or curated by a single source, and available for access or download in one or more formats.****
Source: http://www.w3.org/TR/vocab-dcat/#class--dataset
De-anonymisation
The technical process of attempting to determine the identity of a person or individual to whom a pseudonymised dataset relates.***
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf
Definitive
Of recognized authority or excellence****
Source: The Free Online Dictionary
Delegations of Authority
Authority granted by the Controller of Her Majesty's Stationery Office to Crown bodies enabling them to license the re-use of information which they produce. Crown bodies with complete delegations to license information include trading funds, however some departments have partial delegations to license the use of particular information. All Crown bodies with delegations of authority are subject to the supervision of the Information Fair Trader Scheme.****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Derived data
A data element or dataset adapted from other data sources using a mathematical, logical, or other type of transformation, e.g. arithmetic formula, composition, aggregation. See Value-added data.***
Source: OECD Glossary of Statistical Terms (modified by APPSI)
Digital rights management
A class of access control technologies that are used by hardware manufacturers, publishers, copyrightholders and individuals with the intent to limit the use of digital content and devices after sale.***
Source: Wikipedia
Disclosive
Data is potentially disclosive if, despite the removal of obvious identifiers, characteristics of this dataset in isolation or in conjunction with other datasets might lead to identification of the individual to whom a record belongs.****
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf
Document #
Any content whatever its medium (written on paper or stored in electronic form or as a sound, visual or audiovisual recording).*****
Source: EU PSI Directive

F

Free at point of use
Where there is no charge or fee to the end-user for the use or re-use of information.***
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/
Freemium
A business model by which a product or service (typically a digital offering such as software, media, games or web services) is provided free of charge, but a premium is charged for advanced features or functionality.****
Source: Wikipedia

G

Geospatial data
Also known as spatial data or geographic information, it is the data that represents the geographic location of natural and man-made features on Earth. Spatial data is usually stored as coordinates of points, lines and areas and may include their topological relationship and attributes.****
Source: Webopedia (modified by APPSI)

I

Information
Interpretation and analysis of data that when presented in context represents added value, message or meaning. See Data.****
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/
Information Asset Registers (IAR)
Registers specifically set up to capture and organise metadata about the vast quantities of information held by government departments and agencies. A comprehensive IAR includes databases, old sets of files, recent electronic files, collections of statistics, research and so forth.***
Source: http://opendatahandbook.org/en/glossary.html & http://assets.okfn.org/tmp/opendatamanual/glossary.html
Information Fair Trader Scheme (IFTS)
A scheme to set and assess standards for public sector bodies in allowing the re-use of their information. Any public sector body may apply to become IFTS accredited. However, all Crown bodies that hold a delegation of authority from the Controller of HMSO must become IFTS accredited. ITFS measures members' performance against the six principles of maximisation, simplicity, transparency, fairness, challenge and innovation. It considers both the commercial re-use of public sector information and non-commercial citizen access to information.****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Information provider
The person, creator or organisation providing the information for re-use under the Open Government Licence or the Non-Commercial Government Licence.***
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Intellectual property (rights)
A set of property rights that grant the right to protect the created materials. Intellectual property rights comprise trade marks, patents, registered designs, copyright and database rights.****
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf

L

Licence (noun)
A legal document giving permission to use information****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
License (verb)
The act of giving a formal licence (usually written) authorisation.****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Linked data
The technical term used to describe the best practice of exposing, sharing and connecting items of data on the semantic web using unique resource identifiers (URIs) and resource description framework (RDF). Not to be confused with data linking.****
Source: APPSI Panel

M

Metadata
Data that describes or defines other data. Anything that users need to know to make proper and correct use of the real data, in terms of reading, processing, interpreting, analysing and presenting the information. Thus metadata includes file descriptions, codebooks, processing details, sample designs, fieldwork reports, conceptual motivations, etc., in other words, anything that might influence the way in which the information is used.***
Modelled data
Information created by mathematical representation of data relationships; sometimes used to simulate environments that are difficult to observe reliably or consistently. ***
Source: APPSI Expert Member
Mosaic/jigsaw effect
The technical process of combining anonymised data with auxiliary data in order to attempt to reconstruct identifiers linking data to the individual it relates to.***
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf

N

Non-Commercial Government Licence
The Non-Commercial Government Licence offers a legal solution to enable the provision and use of public sector information under a common set of terms and conditions at no charge for Non-Commercial use only. It enables any public sector information holder to make their information available for use and re-use under its terms. The main requirement for re-users is to attribute the information provider and source.****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Non-commercial use
Use that is not intended for or directed toward commercial advantage or private monetary compensation. For the purposes of the UK Government Licensing Framework, 'private monetary compensation' does not include the exchange of the Information for other copyrighted works by means of digital file-sharing or otherwise provided there is no payment of any monetary compensation in connection with the exchange of the Information.***
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm

O

Ontology
Formal representation of knowledge as a set of concepts within a domain, and the relationships among those concepts.****
Source: Wikipedia
Open access (academic)
Provision of free access to peer-reviewed academic publications.***
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf
Open data
Data is open if anyone is free to access, use, modify, and share it — subject, at most, to measures that preserve provenance and openness.****
Source: http://opendefinition.org/od/
Open Data Commons
An Open Knowledge Foundation project run by its Advisory Council and like the Foundation is a not-for-profit effort working for the benefit of the general open knowledge community. Open Data Commons is the home of a set of legal 'tools' to help others provide and use open data. ****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Open Government Licence (OGL)
The Open Government Licence offers a legal solution to enable the provision and use of public sector information under a common set of terms and conditions. It enables any public sector information holder to make their information available for use and re-use under its terms. The main requirement for re-users is to attribute the Information Provider and source.*****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm

P

Personal data #
Data which relate to a living individual who can be identified – (a) from those data, or (b) from those data and other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual.*****
Source: Data Protection Act 1998
Pseudonymised data
Data relating to a specific individual where the identifiers have been replaced by artificial identifiers to prevent identification of the individual.***
Source: http://data.gov.uk/sites/default/files/Open_data_White_Paper.pdf
Public domain
Works that are publicly available and in which the intellectual property rights have expired or been waived****
Source: APPSI Panel
Public sector bodies #
State, regional or local authorities, bodies governed by public law and associations formed by one or several such authorities or one or several such bodies governed by public law.*****
Source: EU PSI Directive
Public Sector Information (PSI)
The wide range of information that public sector bodies collect, produce, reproduce and disseminate in many areas of activity while accomplishing their Public Task.****
Source: APPSI Panel
Public task
Public task information consists of information that a public sector body must produce, collect or provide to fulfil its core role and functions, whether these duties are statutory in nature or are established through custom and practice. The term ‘public task’ features in the Re-use of Public Sector Information Regulations 2005 (SI 2005 No. 1515) and the INSPIRE Regulations 2009 (SI 2009 No. 3157).****
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/

R

Raw data 
In the context of PSI, raw data is data collected which has not been subjected to processing or any other manipulation beyond that necessary for its first use. Raw data, i.e. unprocessed data, is a relative term; data processing commonly occurs by stages, and the 'processed data' from one stage may be considered the 'raw data' of the next.***
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/ (modified by APPSI)
Resource Description Framework (RDF)
RDF, a W3C standard, is the foundation of several technologies for modelling distributed knowledge and is meant to be used as the basis of the Semantic Web***
Source: http://www.rdfabout.com/
Re-use (noun/verb) #
The use by persons or legal entities of documents held by public sector bodies, for commercial or non-commercial purposes other than the initial purpose within the public task for which the documents were produced. Exchange of documents between public sector bodies purely in pursuit of their public tasks does not constitute re-use. *****
Source: EU PSI Directive
Sample of Anonymised Records (SARs)
A set of unit records available for research where key information has been removed to ensure anonymity. (Specifically Census SARs)***
Source: APPSI Expert Member

S

Semantic Web
A web of data that can be processed directly and indirectly by machines, providing a common framework that allows data to be shared and reused across application, enterprise, and community boundaries. It is based on the Resource Description Framework (RDF).****
Source: Berners-Lee quotation/http://www.w3.org/2001/sw/
Share-alike licence
The Creative Commons Attribution Share-Alike license allows re-distribution and re-use of a licensed work on the conditions that the creator is appropriately credited and that any derivative work is made available under “the same, similar or a compatible license”.****
Source: http://opendefinition.org/licenses/cc-by-sa/
Star rating
In UK Linked Data, a system of ranking data sources that indicates ease of machine readability. It is not a measure of the quality of the data content.**
Source: APPSI Panel
Synthetic population
A particular application of simulated data that generates a complete micro-view of individuals in a population.**
Source: APPSI Panel

T

Taxonomy
The science or technique of classification.****
Source: Dictionary.com
Third party rights
Information, the rights for which are not owned by the Information Provider or Licensor.***
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Trading Fund #
An organisation (either within a government department or forming one) which is largely or wholly financed from commercial revenue generated by its activities. Its Estimate shows its net impact, allowing its income from receipts to be devoted entirely to its business. *****
Source: Managing Public Money

U

Uniform Resource Identifier (URI)
The generic term for all types of names and addresses that refer to objects on the World Wide Web. A URL is one kind of URI.***
Source: Webopedia
Uniform Resource Locator (URL)
A type of URI that identifies a resource via a representation of its network location****
Source: http://www.nationalarchives.gov.uk/information-management/government-licensing/glossary-and-references.htm
Unit records
Individual items of information from surveys or observations that often contain confidential details. ***
Source: APPSI Panel

V

Value-added information (or data)
Data to which value has been added to enhance and facilitate its use and effectiveness by or for users. See Derived data.***
Source: A Consultation on Data Policy for a Public Data Corporation http://discuss.bis.gov.uk/pdc/glossary-of-terms/

APPSI quality score key

  • Score Key
  • ***** Widely agreed definition or enshrined in legislation
  • **** A sound definition; fit for purpose
  • *** A working definition capable of improvement
  • ** Usable but potentially confusing or ambiguous
  • * A loose or contentious definition; not adequate

Note: Descriptions of Organisations and Acts are omitted from this glosssary.