Open data
We’re using the Open Definition, which can be summarised as:
A piece of data or content is open if anyone is free to use, reuse, and redistribute it — subject only, at most, to the requirement to attribute and/or share-alike.
- NHS England Data Catalogue
- PubMed Central Open Access Subset
- SGUL’s Linked Open Data repository: academic API and SPARQL endpoint
Proprietary Data
While it may be easy to get access to this data, it’s subject to various terms and conditions that make it not open.
- NHS Choices – limited range of datasets (apply to NHS for API)
- NHS Safety Thermometer - restrictions on use include “It must not be used to make public statements or pronouncements, or cause, or allow it to appear in public either directly or indirectly”
- NHS iView - lots of information, but some data is restricted
- Patient Opinion - CC BY-NC-ND 3.0 so no distributing derivative works
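The reasoning behind this list can be sketched in Python: under the summary quoted above, a Creative Commons licence only counts as open if its conditions are limited to attribution (BY) and share-alike (SA). The function name `is_open_cc_licence` and the way the short licence code is parsed are assumptions made for illustration, not part of any official tool, and the full Open Definition has more criteria than this simple check covers.

```python
# Conditions that the summarised Open Definition tolerates:
# attribution (BY) and share-alike (SA).
ALLOWED_CONDITIONS = {"BY", "SA"}


def is_open_cc_licence(code: str) -> bool:
    """Return True if a short Creative Commons licence code, such as
    "CC BY-SA 4.0", imposes only attribution and/or share-alike conditions.

    Hypothetical helper for illustration only.
    """
    parts = code.upper().split()
    # CC0 is a public-domain dedication with no conditions at all.
    if parts and parts[0] == "CC0":
        return True
    if len(parts) < 2 or parts[0] != "CC":
        raise ValueError(f"not a Creative Commons licence code: {code!r}")
    conditions = set(parts[1].split("-"))
    # Any condition beyond BY and SA (for example NC or ND) fails the test.
    return conditions <= ALLOWED_CONDITIONS


print(is_open_cc_licence("CC BY-SA 4.0"))
print(is_open_cc_licence("CC BY-NC-ND 3.0"))
```

The second call corresponds to the Patient Opinion licence listed above: the NC and ND conditions go beyond attribution and share-alike, so the check fails.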
Other APIs
- APIs for SNOFyre, a demonstrator tool for aggregating and analysing SNOMED CT clinical records, with functions such as browsing the terminology
- BioPortal, a quick way to support autocomplete on a problem list
- ALISS (Access to Local Information to Support Self-management - Scottish info) - community assets of all sorts to help people live well with a long-term condition.
- Commissioning data packs
- National Dementia and Antipsychotic Prescribing Audit info at GP level - restricted to GPs and selected others
- Nationally supported clinical audits
- CPRD/GPRD/GOLD - patient-level GP data pricelist
- QOF datasets
- Datasets used to produce the CMO’s annual report on the nation’s health for 2011
- NHS Atlas of Variation (see here for more information and to view maps etc)
- Marmot indicators for local authorities in England 2012 - Maps
- Marmot indicators for local authorities in England 2012 - Data
- Data.gov.uk NHS datasets
- Kasabi NHS datasets (complete list here)
- ScraperWiki
- HSCIC Indicator Portal (Population health data, GP practice data, NHS Outcomes Framework data, hospital mortality, social care)
- HSCIC - searchable list of NHS approved datasets
- HSCIC - transparency data (includes GP Prescribing Data)
- HSCIC - useful list of stuff
- CfH - major classification schemes for disease and intervention (more for bean counting than clinicians or patients)
- NHS Data Hub
- Government economic costing for various sectors, including healthcare, for example cost per GP visit
Open source healthcare software
These open source code repositories provide APIs and access to health and care relevant datasets:
- clods - open source UK organisational data HTTP server and library
- hermes - open source SNOMED CT terminology server
- hades - open source FHIR terminology server
- deprivare - open source HTTP server for deprivation data in the UK
- dmd - open source library and server for UK dm+d (dictionary of medicines and devices) data
- codelists - declarative code lists using SNOMED CT, ATC, ICD-10 and ECL
- nhspd - Library and HTTP server for NHS/UK postal code data
- ods-weekly - open source HTTP server for GP data in each GP practice
- nhs-number - Clojure open source NHS number validation and generation library
- nhs-number - Python open source NHS number validation and generation library
- trud - Clojure library to simplify use of the NHS’s TRUD reference data
- Royal College of Paediatrics and Child Health open source software repositories - including digital growth charts
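To give a flavour of what the NHS number libraries above do, here is a minimal standalone Python sketch of the Modulus 11 check used for NHS number validation. It does not use or reproduce the API of either nhs-number library; the function name `is_valid_nhs_number` is an assumption for illustration, and the example number is one widely quoted in documentation rather than a real patient’s.

```python
def is_valid_nhs_number(value: str) -> bool:
    """Check a 10-digit NHS number using the Modulus 11 algorithm.

    Hypothetical helper for illustration; spaces and hyphens are ignored.
    """
    digits = value.replace(" ", "").replace("-", "")
    if len(digits) != 10 or not (digits.isascii() and digits.isdigit()):
        return False
    # Multiply the first nine digits by weights 10 down to 2 and sum them.
    total = sum(int(d) * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - (total % 11)
    if check == 11:
        check = 0
    # A computed check digit of 10 means the number can never be valid.
    if check == 10:
        return False
    return check == int(digits[9])


print(is_valid_nhs_number("943 476 5919"))
print(is_valid_nhs_number("943 476 5918"))
```

Passing the check only shows that a number is well formed; it says nothing about whether the number has been issued to anyone.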
Wish list
Things we’d love to have!
- HES data - as a minimum, aggregates could be made available at useful levels
- NICOR - positive intent, but difficult to find data
- Legacy Library - to be scraped and mined
- http://www.scotcourts.gov.uk/opinions/2010FAI15.html - to be scraped and mined
- Data at hospital level rather than trust level - important for a public audience who don’t know or care about who runs an organisation and important for understanding variations within a provider
- CQC’s Quality and Risk Profiles (currently only available to providers and CQC staff)