June 29, 2017

Future-proofing 'big data' biological research depends on good digital identifiers

"Big data" research runs the risk of being undermined by the poor design of the digital identifiers that tag data. A group of worldwide researchers, led by Julie McMurry, at Oregon Health & Science University, has assembled a set of pragmatic guidelines to create, reference and maintain web-based identifiers to improve reproducibility, attribution, and scientific discovery. The guidance, publishing June 29 in the open access journal PLOS Biology helps address the frequent problems associated with persistent identifiers linked to scientific data.

Over the past decade, the life sciences have drastically changed as data continues to evolve to be larger, more interdependent and natively web-based. In this landscape, the broader scientific research community has struggled to engineer this data for the web so that it is persistently accessible, reusable and attributable.

Depending on the individual database involved, identifiers can signify a gene, a genome, a chemical, an organism, a set of experimental data, or even a published article. The usefulness of all these items depends on the robustness and uniqueness of their respective identifiers, enabling them to be linked and discovered in perpetuity. The authors point out that the organic way in which most identifiers have arisen threatens that usefulness, and recognise that it is difficult to create and sustain persistent identifiers or web addresses that won't break and that are used consistently.

This work calls on professionals to do a better job of identifier engineering - according to emerging community-developed conventions - so that data can be utilized more effectively for scientific discovery. It also calls on users to be aware enough of these conventions, and of available tooling, to not get burned by broken links and missed connections.

"As with plumbing fixtures, the question of how identifiers work should only need to be understood by those that build and maintain them. However, everyone needs to know how identifiers should be used, and this is where convention is important," said McMurry. "Through this work, we hope to encourage all participants in the scholarly ecosystem - including authors, data creators, data integrators, publishers, software developers, and resolvers - to adhere to best practice in order to maximize the utility and impact of life science data."

More information: McMurry JA, Juty N, Blomberg N, Burdett T, Conlin T, Conte N, et al. (2017) Identifiers for the 21st century: How to design, provision, and reuse persistent identifiers to maximize utility and impact of life science data. PLoS Biol 15(6): e2001414. doi.org/10.1371/journal.pbio.2001414

Journal information: PLoS Biology

Provided by Public Library of Science

Citation: Future-proofing 'big data' biological research depends on good digital identifiers (2017, June 29) retrieved 19 April 2024 from https://phys.org/news/2017-06-future-proofing-big-biological-good-digital.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Search gets smarter with identifiers

10 shares

Feedback to editors

Future-proofing 'big data' biological research depends on good digital identifiers

Ghost particle on the scales: Research offers more precise determination of neutrino mass

Light show in living cells: New method allows simultaneous fluorescent labeling of many proteins

Warming of Antarctic deep-sea waters contribute to sea level rise in North Atlantic, study finds

Unraveling water mysteries beyond Earth: Ground-penetrating radar will seek bodies of water on Jupiter

Baby white sharks prefer being closer to shore, scientists find

Key protein regulates immune response to viruses in mammal cells

Unraveling the mysteries of consecutive atmospheric river events

Research team resolves decades-long problem in microscopy

RNA's hidden potential: New study unveils its role in early life and future bioengineering

Smoother surfaces make for better accelerators

Relevant PhysicsForums posts

Can four legged animals drink from beneath their feet?

Mold in Plastic Water Bottles? What does it eat?

Dolphins don't breathe through their esophagus

Is this egg-laying or something else?

Color Recognition: What we see vs animals with a larger color range

How to Implement Beamforming in Ultrasound Diffraction Tomography

Search gets smarter with identifiers

Unusual brand logos and images work well

Novel cybercatalog of flower-loving flies suggests the digital future of taxonomy

Identifying problems with national identifiers: Supposedly encrypted numbers can be easily decrypted

New guidance on data sharing will minimize risks to patient privacy

Sociologists urge use of big data to study human interaction

Linking environmental influences, genetic research to address concerns of genetic determinism of human behavior

40 years of crop research shows inequities

AI-generated disproportioned rat genitalia makes its way into peer-reviewed journal

Unpacking social equity from biodiversity data: An interdisciplinary policy perspective

A whiff of tears reduces male aggression, says study

Solicitor in 19th-century Tasmania traded human Aboriginal remains for scientific accolades, study reveals

Medical Xpress

Tech Xplore

Science X

Future-proofing 'big data' biological research depends on good digital identifiers

Ghost particle on the scales: Research offers more precise determination of neutrino mass

Light show in living cells: New method allows simultaneous fluorescent labeling of many proteins

Warming of Antarctic deep-sea waters contribute to sea level rise in North Atlantic, study finds

Unraveling water mysteries beyond Earth: Ground-penetrating radar will seek bodies of water on Jupiter

Baby white sharks prefer being closer to shore, scientists find

Key protein regulates immune response to viruses in mammal cells

Unraveling the mysteries of consecutive atmospheric river events

Research team resolves decades-long problem in microscopy

RNA's hidden potential: New study unveils its role in early life and future bioengineering

Smoother surfaces make for better accelerators

Relevant PhysicsForums posts

Related Stories

Search gets smarter with identifiers

Unusual brand logos and images work well

Novel cybercatalog of flower-loving flies suggests the digital future of taxonomy

Identifying problems with national identifiers: Supposedly encrypted numbers can be easily decrypted

New guidance on data sharing will minimize risks to patient privacy

Sociologists urge use of big data to study human interaction

Recommended for you

Linking environmental influences, genetic research to address concerns of genetic determinism of human behavior

40 years of crop research shows inequities

AI-generated disproportioned rat genitalia makes its way into peer-reviewed journal

Unpacking social equity from biodiversity data: An interdisciplinary policy perspective

A whiff of tears reduces male aggression, says study

Solicitor in 19th-century Tasmania traded human Aboriginal remains for scientific accolades, study reveals

Newsletter sign up

Donate and enjoy an ad-free experience