September 12, 2019 report

Chemists show how bias can crop up in machine learning algorithm results

by Bob Yirka , Phys.org

A team of material scientists at Haverford College has shown how human bias in data can impact the results of machine-learning algorithms used to predict new reagents for use in making desired products. In their paper published in the journal Nature, the group describes testing a machine-learning algorithm with different types of datasets and what they found.

One of the more well-known applications of machine-learning algorithms is in facial recognition. But there are possible problems with such algorithms. One such problem occurs when a facial algorithm intended to look for an individual among many faces has been trained using people of just one race. In this new effort, the researchers wondered if bias, unintentional or otherwise, might be cropping up in machine learning algorithm results used in chemistry applications designed to look for new products.

Such algorithms use data describing the ingredients of reactions that result in the creation of a new product. But the data the system is trained on could have a major impact on the results. The researchers note that currently, such data is obtained from published research efforts, which means they are typically generated by humans. They note that the data from such efforts could have been generated by the researchers themselves, or by other researchers working on separate efforts. Data could even come from a single person simply relating from memory, or from a professor's suggestion, or a graduate student with a bright idea. The point is, the data could be biased in terms of the background of the resource.

In this new effort, the researchers wanted to know if such biases might have an impact on the results of machine-learning algorithms used for chemistry applications. To find out, they looked at a specific set of materials called amine-templated vanadium borates. When they are synthesized successfully, crystals form—an easy way to determine if a reaction was successful.

The experiment consisted of training a machine-learning algorithm on data surrounding the synthesis of vanadium borates, and then programming the system to create its own. Some of the data collected by the researchers was human-generated, and some of it was collected randomly. They report that the algorithm trained on the random data did better at finding ways to synthesize the vanadium borates than when it used data generated from humans. They claim this shows a clear bias in the data that was created by humans.

More information: Xiwen Jia et al. Anthropogenic biases in chemical reaction data hinder exploratory inorganic synthesis, Nature (2019). DOI: 10.1038/s41586-019-1540-5

Journal information: Nature

Citation: Chemists show how bias can crop up in machine learning algorithm results (2019, September 12) retrieved 25 April 2024 from https://phys.org/news/2019-09-chemists-bias-crop-machine-algorithm.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Algorithm able to accurately see differences between cancerous lung tumors

115 shares

Feedback to editors

Chemists show how bias can crop up in machine learning algorithm results

Artificial intelligence helps scientists engineer plants to fight climate change

Ultrasensitive photonic crystal detects single particles down to 50 nanometers

Scientists map soil RNA to fungal genomes to understand forest ecosystems

Researchers show it's possible to teach old magnetic cilia new tricks

Mantle heat may have boosted Earth's crust 3 billion years ago

Study suggests that cells possess a hidden communication system

Researcher finds that wood frogs evolved rapidly in response to road salts

Imaging technique shows new details of peptide structures

Cows' milk particles used for effective oral delivery of drugs

New research confirms plastic production is directly linked to plastic pollution

Relevant PhysicsForums posts

Very confused about Naunyn definition of acid and base

Can you eat the Periodic Table?

Ideas for a project in computational chemistry?

New Insight into the Chemistry of Solvents

Separation of KCl from potassium chromium(III) PDTA

Zirconium Versus Zirconium Carbide For Use With Galinstan

Algorithm able to accurately see differences between cancerous lung tumors

New algorithm limits bias in machine learning

How CERN machine-learning techniques could improve autonomous vehicles

A Hippocratic Oath for data science? We'll settle for a little more data literacy

An algorithm could play a major role in helping radiologists diagnose cancer early, accurately

Programming and prejudice: Computer scientists discover how to find bias in algorithms

Researchers show it's possible to teach old magnetic cilia new tricks

New method could cut waste from drug production

Some cannabis rolling papers may contain unhealthy levels of heavy metals

Scientists develop novel liquid metal alloy system to synthesize diamond under moderate conditions

Researchers create artificial cells that act like living cells

Previous theory on how electrons move within protein nanocrystals might not apply in every case

Medical Xpress

Tech Xplore

Science X

Chemists show how bias can crop up in machine learning algorithm results

Artificial intelligence helps scientists engineer plants to fight climate change

Ultrasensitive photonic crystal detects single particles down to 50 nanometers

Scientists map soil RNA to fungal genomes to understand forest ecosystems

Researchers show it's possible to teach old magnetic cilia new tricks

Mantle heat may have boosted Earth's crust 3 billion years ago

Study suggests that cells possess a hidden communication system

Researcher finds that wood frogs evolved rapidly in response to road salts

Imaging technique shows new details of peptide structures

Cows' milk particles used for effective oral delivery of drugs

New research confirms plastic production is directly linked to plastic pollution

Relevant PhysicsForums posts

Related Stories

Algorithm able to accurately see differences between cancerous lung tumors

New algorithm limits bias in machine learning

How CERN machine-learning techniques could improve autonomous vehicles

A Hippocratic Oath for data science? We'll settle for a little more data literacy

An algorithm could play a major role in helping radiologists diagnose cancer early, accurately

Programming and prejudice: Computer scientists discover how to find bias in algorithms

Recommended for you

Researchers show it's possible to teach old magnetic cilia new tricks

New method could cut waste from drug production

Some cannabis rolling papers may contain unhealthy levels of heavy metals

Scientists develop novel liquid metal alloy system to synthesize diamond under moderate conditions

Researchers create artificial cells that act like living cells

Previous theory on how electrons move within protein nanocrystals might not apply in every case

Newsletter sign up

Donate and enjoy an ad-free experience