November 7, 2018

Team breaks world record for fast, accurate AI training

by Hong Kong Baptist University

HKBU and Tencent break world record for fast and accurate AI training — Diagram showing data transmission of a 5-layer model. Credit: HKBU

Researchers at Hong Kong Baptist University (HKBU) have partnered with a team from Tencent Machine Learning to create a new technique for training artificial intelligence (AI) machines faster than ever before while maintaining accuracy.

During the experiment, the team trained two popular deep neural networks called AlexNet and ResNet-50 in just four minutes and 6.6 minutes respectively. Previously, the fastest training time was 11 minutes for AlexNet and 15 minutes for ResNet-50.

AlexNet and ResNet-50 are deep neural networks built on ImageNet, a large-scale dataset for visual recognition. Once trained, the system was able to recognise and label an object in a given photo. The result is significantly faster than previous records and outperforms all other existing systems.

Machine learning is a set of mathematical approaches that enable computers to learn from data without explicitly being programmed by humans. The resulting algorithms can then be applied to a variety of data and visual recognition tasks used in AI.

The HKBU team comprises Professor Chu Xiaowen and Ph.D. student Shi Shaohuai from the Department of Computer Science. Professor Chu said, "We have proposed a new optimised training method that significantly improves the best output without losing accuracy. In AI training, researchers strive to train their networks faster, but this can lead to a decrease in accuracy. As a result, training machine-learning models at high speed while maintaining accuracy and precision is a vital goal for scientists."

Professor Chu said the time required to train AI machines is affected by both computing time and communication time. The research team attained breakthroughs in both aspects to create this record-breaking achievement.

This included adopting a simpler computational method known as FP16 to replace the more traditional one, FP32, making computation much faster without losing accuracy. As communication time is affected by the size of data blocks, the team came up with a communication technique named "tensor fusion," which combines smaller pieces of data into larger ones, optimising the transmission pattern and thereby improving the efficiency of communication during AI training.

This new technique can be adopted in large-scale image classification, and it can also be applied to other AI applications, including machine translation; natural language processing (NLP) to enhance interactions between human language and computers; medical imaging analysis; and online multiplayer battle games.

More information: Xianyan Jia et al. Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes. arXiv:1807.11205 [cs.LG]. arxiv.org/abs/1807.11205

Provided by Hong Kong Baptist University

Citation: Team breaks world record for fast, accurate AI training (2018, November 7) retrieved 18 April 2024 from https://phys.org/news/2018-11-team-world-fast-accurate-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Supercomputing speeds up deep learning training

103 shares

Feedback to editors

Unraveling the mysteries of consecutive atmospheric river events

31 minutes ago

Research team resolves decades-long problem in microscopy

32 minutes ago

RNA's hidden potential: New study unveils its role in early life and future bioengineering

1 hour ago

Smoother surfaces make for better accelerators

1 hour ago

Scientists reveal hydroclimatic changes on multiple timescales in Central Asia over the past 7,800 years

1 hour ago

Research reveals a surprising topological reversal in quantum systems

2 hours ago

NASA's Juno gives aerial views of mountain and lava lake on Io

2 hours ago

Toxic fireproof chemicals can be absorbed through touch, 3D-printed skin model shows

2 hours ago

Skyrmions move at record speeds: A step towards the computing of the future

3 hours ago

A third of China's urban population at risk of city sinking, new satellite data shows

3 hours ago

Load comments (0)

Team breaks world record for fast, accurate AI training

Unraveling the mysteries of consecutive atmospheric river events

Research team resolves decades-long problem in microscopy

RNA's hidden potential: New study unveils its role in early life and future bioengineering

Smoother surfaces make for better accelerators

Scientists reveal hydroclimatic changes on multiple timescales in Central Asia over the past 7,800 years

Research reveals a surprising topological reversal in quantum systems

NASA's Juno gives aerial views of mountain and lava lake on Io

Toxic fireproof chemicals can be absorbed through touch, 3D-printed skin model shows

Skyrmions move at record speeds: A step towards the computing of the future

A third of China's urban population at risk of city sinking, new satellite data shows

Relevant PhysicsForums posts

Error logging in: onLoginSuccess is not a function

My Website For Creating Interactive Visuals Linked To Equations

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

Most efficient way to randomly choose a word from a file with a list of words

Git, staging and committing files

Supercomputing speeds up deep learning training

Training artificial intelligence with artificial X-rays

Restoring balance in machine learning datasets

Training with states of matter search algorithm enables neuron model pruning

Scientists improve deep learning method for neural networks

A light-weight and accurate deep learning model for audiovisual emotion recognition

Machine learning approach for low-dose CT imaging yields superior results

Medical Xpress

Tech Xplore

Science X

Team breaks world record for fast, accurate AI training

Unraveling the mysteries of consecutive atmospheric river events

Research team resolves decades-long problem in microscopy

RNA's hidden potential: New study unveils its role in early life and future bioengineering

Smoother surfaces make for better accelerators

Scientists reveal hydroclimatic changes on multiple timescales in Central Asia over the past 7,800 years

Research reveals a surprising topological reversal in quantum systems

NASA's Juno gives aerial views of mountain and lava lake on Io

Toxic fireproof chemicals can be absorbed through touch, 3D-printed skin model shows

Skyrmions move at record speeds: A step towards the computing of the future

A third of China's urban population at risk of city sinking, new satellite data shows

Relevant PhysicsForums posts

Related Stories

Supercomputing speeds up deep learning training

Training artificial intelligence with artificial X-rays

Restoring balance in machine learning datasets

Training with states of matter search algorithm enables neuron model pruning

Scientists improve deep learning method for neural networks

A light-weight and accurate deep learning model for audiovisual emotion recognition

Recommended for you

Machine learning approach for low-dose CT imaging yields superior results

Newsletter sign up

Donate and enjoy an ad-free experience