Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

2023-09-27 15:24:41

Training of current deep neural network systems (such as ChatGPT) could soon become 100 times more energy efficient, and future improvements are expected to add several more orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

In practice, this means that the most advanced models could be trained at the same speed while using 100 times less energy and taking up less space.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from large datasets, reshaping the field of information processing. Current applications include image, object, and speech recognition, games, medicine, and physical chemistry.

Current artificial intelligence models have reached hundreds of billions of artificial neurons. Their size is growing exponentially and straining the capabilities of today's hardware.

The paper demonstrates that optical neural network (ONN) methods, with their high clock speeds, parallelism, and low-loss data transmission, can overcome current limitations.

"Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices," the paper states.

The ONN method is expected to ease the bottlenecks of traditional processors, such as limits on transistor count, the energy consumed in moving data, and semiconductor size. An ONN computes with light, which can carry a large amount of information simultaneously thanks to its wide bandwidth and low transmission loss. In addition, many photonic circuits can be integrated to scale up the system.

To compute with light, the MIT-led team used many laser beams, an approach described as "using mass-produced micrometer-scale vertical-cavity surface-emitting lasers [VCSELs] for neuron encoding."

The researchers explained, "Our scheme is similar to the 'axon-synapse-dendrite' structure in biological neurons."

They believe that the demonstrated system can be scaled up through mature wafer-level manufacturing processes and photonic integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and leader of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. As a result, training much larger models is not economically feasible.

He said, "Our new technology could make it possible to leapfrog to machine learning models that otherwise would not be reachable in the near future."

The paper, titled "Deep Learning Using Coherent VCSEL Neural Networks," was published by a large team of scientists. The work received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. Three researchers on the team have applied for patents related to this technology.

Source: Laser Network
