English

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

1236
2023-09-27 15:24:41
See translation

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

Related Recommendations
  • Japanese and Australian teams use lasers to search for space debris the size of peanuts

    It is reported that Japanese startup EX Fusion will soon reach an agreement with Australian space contractor Electric Optical Systems to conduct on-site testing of technology for tracking small space debris orbiting Earth.Image source: LeolabsEX Fusion, headquartered in Osaka, specializes in the laser business with the goal of achieving commercial laser fusion reactors. So far, nuclear fusion rese...

    2023-10-10
    See translation
  • Researchers have demonstrated a breakthrough boson sampling method using ultracold atoms in optical lattices

    JILA researcher, National Institute of Standards and Technology (NIST) physicist, physics professor Adam Kaufman and his team at the University of Colorado Boulder, as well as NIST collaborators, demonstrated a new method of cross laser beam lattice sampling using ultracold atoms for boson sampling in two-dimensional optics. This study, recently published in the journal Nature, marks a significant...

    2024-05-10
    See translation
  • Laser driven leap forward: the next generation of magnetic devices for controlling light is born

    Recently, a new laser heating technology developed by a Japanese research group has paved the way for advanced optical communication equipment by integrating transparent magnetic materials into optical circuits.This breakthrough was recently published in the journal Optical Materials. It is crucial for integrating magneto-optical materials and optical circuits, which has been a significant long-te...

    2023-12-21
    See translation
  • Scientists propose new methods to accelerate the commercialization of superlens technology

    Superlenses are nano artificial structures that can manipulate light, providing a technique that can significantly reduce the size and thickness of traditional optical components. This technology is particularly effective in the near infrared region, and has great prospects in various applications, such as LiDAR, which is called "the eye of autonomous vehicle", mini UAV and blood vessel detector.D...

    2024-03-29
    See translation
  • Thorlabs announces acquisition of Praevium Research

    On January 13, 2025, Thorlabs announced the acquisition of long-term partner Praevium Research, a developer of high-speed tunable VCSEL. In the future, Praevium will continue to operate as a department of Thorlabs under the name Praevium Research at its existing locations in California, while retaining its current leadership.It is understood that Christopher Burgner will serve as the general man...

    01-16
    See translation