Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

2023-09-27 15:24:41

Training of current deep neural network systems (such as the one behind ChatGPT) could soon become 100 times more energy efficient, with future improvements expected to add several more orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that could outperform state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

In practice, this means that the most advanced models could be trained at the same speed while using 100 times less energy and occupying less space.
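The relationship between "100 times" and "two orders of magnitude" can be checked with a short calculation. The sketch below is illustrative only: the function names and the 1 MWh baseline are assumptions made for the example and are not figures from the paper.

```python
import math


def orders_of_magnitude(factor: float) -> float:
    """Return the base-10 logarithm of an improvement factor."""
    return math.log10(factor)


def scaled_energy(baseline_joules: float, efficiency_gain: float) -> float:
    """Energy needed for the same workload after an efficiency gain."""
    return baseline_joules / efficiency_gain


# Hypothetical baseline: a training run that consumes 1 MWh (3.6e9 J).
# This number is an assumption for illustration, not a measured value.
baseline = 3.6e9

print(orders_of_magnitude(100))      # a 100x gain is 2 orders of magnitude
print(scaled_energy(baseline, 100))  # energy left after a 100x gain, in joules
```

Under these assumptions, a 100-fold efficiency gain reduces the hypothetical 1 MWh run to 10 kWh.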

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from large datasets, and they are reshaping the field of information processing. Current applications include image, object, and speech recognition, games, medicine, and physical chemistry.

Current artificial intelligence models have reached hundreds of billions of artificial neurons. Their size is growing exponentially, which strains the capabilities of today's hardware.

The paper demonstrates that optical neural network (ONN) methods, with their high clock speeds, parallelism, and low-loss data transmission, can overcome current limitations.

"Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices," the paper states.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as transistor count, the energy cost of moving data, and semiconductor size. ONNs use light, which can carry a large amount of information simultaneously thanks to its wide bandwidth and low transmission loss. In addition, many photonic circuits can be integrated to scale up the system.

To move light for computation, the MIT-led team used many laser beams, an approach described as "using mass-produced micrometer-scale vertical-cavity surface-emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon-synapse-dendrite' structure in biological neurons."

They believe that the demonstrated system can be scaled up through mature wafer-level manufacturing processes and photonic integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and leader of the work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Training much larger models is therefore not economically feasible.

He said, "Our new technology could make it possible to leapfrog to machine-learning models that otherwise would not be reachable in the near future."

The paper, titled "Deep learning with coherent VCSEL neural networks", was published by a large team of scientists. The work received support from the Army Research Office, NTT Research, and an NTT Netcast award, as well as financial support from the Volkswagen Foundation. Three researchers on the team have applied for patents related to this technology.

Source: Laser Network
