English

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

1028
2023-09-27 15:24:41
See translation

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

Related Recommendations
  • The United States promotes the development of next-generation EUV lithography technology

    LLNL has long been a pioneer in the development of EUV lithography technology.A laboratory located in California will lay the foundation for the next development of extreme ultraviolet (EUV) lithography technology. The project is led by Lawrence Livermore National Laboratory (LLNL) and aims to promote the next development of EUV lithography technology, centered around the laboratory's developed dr...

    01-06
    See translation
  • A US research team has developed a new type of photonic memory computing device

    Recently, a research team from the University of California, Santa Barbara has successfully developed a new type of photonic memory computing device that integrates non reciprocal magneto-optical technology. This device achieves high-speed, high-energy efficiency, and ultra-high durability photon computing by utilizing the non reciprocal phase shift phenomenon. The research findings, titled "Integ...

    2024-10-24
    See translation
  • Making Infrared Light Visible: New Equipment Utilizes 2D Materials to Convert Infrared Light

    Infrared imaging and sensing technology can be used in various fields, from astronomy to chemistry. For example, when infrared light passes through a gas, sensing changes in light can help scientists identify specific properties of the gas. The use of visible light may not always achieve this sensing.However, existing infrared sensors are bulky and inefficient. In addition, due to the use of infra...

    2024-06-24
    See translation
  • Coherent lasers will help expand the scale of fusion tokamaks

    Coherent company's excimer lasers can be more widely used in fusion reactor applications, after the US based photonics giant signed a "letter of intent" with Japan's Faraday 1867 Holdings.Faraday 1867, headquartered in Kanagawa Prefecture, is said to have become the world's leading manufacturer of high-temperature superconducting (HTS) tape through its subsidiary Faraday Japan factory.This tape is...

    2023-10-11
    See translation
  • Research has found that inorganic perovskite materials are easy to prepare and process, making them suitable for manufacturing lasers

    According to research from Busan National University, inorganic perovskite materials are easy to prepare and process, making them suitable for manufacturing lasers.The perovskite of interest is CsPbBr3, which must form "nanosheets" within the specific structure invented by the Busan team to obtain sufficient laser gain.It is not that the laser has been achieved, as the research project aims to cha...

    2024-01-04
    See translation