
Scientists demonstrate a new optical neural network training method that could outperform electronic microprocessors

2023-09-27 15:24:41

Training of current deep neural network systems (such as ChatGPT) could quickly become about 100 times more energy-efficient, with future improvements expected to add several more orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can outperform state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

In practice, this means that the most advanced models could be trained at the same speed while using about 100 times less energy and occupying less space.
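To make the "orders of magnitude" arithmetic concrete, here is a minimal Python sketch. The function name and the baseline figure are hypothetical and chosen only for illustration; they do not come from the paper. It converts an improvement expressed in orders of magnitude into a scaled energy figure.

```python
def scaled_energy(baseline_kwh: float, orders_of_magnitude: float) -> float:
    """Return the energy needed after an efficiency gain.

    An improvement of n orders of magnitude is a factor of 10**n,
    so two orders of magnitude corresponds to 100 times less energy.
    """
    return baseline_kwh / (10 ** orders_of_magnitude)


# Hypothetical baseline: a training run that needs 1000 kWh today.
baseline_kwh = 1000.0

# Two orders of magnitude = the roughly 100x gain reported in the article.
print(scaled_energy(baseline_kwh, 2))
```

The baseline value is arbitrary; only the factor of 100 comes from the article.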

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from large datasets, and they are reshaping the field of information processing. Current applications include image, object, and speech recognition, games, medicine, and physical chemistry.

Current artificial intelligence models have reached hundreds of billions of artificial neurons. This exponential growth is straining the capabilities of today's hardware.

The paper demonstrates that optical neural network (ONN) methods, which offer high clock speeds, parallelism, and low-loss data transmission, can overcome these limitations.

"Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices," the paper states.

The ONN method is expected to ease the bottlenecks of traditional processors, such as limits on transistor count, the energy cost of moving data, and semiconductor size. ONNs compute with light, which can carry a large amount of information simultaneously thanks to its wide bandwidth and low transmission loss. In addition, many photonic circuits can be integrated to scale up the system.

To compute with light, the MIT-led team used many laser beams, an approach described as "using mass-produced micrometer-scale vertical-cavity surface-emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon-synapse-dendrite' structure in biological neurons."

They believe that the demonstrated system can be scaled up through mature wafer-level manufacturing processes and photonic integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He said, "Our new technology could make it possible to leapfrog to machine-learning models that otherwise would not be reachable in the near future."

The paper, titled "Deep Learning Using Coherent VCSEL Neural Networks", was published by a large team of scientists. The work received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. Three researchers on the team have applied for patents related to this technology.

Source: Laser Network
