Space Industry and Business News
ROBO SPACE
Why tech firms are aiming for smaller, leaner AI models
By Daxia ROJAS
Paris (AFP) Dec 3, 2024

AI firms have long boasted about the enormous size and capabilities of their products, but they are increasingly looking at leaner, smaller models that they say will save on energy and cost.

Programs like ChatGPT are underpinned by algorithms known as "large language models", and the GPT-4 model behind the chatbot is widely reported to have nearly two trillion "parameters" -- the building blocks of the models.

The vast size of GPT-4 allows ChatGPT to handle queries about anything from astrophysics to zoology.

But if a company needs a program with knowledge only of, say, tigers, the algorithm can be much smaller.

"You don't need to know the terms of the Treaty of Versailles to answer a question about a particular element of engineering," said Laurent Felix of Ekimetrics, a firm that advises companies on AI and sustainability.

Google, Microsoft, Meta and OpenAI have all started offering smaller models.

Amazon also offers models of all sizes on its cloud platform.

Kara Hurst, Amazon's chief sustainability officer, said at a recent event in Paris that the trend showed the tech industry was moving towards "sobriety and frugality".

- Energy needs -

Smaller models are better for simple tasks like summarising and indexing documents or searching an internal database.

US pharmaceutical company Merck, for example, is developing a model with Boston Consulting Group (BCG) to understand the impact of certain diseases on genes.

"It will be a very small model, between a few hundred million and a few billion parameters," said Nicolas de Bellefonds, head of AI at BCG.

Laurent Daudet, head of French AI startup LightOn, which specialises in smaller models, said they had several advantages over their larger siblings.

They were often faster and able to "respond to more queries and more users simultaneously", he said.

He also pointed out that they were less energy-hungry -- the potential climate impact being one of the major concerns over AI.

Huge arrays of servers are needed to "train" the AI programs and then to process queries.

These servers -- made up of highly advanced chips -- require vast amounts of electricity both to fuel their operation and to cool them down.

Daudet explained that the smaller models needed far fewer chips, making them cheaper and more energy efficient.
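
As a rough back-of-the-envelope sketch of why fewer parameters can mean fewer chips: the memory needed just to hold a model's weights is approximately the parameter count multiplied by the bytes stored per parameter. The figures below are illustrative assumptions, not numbers supplied by the firms quoted: 16-bit (2-byte) weights, 300 million and 3 billion as stand-ins for "a few hundred million" and "a few billion" parameters, and two trillion for the upper figure cited for GPT-4. Real deployments also need memory for activations and caches, which this ignores.

```python
# Assumption: each parameter is stored as a 16-bit number (2 bytes).
BYTES_PER_PARAM = 2


def weight_memory_gb(num_params: int, bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Return the memory needed to hold the weights alone, in gigabytes (1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Illustrative parameter counts (hypothetical stand-ins, see the note above).
EXAMPLES = [
    ("small model (300 million)", 300_000_000),
    ("small model (3 billion)", 3_000_000_000),
    ("very large model (2 trillion)", 2_000_000_000_000),
]

for label, num_params in EXAMPLES:
    print(f"{label}: {weight_memory_gb(num_params):,.1f} GB")
```

Under those assumptions the weights alone come to about 0.6 GB, 6 GB and 4,000 GB respectively, which is why the smallest models can fit on a single chip or even a phone while the largest must be spread across many.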

- Multi-model future -

Other proponents point out that smaller models can run without data centres at all when installed directly on devices.

"This is one of the ways to reduce the carbon footprint of our models," Arthur Mensch, head of French start-up Mistral AI, told the Liberation newspaper in October.

Laurent Felix pointed out that direct use on a device also meant more "security and confidentiality of data".

The programs could potentially be trained on proprietary data without fear of it being compromised.

The larger programs, though, still have the edge for solving complex problems and accessing wide ranges of data.

De Bellefonds said the future was likely to involve both kinds of models talking to each other.

"There will be a small model that will understand the question and send this information to several models of different sizes depending on the complexity of the question," he said.

"Otherwise, we will have solutions that are either too expensive, too slow, or both."

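The arrangement De Bellefonds describes, in which a small model reads the question and passes it to a model sized for its difficulty, could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not any firm's actual system: the `Model` class, the `estimate_complexity` scoring rule (a toy word count standing in for the small "router" model), the three tiers and their thresholds are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Model:
    name: str
    max_complexity: float  # highest complexity score this tier should handle
    answer: Callable[[str], str]  # placeholder for a real model call


def estimate_complexity(question: str) -> float:
    """Toy stand-in for the small router model: longer questions score higher.

    Returns a score between 0.0 and 1.0.
    """
    words = len(question.split())
    return min(words / 50.0, 1.0)


def route(question: str, models: List[Model]) -> Model:
    """Pick the smallest tier whose ceiling covers the estimated complexity."""
    score = estimate_complexity(question)
    for model in sorted(models, key=lambda m: m.max_complexity):
        if score <= model.max_complexity:
            return model
    # No tier covers the score: fall back to the most capable one.
    return max(models, key=lambda m: m.max_complexity)


# Hypothetical tiers; the lambdas only tag the question with the tier name.
MODELS = [
    Model("small", 0.2, lambda q: f"[small] {q}"),
    Model("medium", 0.6, lambda q: f"[medium] {q}"),
    Model("large", 1.0, lambda q: f"[large] {q}"),
]

short_q = "What do tigers eat?"
long_q = " ".join(["word"] * 40)  # a 40-word question

print(route(short_q, MODELS).name)
print(route(long_q, MODELS).name)
```

With these made-up thresholds the four-word question goes to the small tier and the 40-word one to the large tier. A real router would judge difficulty with a trained model rather than a word count, but the cost logic is the same: only the questions that need the expensive model ever reach it.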


The content herein, unless otherwise known to be public domain, is Copyright 1995-2024 - Space Media Network. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service.