The AI wars: Microsoft unveils Phi-3, a capable AI model that fits easily on a phone

In response to Meta's recent Llama-3 release, Microsoft has published findings on the latest iteration of its lightweight AI model. The technical report shows the Phi-3-mini outperforming LLMs such as GPT-3.5 despite being a fraction of their size.

Sarfo Ashong-Listowell, Published 04/26/2024 🇮🇹 🇵🇱 ...

Microsoft launched Phi-3 earlier this week on HuggingFace, Ollama and the Azure AI catalog. While it does not quite match the general knowledge skills of Windows Copilot, the open-source AI technology represents the fourth generation of small language models from Redmond that rival mainstream LLMs in speed, efficiency and performance.

At 3.8 billion parameters, Phi-3 is slightly larger than its predecessor but remains small enough to run on as little as 1.8GB of mobile storage. For comparison, a typical complex LLM such as Llama or GPT-3.5 utilizes hundreds of billions of parameters to comprehend input and is impractical to store natively. GPT-5, launching this summer, is expected to be trillions of parameters in size. By conventional scaling laws, more parameters means more intelligent results. But according to Microsoft, this might not necessarily be the case.

Graph comparing the Phi-3 models to Llama-3, Gemma, and Mixtral (Source: Microsoft)

Microsoft makes some bold claims in its technical report; chief among them being the performance benchmarks which are, by the company's own admission, purely academic. In 12 out of 19 benchmark tests, Phi-3-mini appears to outperform Llama-3-instruct despite running on more than twice as many parameters. With the 7B Phi-3-small, and 14B Phi-3-medium, the results were even more staggering.

The engineers attribute these efficiency gains to their carefully curated training dataset derived from two sources: ‘textbook-quality’ web content and AI-generated data designed to teach language, general knowledge and common sense reasoning with a cherry-picked list of 3000 words serving as building blocks. Microsoft’s researchers claim that this sort of data recipe enabled last year's Phi-2 to match the performance of Meta's considerably larger (70 B) Llama-2 model.

Phi-3 benchmark comparison with major LLMs. (Source: Azure)

Eric Boyd, corporate VP of Azure AI, boasts through The Verge that Phi-3 is just as capable as GPT-3.5, albeit in a “smaller form factor”. However, Phi-3 continues to be plagued by a deficiency in factual knowledge due to its limited size. Perhaps this a necessary trade-off for AI to run natively instead of via cloud computing?

Considering how flexibility and cost-efficiency are key issues for businesses, it is not surprising that companies have already begun to harness the capabilities of SLMs. However, Phi-3 has keen competition. Meta's Llama-3, Anthropic’s Claude-3 suite, Google Gemini and Gemma all have lightweight versions that are capable of supporting edge computing on mobile. And although Phi-3 seems to compete favorably, Gemini Nano has already made it to devices like the Google Pixel 8 Pro and Samsung Galaxy S24 series ($784 on Amazon).

The Phi-3 family of AI models is by no means the only SLM Microsoft has been working on. Last month, the company adapted Mistral to create Orca-Math, a specialized model that was shown to be considerably more accurate than Llama, GPT-3.5 and Gemini Pro at grade school math. AutoDev, a more recent project, draws upon AutoGen and Auto-GPT to autonomously plan and execute programming tasks based on user-defined objectives. The AI wars are far from over, but at least on the lower scale, we have a leading contender.

Orca-Math achieves an 86.8% pass rate at the GSM8K problems, outperforming every other model tested. (Image source: Microsoft)

Overview of the AutoDev framework(Image source: Microsoft Research)

Source(s)

Azure, Microsoft Blog

Better late than never? (Source: Anthropic)

Last in line: Anthropic rolls out Claude mobile app for iOS 05/03/2024

Apple hints at on-device AI with an open-source language model 05/01/2024

Casas dos sonhos acessíveis e amigáveis ao clima podem ser possíveis graças a um novo arquiteto de IA e à impressão 3D (imagem: Icon)

Casas dos sonhos baratas e amigáveis ao clima: Novo arquiteto com IA e impressão 3D transformam o setor de construção 04/27/2024

Affordable and climate-friendly dream homes could be possible thanks to a new AI architect and 3D printing (image: Icon)

Cheap, climate-friendly dream homes: New AI architect and 3D printing transform construction industry 04/27/2024

Shy Kids made Air Head in collaboration with OpenAI's Sora video generation model. (Image source: Shy Kids on YouTube)

OpenAI's Sora finicky to work with, needs hundreds of prompts, serious VFX work for under 2 minutes of cohesive story 04/27/2024

According to a report from South Korea, the Samsung Galaxy Watch7 could already offer non-invasive blood sugar monitoring. (Image: AliExpress)

Samsung Galaxy Watch7 is said to already offer a non-invasive blood sugar monitor thanks to AI 04/26/2024

Samsung may well have confirmed the existence of Google's Gemini Nano 2, which is to be used in the Galaxy S25, for the first time. (Image: SK, Youtube)

Samsung "confirms" Galaxy S25 will have even more Galaxy AI power thanks to Google's Gemini Nano 2 04/21/2024

OnePlus x Google AI is on the way. (Source: OnePlus)

OnePlus and OPPO smartphones to adopt Gemini and Cloud AI from Google 04/12/2024

Google has confirmed the Pixel 8 will get Gemini Nano with the next Pixel Feature Drop (image via Notebookcheck)

Google Pixel 8 to get Gemini Nano later this year 03/28/2024

The next iteration of OpenAI's GPT LLM is only a few short months away. (Image: OpenAI)

ChatGPT-5 said to be on track for a summer release 03/20/2024

Free users of Microsoft Copilot now have access to the advanced GPT-4 Turbo (Image source: Microsoft)

Microsoft Copilot now offers OpenAI's GPT-4 Turbo for free 03/14/2024

Researchers say that LLM makers like OpenAI need to more thoroughly vet their AIs for

Even after anti-racism training AI chatbots like ChatGPT still exhibit racial prejudice 03/11/2024

Qualcomm's new AI Hub offers a wealth of resources for developers. (Image: Qualcomm)

Qualcomm AI Hub offers access to generative AI models compatible with its Snapdragon chips 03/09/2024

Klarna's new ChatGPT-inspired AI is said to be better than human employees (image: Klarna)

Klarna freezes hiring: AI chatbot to provide better experience than human employees 02/29/2024

Vision Pro owners can now use ChatGPT (Image source: Apple)

ChatGPT app is now available for Apple Vision Pro 02/06/2024

A MacBook being used to develop ML models. (Image: Apple)

Apple finally gets serious about gen AI with the release of MLX, an ML framework for devs 12/12/2023

Meta offers its Llama 2 AI model free-of-cost for commercial use. (Source: Meta)

Human-like AI chatbots with varied personalities could come to Meta platforms as early as September 08/02/2023

Loading Comments

Comment on this article

Unreal Engine 5.4 brings many new f...

Samsung Galaxy Watch7 is said to al...

Sarfo Ashong-Listowell - News Writer - 46 articles published on Notebookcheck since 2023

I was fortunate to be exposed to the awesomeness of tech as a child. I delighted in seeking out the nerdiest sci-fi gadgets I could afford to play with. These days I take a professional interest in biotech, especially health-tracking wearables, and futuristic smart home appliances. If you ever come to Unilag's College of Medicine, you'll probably find me geeking about some biomedical discovery. That's if I'm not scrolling YouTube shorts. Or sleeping.

Please share our article, every link counts!

> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2024 04 > The AI wars: Microsoft unveils Phi-3, a capable AI model that fits easily on a phone

Sarfo Ashong-Listowell, 2024-04-26 (Update: 2024-04-26)

The AI wars: Microsoft unveils Phi-3, a capable AI model that fits easily on a phone

Source(s)

Related Articles