This is an AI translated post.

해리슨 블로그

Gemini 1.5 Flash, GPT-4o, and Pricing of Other LLMs

Summarized by durumis AI

  • Compares the performance and pricing of AI models such as GPT-4o, Opus, Gemini 1.5 Pro, Haiku, and Gemini 1.5 Flash, outlines the strengths and weaknesses of each, and recommends a model for each kind of use.
  • Offers a model selection guide based on input token size, output ratio, and task complexity.
  • Uses the performance and pricing information available for the latest AI models as of May 30, 2024.

Google and OpenAI have both announced a ton of new AI-related content in the past two days.

The two things people are generally curious about are:

Performance and price. (Of course there are more features, but that's something professional bloggers will review...)

OpenAI - GPT

As usual with OpenAI, the new GPT-4o is cheaper than its predecessor, GPT-4 Turbo. You can find plenty of performance reviews on other blogs, so let's just talk about price here.

GPT Price Table


Ever since the initial release of GPT-4, OpenAI has lowered the price with each new model while also improving performance. Currently, if you want the most affordable option, GPT-3.5 Turbo is the way to go. For everything else, GPT-4o should do the trick.


Anthropic - Claude 3

Claude 3 Price Table

Anthropic hasn't released any new products recently, but Haiku for value and Opus for high performance make it an LLM company that can't be ignored.

Haiku is the cheapest of the three in terms of input token pricing, making it the most affordable for simple text processing.

In fact, until Gemini Flash came out, Haiku even outperformed Gemini 1.0 Pro, making it a very useful LLM.


Google - Gemini

Gemini Price Table

Google maintains two pricing systems.

One is AI Studio, the other is Vertex AI.

AI Studio is token-based like other companies, while Vertex AI is uniquely priced based on characters.

Looking at the table above, if 1 token averages fewer than 3 characters (1-2 characters), Vertex AI is cheaper. If it averages 3 characters or more, AI Studio is cheaper. English text usually averages more characters per token than that, which makes AI Studio the cheaper choice. Korean text also increasingly has cases where 1 token covers multiple characters...

Anyway, whether you look at input token price or at performance, Gemini 1.5 Flash is much better than 1.0 Pro. For high-performance tasks, 1.5 Pro is superior.
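The break-even between per-token and per-character billing can be sketched as a small calculation. This is only an illustration: the two prices below are placeholders chosen so that the break-even lands at 3 characters per token, not figures taken from the price table, and the function names are made up for this example.

```python
# Placeholder prices (assumptions, not actual quotes), in dollars.
PRICE_PER_MILLION_TOKENS = 0.375  # hypothetical per-token plan
PRICE_PER_MILLION_CHARS = 0.125   # hypothetical per-character plan


def breakeven_chars_per_token(token_price, char_price):
    """Average characters per token at which both plans cost the same."""
    return token_price / char_price


def cheaper_plan(chars_per_token, token_price, char_price):
    """Compare the cost of 1M tokens of text under each billing scheme."""
    cost_by_token = token_price
    # 1M tokens of text contain chars_per_token * 1M characters.
    cost_by_char = char_price * chars_per_token
    if cost_by_char < cost_by_token:
        return "per-character"
    if cost_by_char > cost_by_token:
        return "per-token"
    return "tie"


if __name__ == "__main__":
    print(breakeven_chars_per_token(PRICE_PER_MILLION_TOKENS, PRICE_PER_MILLION_CHARS))
    print(cheaper_plan(2, PRICE_PER_MILLION_TOKENS, PRICE_PER_MILLION_CHARS))
    print(cheaper_plan(4, PRICE_PER_MILLION_TOKENS, PRICE_PER_MILLION_CHARS))
```

To apply this to real text, measure the average characters per token of your own data with the provider's token counter and plug in the current prices.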


Summary

Comprehensive

In terms of performance alone, based on MMLU, it seems to be GPT-4o > Opus > 1.5 Pro.

For the most demanding tasks, use GPT-4o. If you want something a little cheaper, or if your input exceeds 200K tokens (Opus only supports up to 200K), Gemini 1.5 Pro might be a good option. Real-world results can differ from benchmark rankings, so use what works best for you.

If you need to do a lot of text processing on a budget, there are two options:

If the output is small relative to the input (e.g., you feed in a large amount of documents and get back a short result), Claude 3 Haiku is the cheapest. However, Haiku has a high output cost, so if the share of output is higher (e.g., you input a text and instruct the model to modify or rewrite it), Gemini 1.5 Flash is recommended, because Flash has the lowest output cost.
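The trade-off between a model with cheaper input and a model with cheaper output can be checked with a few lines of arithmetic. This is a rough sketch: the two price entries are illustrative placeholders (dollars per 1M tokens) rather than the values from the price tables, and the helper names are hypothetical.

```python
# Placeholder prices in dollars per 1M tokens (assumptions for illustration).
CHEAP_INPUT_MODEL = {"input": 0.25, "output": 1.25}
CHEAP_OUTPUT_MODEL = {"input": 0.35, "output": 1.05}


def request_cost(input_tokens, output_tokens, prices):
    """Dollar cost of one workload given per-1M-token prices."""
    return (
        input_tokens * prices["input"] + output_tokens * prices["output"]
    ) / 1_000_000


def breakeven_output_per_input(cheap_input, cheap_output):
    """Output tokens per input token at which both models cost the same.

    Below this ratio the cheap-input model wins; above it the
    cheap-output model wins.
    """
    return (cheap_output["input"] - cheap_input["input"]) / (
        cheap_input["output"] - cheap_output["output"]
    )


if __name__ == "__main__":
    # Long documents in, short answer out.
    print(request_cost(1_000_000, 10_000, CHEAP_INPUT_MODEL))
    print(request_cost(1_000_000, 10_000, CHEAP_OUTPUT_MODEL))
    # Rewrite task: output roughly as long as the input.
    print(request_cost(100_000, 100_000, CHEAP_INPUT_MODEL))
    print(request_cost(100_000, 100_000, CHEAP_OUTPUT_MODEL))
    print(breakeven_output_per_input(CHEAP_INPUT_MODEL, CHEAP_OUTPUT_MODEL))
```

With these placeholder prices the crossover sits at about one output token for every two input tokens. Substituting current prices gives the crossover for your own workload.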


Summary and Conclusion

"I don't care about the price, I just want the most complex tasks done." -> GPT-4o

"But my input exceeds 128K tokens." (GPT-4o only supports up to 128K) -> Opus

"I need high performance but want a somewhat lower price, or my input exceeds 200K tokens." (Opus only supports up to 200K) -> Gemini 1.5 Pro


"I need the cheapest LLM." -> Haiku

"But my workload is output-heavy, or the input exceeds 200K tokens." -> Gemini 1.5 Flash
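The rules above can be written down as a small lookup function. This is a sketch of one way to encode them, not an official selector: the function name and its flags are invented for this example, and the 128K and 200K limits are the ones quoted in this post.

```python
GPT4O_LIMIT = 128_000   # context limit quoted above for GPT-4o
CLAUDE_LIMIT = 200_000  # 200K limit quoted above for Opus


def recommend(input_tokens, cheapest=False, output_heavy=False, prefer_cheaper=False):
    """Return the model this post suggests for a given situation.

    cheapest:       cost matters more than peak performance
    output_heavy:   the output is large relative to the input
    prefer_cheaper: high performance is needed, at a somewhat lower price
    """
    if cheapest:
        if output_heavy or input_tokens > CLAUDE_LIMIT:
            return "Gemini 1.5 Flash"
        return "Claude 3 Haiku"
    if prefer_cheaper or input_tokens > CLAUDE_LIMIT:
        return "Gemini 1.5 Pro"
    if input_tokens > GPT4O_LIMIT:
        return "Claude 3 Opus"
    return "GPT-4o"


if __name__ == "__main__":
    print(recommend(10_000))
    print(recommend(150_000))
    print(recommend(300_000))
    print(recommend(10_000, cheapest=True))
    print(recommend(10_000, cheapest=True, output_heavy=True))
```

Because prices and context limits change often, treat the constants as a snapshot from May 2024 and update them before relying on the result.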

