AInsights: Your executive-level insights on the latest in generative AI
Google is making up for lost time in the AI race by following the age old Silicon Valley mantra, move fast and break things.
Google recently released Gemini 1.5, and it’s next level! This release dethrones Anthropic’s brief reign as leading foundation model. But by the time you read this, OpenAI will also have made an announcement about ChatGPT improvements. It’s now become a matter of perpetual leapfrogging, which benefits us as users, but makes it difficult to keep up! Note: I’ll follow up with a post about ChatGPT/DALL-E updates.
Here’s why it matters…
1 million tokens: It’s funny. I picture Dr. Evil raising his pinky to his mouth as he says, “1 million tokens.” Gemini 1.5 boasts a dramatically increased context window with the ability to process up to 1 million tokens. Think of tokens as inputs, i.e. words or parts of words, in a single context window. This is a massive increase from previous models like Gemini 1.0 (32k tokens) and GPT-4 (128k tokens). It also surpasses Anthropic’s context record at 200,000 tokens.
A 1 million token context window allows Gemini 1.5 to understand and process huge amounts of data. This unlocks multimodal super-prompting and higher caliber outputs. A 1 million token context window can support extremely long books, documents, scripts, codebases, video/audio files, specifically:
1 hour of video 🎥
11 hours of audio 🎶 🎙️
30,000 lines of code 🧑💻
700,000 words of text ⌨️
https://twitter.com/briansolis/status/1789375696961262062
This long context capability enables entirely new use cases that were not possible before, like analyzing full books, long documents, or videos in their entirety.
Chrome Support: With Perplexity creeping into Google’s long-dominate grasp on internet search (more about the future of search here), Google is at least integrating Gemini prompting directly in the Chrome browser. Here’s how to do it 👇
https://twitter.com/briansolis/status/1787243863788388608
More countries, more languages supported: Gemini 1.5 is available in 100 additional countries and now offers support for 9 additional languages. Languages supported include English, Japanese, and Korean: Arabic, Bengali, Bulgarian, Chinese (simplified and traditional), Croatian, Czech, Danish, Dutch, Estonian, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Indonesian, Italian, Latvian, Lithuanian, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Thai, Turkish, Ukrainian, Vietnamese.
Google has put Gemini 1.5 through rigorous safety evaluations to analyze potential risks and harms before release. Novel safety testing techniques were developed specifically for 1.5’s long context capabilities.
That’s your AInsights in a snapshot to help you make sense of Google’s latest genAI news.
Please subscribe to AInsights, here.
If you’d like to join my master mailing list for news and events, please follow, a Quantum of Solis.
Brian Solis | Author, Keynote Speaker, Futurist
Brian Solis is world-renowned digital analyst, anthropologist and futurist. He is also a sought-after keynote speaker and an 8x best-selling author. In his new book, Lifescale: How to live a more creative, productive and happy life, Brian tackles the struggles of living in a world rife with constant digital distractions. His previous books, X: The Experience When Business Meets Design and What’s the Future of Business explore the future of customer and user experience design and modernizing customer engagement in the four moments of truth.
Invite him to speak at your next event or bring him in to your organization to inspire colleagues, executives and boards of directors.
21z61p
We have learned a lot from your post, which is very satisfying to me. thank you so much.