先做个广告:需要购买Gemini帐号或代充值Gemini会员,请加微信:gptchongzhi
点蓝色字关注“机器学习算法工程师”
推荐使用Gemini中文版,国内可直接访问:https://ai.gpt86.top
设为星标,干货直达!
谷歌的CEO 桑达尔・皮查伊终于管宣Gemini 1.0,Gemini 大模型是和GPT-4一样的大模型,在各个方面的评测上均超过GPT-4!
下面是来自谷歌 CEO 皮查伊的声明:
Every technology shift is an opportunity to advance scientific discovery, accelerate human progress, and improve lives. I believe the transition we are seeing right now with AI will be the most profound in our lifetimes, far bigger than the shift to mobile or to the web before it. AI has the potential to create opportunities — from the everyday to the extraordinary — for people everywhere. It will bring new waves of innovation and economic progress and drive knowledge, learning, creativity and productivity on a scale we haven’t seen before.
That’s what excites me: the chance to make AI helpful for everyone, everywhere in the world.
Nearly eight years into our journey as an AI-first company, the pace of progress is only accelerating: Millions of people are now using generative AI across our products to do things they couldn’t even a year ago, from finding answers to more complex questions to using new tools to collaborate and create. At the same time, developers are using our models and infrastructure to build new generative AI applications, and startups and enterprises around the world are growing with our AI tools.
This is incredible momentum, and yet, we’re only beginning to scratch the surface of what’s possible.
We’re approaching this work boldly and responsibly. That means being ambitious in our research and pursuing the capabilities that will bring enormous benefits to people and society, while building in safeguards and working collaboratively with governments and experts to address risks as AI becomes more capable. And we continue to invest in the very best tools, foundation models and infrastructure and bring them to our products and to others, guided by our AI Principles.
Now, we’re taking the next step on our journey with Gemini, our most capable and general model yet, with state-of-the-art performance across many leading benchmarks. Our first version, Gemini 1.0, is optimized for different sizes: Ultra, Pro and Nano. These are the first models of the Gemini era and the first realization of the vision we had when we formed Google DeepMind earlier this year. This new era of models represents one of the biggest science and engineering efforts we’ve undertaken as a company. I’m genuinely excited for what’s ahead, and for the opportunities Gemini will unlock for people everywhere.
– Sundar
这次发布的Gemini 1.0共包括三个不同大小的版本:
Gemini Ultra — 最大且最具能力的模型,用于处理高度复杂的任务。
Gemini Pro — 最优秀的模型,适用于广泛范围的任务扩展。
Gemini Nano — 最高效的模型,用于设备端任务。
Gemini Ultra 在大型语言模型研发被广泛使用的 32 个学术基准测试集中,在其中 30 个测试集的性能超过当前 SOTA 结果。
Gemini 1.0 具有复杂多模态推理能力,最重要的多模态能力是和GPT-4V一样支持图像理解,下面的示例展示了如何使用Gemini来生成用于重新排列子图的matplotlib代码:
在各个图像多模态评测集上,Gemini Ultra也基本全方位超过GPT-4V:
除了支持图像,Gemini还支持视频和语音,其中语音识别上超过OpenAI的Whisper:
当然你可以把不同多模态结合在一起,比如同时输入图像和语音:
此外,Gemini还原生支持图像生成,而无需依赖可能阻碍模型表达图像能力的中间自然语言描述。
而且,谷歌使用了Gemini的专门版本来创建了更先进的代码生成系统AlphaCode 2。这个系统擅长处理那些超出常规编程范围、涉及复杂数学和理论计算机科学的竞赛级编程问题。
经过与原始 AlphaCode 在相同平台上进行评估,AlphaCode 2 展现出巨大的改进,解决的问题数量几乎是原来的两倍:
谷歌终于王者归回,OpenAI的GPT时代要终结了?
推荐阅读
使用PyTorch 2.0加速Transformer:训练推理均拿下!
硬核解读Stable Diffusion(系列三)
硬核解读Stable Diffusion(系列二)
硬核解读Stable Diffusion(系列一)
带你入门扩散模型:DDPM
机器学习算法工程师
一个用心的公众号