Google recently released T5Gemma 2, a next-generation encoder-decoder model built on the Gemma 3 architecture. Compared with its predecessor, T5Gemma 2 introduces several architectural changes, including tied word embeddings and a merged attention mechanism, which significantly reduce the parameter count. The new model adds multimodal capabilities, understanding images and text together; its context window expands to 128K tokens, greatly improving long-text processing; and it supports over 140 languages. Benchmark results show that T5Gemma 2 surpasses its predecessor on multimodal, long-context, coding, and reasoning tasks. The series offers three sizes of pre-trained models: 270M-270M, 1B-1B, and 4B-4B, suited to on-device applications and downstream task development. The models are now available for download on platforms such as Kaggle and Hugging Face, giving AI researchers and developers a powerful new tool.
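To see why tied word embeddings cut the parameter count of an encoder-decoder model, consider a back-of-the-envelope calculation. The numbers below are illustrative assumptions (a Gemma 3-style 262,144-token vocabulary and a small hidden size), not official T5Gemma 2 hyperparameters:

```python
# Rough sketch of the parameter savings from tying word embeddings
# in an encoder-decoder model. Vocabulary and hidden sizes are
# illustrative assumptions, not official T5Gemma 2 values.

VOCAB_SIZE = 262_144  # assumed Gemma 3-style tokenizer vocabulary
HIDDEN_DIM = 640      # assumed hidden size of a small model

def embedding_params(num_tables: int) -> int:
    """Parameters spent on `num_tables` separate vocab-sized embedding tables."""
    return num_tables * VOCAB_SIZE * HIDDEN_DIM

# Untied: encoder input, decoder input, and decoder output (LM head)
# each keep their own vocab-by-hidden matrix, i.e. three tables.
untied = embedding_params(3)

# Tied: one shared table serves all three roles.
tied = embedding_params(1)

saved = untied - tied
print(f"untied: {untied:,} params")
print(f"tied:   {tied:,} params")
print(f"saved:  {saved:,} params ({saved / untied:.0%} of the embedding budget)")
```

Because the vocabulary is large relative to the hidden size, the embedding tables dominate the parameter budget of small models, which is why tying matters most at the 270M scale.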
Original Link: Hacker News
Latest Comments
I don't think the title of your article quite matches the content. Just kidding; mainly I had some doubts after reading the article.
This state-of-AI research is very thorough, with a large amount of data; it's a valuable reference.
The article is insightful; the development trends of AI models are worth following.
Rich content, and the analysis of future trends is quite on point.
Thank you for sharing. I was worried that I lacked creative ideas, but your article has filled me with hope. Thank you. I do have a question, though; can you help me?
Very practical content; I'd like to learn more related techniques.