Google's EmbeddingGemma: A New Contender for On-Device RAG

I usually default to OpenAI for embeddings, but Google’s new EmbeddingGemma model is a noteworthy development. It’s not just another model; it’s a strategic move that shows real promise for improving Retrieval-Augmented Generation (RAG) pipelines, especially in on-device and edge applications. What is EmbeddingGemma? Google has released EmbeddingGemma as a lightweight, efficient, and multilingual embedding model. At just 308M parameters, it’s designed for high performance in resource-constrained environments. This isn’t just about making a smaller model; it’s about making a capable small model. ...

5 September, 2025 · 2 min · 375 words · Yury Akinin

Google's AI Coding Agent 'Jules' Launches Publicly, Powered by Gemini 2.5

Google has officially moved its asynchronous coding agent, Jules, out of beta and into public availability. The key upgrade is its new engine: Gemini 2.5 Pro, which Google claims enhances its ability to generate high-quality code by first developing a structured plan. From Beta to Public Launch The public launch follows a substantial beta period where thousands of developers tackled tens of thousands of tasks, resulting in over 140,000 code improvements. This feedback has been used to refine the platform, leading to several key enhancements: ...

13 August, 2025 · 2 min · 256 words · Yury Akinin

Google's MLE-STAR: AI Agents That Automate Machine Learning Engineering

Google’s MLE-STAR: AI Agents That Automate Machine Learning Engineering Google Cloud’s research team has unveiled MLE-STAR (Machine Learning Engineering via Search and Targeted Refinement), an AI agent system that marks a significant step toward the full automation of building ML pipelines. For anyone who has spent countless hours engineering features, selecting models, and optimizing hyperparameters, this development is worth paying close attention to. At its core, MLE-STAR moves beyond the limitations of traditional AutoML. Instead of relying on a predefined set of models and techniques, it uses an innovative approach that combines external knowledge with internal optimization. ...

4 August, 2025 · 3 min · 483 words · Yury Akinin