InfoQ Live Logo

The Software Architects' Newsletter
May 2025
View in browser

Welcome to the InfoQ Software Architects' Newsletter! We bring you essential news and experience on emerging patterns and technologies from industry peers each month.

This month, we again focus on "AI Trends: Disrupting Software Tooling, Techniques, and Teams". Technologies, patterns, and practices from this topic span the entire "diffusion of innovation" graphs in our InfoQ Trends Reports 2024 eMag and our InfoQ AI, ML and Data Engineering Trends Report - September 2024 (and accompanying podcast).

There's no denying that LLMs have dominated the conversation around software engineering for the past year, but at InfoQ, we focus on the impact this is having within the enterprise and on real-world software delivery projects.

News

From Architecture to Deployment: How AI-Powered Toolkits Are Unifying Developer Workflows

Developer tooling is undergoing a shift as AI moves beyond code completion to unify multiple stages of the software development workflow. Recent announcements like GitHub Copilot Workspace and Claude 3's system-level reasoning showcase how AI is beginning to assist not just with coding, but also with planning, documentation, and testing.

Emerging projects like DevFlow, which integrates documentation generation, test suggestions, and architecture diagramming, offer an early glimpse into how these trends may reshape developer environments in the near future.

OpenAI and Windsurf Release New Coding Models

OpenAI has launched Codex, a research preview of a cloud-based software engineering agent designed to automate common development tasks such as writing code, debugging, testing, and generating pull requests. Integrated into ChatGPT for Pro, Team, and Enterprise users, Codex runs each assignment in a secure sandbox environment preloaded with the user's codebase and configured to reflect their development setup.

Separately, Windsurf has introduced its first set of SWE-1 models, aimed at supporting the full range of software engineering tasks, not limited to code generation. The lineup consists of three models SWE-1, SWE-1-lite, and SWE-1-mini, each designed for specific scenarios.

Akka Launches New Deployment Options for Agentic AI

Akka, best known for its tools for building resilient and elastic distributed systems, has launched new deployment features designed to simplify the deployment of large-scale agentic AI systems. Akka highlights that shifting towards agentic architectures requires a fundamental change from transaction-centered to conversation-centered systems. There are many similarities with existing actor-based systems that Akka has developed.

Scaling Financial Operations: Uber’s GenAI-Powered Approach to Invoice Automation

Uber recently described a GenAI-powered invoice processing system that reduced manual effort by 50%, cut handling time by 70%, and delivered 25–30% cost savings. By leveraging GPT-4 and a modular platform called TextSense, Uber improved data accuracy by 90%, enabling globally scalable, efficient, and highly automated financial operations.

Google Unveils Ironwood TPU for AI Inference

Google has unveiled its seventh-generation Tensor Processing Unit (TPU), Ironwood, at Google Cloud Next 25. Ironwood is Google's most performant and scalable custom AI accelerator to date and the first TPU designed specifically for inference workloads.

Google emphasises that Ironwood is designed to power what they call the "age of inference", marking a shift from responsive AI models to proactive models that generate insights and interpretations. The company states that AI agents will use Ironwood to retrieve and generate data, delivering insights and answers.

Case Study

Beyond the Gang of Four: Practical Design Patterns for Modern AI Systems

The Gang of Four's 23 object-oriented patterns shaped how an entire generation of developers designed software. In the 2010s, cloud computing introduced patterns such as publish-subscribe ("pub-sub"), microservices, event-driven workflows, and serverless models, which now power most cloud-based distributed systems.

Similarly, before the current AI boom, the machine learning community had already developed "ML design patterns". When you build and deploy ML models, you face specific challenges, and patterns like Checkpointing, Feature Stores, and Versioning have become standard practice.

Why should you care about these patterns? They help you solve known problems in standardized ways. Instead of reinventing solutions, you use a shared vocabulary. When you say "Singleton", "Pub-Sub", or "Feature Store", your team immediately understands your approach. This speeds up your development, reduces errors, and makes your systems easier to maintain.

Modern AI systems present new challenges that neither classic software nor conventional machine learning (ML) patterns fully address.

For example, how do you guide model output and prevent misleading content? How do you build user experiences that help users understand, trust, and effectively use AI-powered applications? How do you manage agent interactions in multi-agent systems? How do you reduce computational costs to make your product sustainable?

Many AI patterns have emerged across the industry to help develop a well-architected AI system. The full version of this article demonstrates how existing patterns fit together, organized into five categories that build on each other as you scale your AI system.

  • Prompting and Context Patterns to craft effective instructions and provide relevant context to guide the model's output
  • Responsible AI Patterns to ensure ethical, fair, and trustworthy outputs
  • User Experience Patterns to build intuitive interactions
  • AI-Ops Patterns to manage AI at scale
  • Optimization Patterns to maximize efficiency and reduce cost

This content is an excerpt from a recent InfoQ article by Rahul Suresh, "Beyond the Gang of Four: Practical Design Patterns for Modern AI Systems".

To get notifications when InfoQ publishes content on these topics, follow "AI, ML & Data Engineering", "Machine Learning", and "Large Language Models (LLMs)" on InfoQ.

Missed a newsletter? You can find all of the previous issues on InfoQ.

Sponsored

The APIOps Playbook: Architecting Scalable and Secure API Ecosystems - Sponsored by Boomi

As APIs become core to digital architecture, sprawl, governance gaps, and performance issues can hinder agility and innovation. The APIOps Playbook provides architects with strategic guidance to operationalize API management at scale. Explore patterns for federated governance, built-in security, and streamlined API delivery that align with enterprise architecture goals. Learn how to evolve APIOps into a disciplined practice that enhances control, resilience, and developer productivity across complex, distributed systems.

Download the white paper “The APIOps Playbook: Managing APIs with Speed, Security, and Scale” sponsored by Boomi

Upcoming Events

QCon: For practitioners, by practitioners

InfoQ Dev Summit Boston 2025 (June 9–10): Last chance to register

If you're tackling complex development challenges and need real-world solutions from experienced practitioners, don’t miss this. In just two weeks, join senior developers for practical talks on AI/ML, modern architectures, performance, resilience, and DX. Explore the schedule and save your seat before it’s too late!


InfoQ Dev Summit Munich 2025 (October 15–16): Lead through the next wave of software change

Across Europe, senior developers are facing AI disruption, rising cloud complexity, and pressure from evolving EU regulations. Join peers for two days of practical talks on secure AI adoption, resilient systems, platform engineering, and regulatory strategy. See the schedule and save with early bird pricing through June 10.


QCon San Francisco 2025 (November 17–21): Navigate what’s next in software

From AI-enabled workflows to modern platform engineering and distributed architectures, software complexity is surging. Join senior engineers for 12 curated tracks and 60+ peer-driven sessions on scaling systems and making smarter tech decisions. Register now. Early bird pricing ends June 10.


QCon AI New York 2025 (December 16–17): Build AI systems that deliver

Move beyond prototypes. Learn how real teams are scaling AI across the SDLC, from architecture and integration to MLOps, compliance, and business impact. Two days of peer-driven talks focused on production-grade AI. Secure your spot. Early bird pricing ends June 10.

About InfoQ

Senior software developers rely on the InfoQ community to keep ahead of the adoption curve. One of the main reasons software architects and engineers tell us they keep coming back to InfoQ is because they trust the information provided and selected by their peers.

We’ve been helping software development teams adopt new technologies and practices for over 19 years through InfoQ articles, news items, podcasts, tech talks, trends reports, and QCon software development conferences.

We hope you find this newsletter useful. If not, you can unsubscribe using the link below.

Unsubscribe

Forwarded email? Subscribe and get your own copy.

Subscribe

Follow InfoQ on:

You have received this email because you subscribed to "The Architects' Newsletter". To stop receiving the Architects' Newsletter, please click the following link: Unsubscribe

- - -

C4Media Inc. (InfoQ.com), 705-2267 Lake Shore Blvd. West,
Toronto, Ontario, Canada, M8V 3X2