Archive of posts with category 'AI'

Responsible AI GRC on AWS: Bedrock Agent Controls for Financial Services

AWS updated its responsible AI governance, risk, and compliance guidance for financial services on May 13, 2026. The useful part is not another principles list. The useful part is turning...

SageMaker Data Agent with IAM Identity Center: Secure AI Data Workflows

AWS announced SageMaker Data Agent availability for IAM Identity Center domains on May 13, 2026. That is a quiet sentence with a big governance implication: natural-language data analysis is only...

Agentic App Modernization on AWS: Strands, Transform Custom, and Bedrock AgentCore

AWS published an agentic modernization architecture in May 2026 that combines Strands, AWS Transform custom, and Bedrock AgentCore. The tempting headline is simple: agents can modernize large code portfolios. The...

Docker Model Runner and Open WebUI: Local AI Infrastructure That DevOps Teams Can Actually Operate

Local AI stopped being a weekend toy when developer workstations started handling private code, tickets, logs, and architecture notes. Docker Model Runner plus Open WebUI is interesting because it wraps...

SageMaker Capacity-Aware Inference: Surviving GPU Shortages Without Manual Endpoint Retries

A real-time AI endpoint that fails before serving its first request is not an inference platform. It is a capacity bet. On May 4, 2026, SageMaker AI added instance pools...

OpenAI on Amazon Bedrock: Codex, GPT-5.5, and Managed Agents for AWS Teams

On April 28, 2026, AWS announced something that changes the enterprise AI architecture conversation: OpenAI models, Codex on Amazon Bedrock, and Amazon Bedrock Managed Agents powered by OpenAI are coming...

Amazon Bedrock Trust and Safety: A Production Checklist for AI Apps

AWS published Bedrock trust-and-safety guidance on April 29, 2026, and two numbers should catch every AI platform team’s attention: AWS cites an 82% improvement in employee trust when organizations communicate...

GitHub Copilot Usage-Based Billing: Budget Controls for DevOps Teams

GitHub Copilot changes its billing model on June 1, 2026. Premium request units go away, GitHub AI Credits become the unit of usage, and the expensive part is no longer...

AWS Bedrock AgentCore: Building Production AI Agents in 2026

I spent most of last year watching teams try to build AI agents from scratch. The common thread: they underestimated the infrastructure. Everyone focuses on the model choice, spends a...

GitHub Copilot vs Kiro for DevOps: 2026 Showdown

I’ve spent the last three months using both GitHub Copilot and Kiro on actual DevOps work. Not toy examples. Not “write a hello world Lambda.” Real infrastructure code: Terraform modules...

AWS Lambda S3 Files: Mount S3 as a File System and Kill the /tmp Copy Loop

On April 21, 2026, AWS Lambda removed one of the dumbest pieces of plumbing in a lot of serverless code: the “download from S3, write to /tmp, process, upload to...

Amazon Rekognition Availability Change: Replacing Streaming Video Analysis and Batch Image Moderation

AWS has set the same April 30, 2026 new-customer cutoff for two Amazon Rekognition capabilities: Streaming Video Analysis and Batch Image Content Moderation. Existing accounts that used the affected features...

Amazon Comprehend Feature Availability Change: Replacing Topic Modeling, Event Detection, and Prompt Safety

AWS has set an important cutoff for Amazon Comprehend users: topic modeling, event detection, and prompt safety classification stop being available to new customers on April 30, 2026. Existing AWS...

Amazon Bedrock AgentCore New Features: CLI, Harness, Persistent Files, and Coding Agent Skills

On April 22, 2026, AWS added a set of Amazon Bedrock AgentCore features that changes the early developer workflow more than the runtime architecture. The update is not just “more...

Deploying MCP Servers on Amazon ECS: When Fargate Beats Lambda or AgentCore

AWS published its official MCP-on-ECS walkthrough on April 14, 2026. That matters because the runtime question is now the real question. Most teams no longer struggle with the idea of...

Amazon Bedrock Guardrails Cross-Account: Centralized AI Safety Across AWS Organizations

AWS shipped Bedrock Guardrails cross-account enforcement on April 3, 2026, and this is exactly the kind of feature security teams ask for after the first wave of internal AI pilots...

Amazon Bedrock Granular Cost Attribution: Who Spent What and Why

Amazon Bedrock got a billing upgrade that matters more than it sounds. If several teams, applications, or engineers call the same model endpoints, the monthly bill used to tell you...

Accelerate Region Expansion with the AWS Knowledge MCP Server

The AWS Knowledge MCP Server is the kind of infrastructure that becomes obvious only after you use it. It went generally available on October 1, 2025, and the pitch is...

Secure AI Agent Access Patterns to AWS Resources Using MCP

AWS made the control question explicit in April 2026: when an AI agent touches AWS, which IAM principal is actually acting? That is the part people skip, and it is...

Docker Sandboxes and MicroVMs: A Practical Security Model for Local AI and Untrusted Code

Docker’s March 2026 security push is not subtle. The company said over a quarter of production code is now AI-authored, and that developers using agents are merging roughly 60% more...

Running AI Agents on Kubernetes: Agent Sandbox, AI Gateway, and the Platform Gaps They Fix

Kubernetes is finally getting serious about the parts of AI systems that do not fit a normal Deployment. On March 9, 2026, the Kubernetes community announced the AI Gateway Working...

AI on EKS: A Practical Guide to Scalable GPU and Neuron Workloads

AWS keeps pushing Amazon EKS deeper into AI infrastructure for a reason: it scales, it is familiar, and it already sits in a lot of enterprise networking and identity stacks....

Amazon Bio Discovery: AWS Turns Antibody Design into a Lab-in-the-Loop AI Workflow

AWS launched Amazon Bio Discovery on April 14, 2026. What stood out to me was simple: AWS did not ship a generic model endpoint for biotech teams. It shipped a...

Spring AI + Amazon Bedrock + MCP: A Practical Java Stack for AI Agents

Java teams already have enough framework churn. Most of them are not looking for a new agent platform. They want to keep Spring Boot, add model access, expose a few...

Amazon S3 Vectors vs Gemini File Search: Two Very Different Answers to the Same RAG Problem

AWS rolled out S3 Vectors in preview on July 15, 2025. Google put Gemini File Search into public preview on November 6, 2025. That changed the retrieval conversation. A year...

Bedrock AgentCore Stateful MCP Servers: Elicitation, Sampling, and Long-Lived Context

On March 10, 2026, AWS added stateful MCP server features to Amazon Bedrock AgentCore Runtime. If you only read the headline, it sounds like a protocol update. It is more...

AWS Agent Registry Preview: Govern Agents, MCP Servers, and Skills at Scale

On April 9, 2026, AWS launched AWS Agent Registry in preview inside Amazon Bedrock AgentCore. That launch matters because most teams are no longer struggling with a single agent demo....

Hybrid RAG on AWS: Amazon Bedrock and OpenSearch That Hold Up in Production

On March 1, 2024, AWS added hybrid search to Knowledge Bases for Amazon Bedrock for Amazon OpenSearch Serverless. On March 27, 2025, AWS added Amazon OpenSearch Managed Cluster as a...

How to Test AI Agents in CI/CD with Bedrock AgentCore Evaluations

AWS made Amazon Bedrock AgentCore Evaluations generally available on March 31, 2026. That launch matters because it answers the first serious production question every agent team eventually hits: how do...

Build Coding Agents on AgentCore: Shell Commands and Persistent Session Storage

Amazon Bedrock AgentCore got two features in March 2026 that matter far more than the marketing language around them. On March 17, 2026, AWS launched shell command execution in AgentCore...

Kiro: AWS's Agentic AI IDE Built Around Spec-Driven Development

AWS launched Kiro on July 14, 2025. It’s an agentic IDE built on Code OSS (the open-source foundation of VS Code) and it makes a specific bet: the biggest problem...

Customize Amazon Nova Models with Bedrock Fine-Tuning

Training a custom Nova model on Bedrock costs less than you probably expect. A supervised fine-tuning job on roughly 5,000 conversation examples — enough data to meaningfully shift model behavior...

Amazon Bedrock Model Lifecycle: ACTIVE, LEGACY, and End-of-Life

On April 28, 2026, Claude 3.7 Sonnet reached end-of-life on Amazon Bedrock. Calls to anthropic.claude-3-7-sonnet-20250219-v1:0 returned a ValidationException with the message “The provided model identifier is invalid.” No warning in...

Amazon Bedrock Cost Allocation by IAM User and Role

When three teams share a single AWS account and all call Bedrock, figuring out who spent what requires more than just looking at the monthly bill. The “Amazon Bedrock” line...

Bedrock Agents vs Direct Nova Pro API: Cost and Latency at Scale

At 1,000 multi-step requests per day, Bedrock Agents costs roughly $864/month. Running the same workload against Nova Pro directly with a custom orchestration layer costs about $137/month. That gap —...

Bedrock AgentCore Gateway Server-Side Tool Execution: Cleaner Than Client Tool Loops

On February 24, 2026, AWS announced server-side tool execution for Amazon Bedrock through Amazon Bedrock AgentCore Gateway integration with the Responses API. That launch changes a stubborn problem in agent...

Bedrock AgentCore AG-UI: Real-Time Agent Frontends Without Custom Event Plumbing

On March 13, 2026, AWS announced that Amazon Bedrock AgentCore Runtime now supports the Agent-User Interaction protocol, better known as AG-UI. That is more important than it sounds. Most agent...

Amazon S3 Files: Your S3 Bucket Now Has a File System

The announcement in April 2026 was surprisingly quiet for something that changes a real pain point. AWS shipped S3 Files — a feature that mounts an S3 bucket as a...

Unleashing the Power of AI & ML in Enhancing Cloud Security

As cloud usage grows, data spreads across servers everywhere. This creates a real problem: traditional security tools cannot keep up with cybercriminals who move fast and adapt faster. AI and...

Using Machine Learning for Time Management: What Actually Works

I’ve tested seven “AI-powered productivity” tools over the past two years. Three were genuinely useful. Four were polished demos that didn’t survive contact with a real calendar. This post is...

The Future of Software Development: AI and DevSecOps

Welcome to the intersection of DevSecOps and artificial intelligence (AI) in software development. This post explores how AI fits into the DevSecOps landscape and how teams handle modern software development challenges.

Harnessing Generative AI for Efficiency in DevOps

Cloud services run fast, and when they don’t, customers leave. That’s the reality of running anything online today. Downtime costs money. Latency costs customers. If you’ve ever watched your error...

Power BI Data Mastery: The Stuff They Don't Cover in the Tutorials

Every Power BI tutorial shows you the happy path. Click here, drag there, your chart appears. That works fine until you’re staring at a 50-million-row dataset at 2am, your refresh...

Metaverse as a Service: Building Virtual Business Platforms in 2023

I spent three months last year evaluating metaverse platforms for a client in the healthcare training space. We were deciding between building on Decentraland, rolling our own WebGL environment, or...