Skip to main content

Posts

From Classroom Question to Enterprise Pattern: Rethinking AI Retrieval on Governed Data Platforms

    This started while I was preparing my Big Data labs for my students. I wanted something closer to reality. Not only pipelines and Spark jobs, but how AI agents would actually interact with a governed data platform. So I set the environment as it should be in a serious setup. A lakehouse governed with AWS Lake Formation, metadata centralized in AWS Glue Data Catalog, and Spark handling execution. Clean, controlled, auditable. Then I added the missing piece. An agent that needs to understand intent and retrieve context, not just run queries. And almost immediately, the same request appeared. “We need a graph database for ontology.” In AWS terms, that means Amazon Neptune. I see this pattern often, not only with students. Also in real projects. Someone comes with a solution already decided. I always give the same answer. What is the business problem you are trying to solve? Because “I need Neptune” is not a requirement. It is a conclusion. When you force the conversation back...
Recent posts

When architecture come to life outside of slides and tools.

There’s something powerful about seeing architecture come to life outside of slides and tools. Today I want to recognize Ahra, one of my students, who took the time to translate her understanding of our Big Data labs into a hand-drawn reference architecture. Not only is it correct in structure, but it reflects clarity of thought and ownership of the problem. From ingestion to Bronze using a dual pipeline pattern (batch and streaming), through data quality and standardization in Silver, to consumption-ready Gold for BI and AI use cases, she captured the full journey. Even more interesting, she extended the architecture into Labs 11 and 12, incorporating natural language AI agents capable of querying the data platform using skills, RAG, and process knowledge. This is exactly the mindset we aim for: not just using tools, but designing systems that solve real problems and enable business interaction through AI. Kudos to you, Ahra. This is the kind of thinking that builds real data architec...

The Next Phase of AI: From Experimentation to Value Realization

Over the past few years, artificial intelligence has moved from promise to pervasive investment. Across industries, organizations have committed significant resources to AI initiatives. New models, new platforms, new capabilities. The pace of innovation has been extraordinary. And yet, a new question is emerging. Not what can we build with AI, but what is actually delivering measurable value. This shift marks an inflection point. We are entering a new phase of AI maturity. One defined not by experimentation, but by accountability. The Illusion of Progress From the outside, the landscape looks impressive. More models. More pilots. More demonstrations of capability. But inside many organizations, a different reality is taking shape. AI initiatives often remain disconnected from measurable business outcomes. Value is assumed, but not always quantified. Success is described in terms of activity, not impact. This is not a failure of technology. It is a reflection of something de...

The Third Layer of the AI Story: Value Capture

Value Capture The Third Layer of the AI Story: Value Capture A reflection on how the AI economy is splitting between model builders and infrastructure enablers, and why long-term value will shift toward applications, agents, and business outcomes. In my previous reflections, I explored two dimensions of the current AI wave. First, the responsibility that comes with systems that act. Then, the unprecedented scale of infrastructure investment and its financial implications. There is now a third layer that completes the picture. The structure of value itself. We are starting to see a clear split in the AI economy. On one side, pure-play model companies are scaling rapidly. Revenue is growing at extraordinary speed, but so are losses, driven by compute, training, and inference. On the other side, hyperscalers and enablers are generating strong profits. But they are also committing to massive capital expenditure, with projections of $650–700 billion in AI infrastructure in 2026 al...

Hyperscaler AI capital expenditure versus slower and uncertain monetization timelines

  LinkedIn What we may be witnessing right now feels uncomfortably familiar. Not in the technology itself… but in the economics behind it. We are seeing hyperscalers committing extraordinary levels of capital, with projected AI-driven capex in 2026 approaching $650–700 billion annually, up from roughly $380–410 billion in 2025. In just one year, that is a 60–70%+ increase, with an estimated ~75% of that spend directly tied to AI infrastructure. Individually, the scale is even more striking: - Amazon alone is projected around $200B - Alphabet around $175–185B - Meta between $115–135B - Microsoft exceeding $100B+ run rate - Oracle adding another ~$50B This is no longer incremental investment. This is a structural capital shift at global scale. And this is where the parallel with the dot-com era becomes difficult to ignore. Back then, the belief was that the internet would transform everything. It did. But the timing of value creation and the scale of investment were completely misali...

AEPD has published guidance on agentic AI

  Original Post in LinkedIn I have been saying for a while that AI is no longer just about models… it is about systems that act. Now regulators are catching up. The Spanish Data Protection Agency (AEPD) has published guidance on agentic AI from a data protection perspective. And this is important. Because agentic AI is fundamentally different. These are not systems that just answer… they decide, interact, and execute autonomously to achieve goals. That changes everything. It changes how data is accessed, how decisions are made, and how responsibility is assigned. And one key message stands out: using AI does not remove accountability. Organizations remain fully responsible for data protection, transparency, and control. From an architecture perspective, this reinforces something we often underestimate. You cannot build agentic AI on top of weak data foundations. Governance, data minimization, traceability, and privacy-by-design are no longer compliance topics… they are cor...

The Invisible Builder and The Visible Translator

     Original Post In LinkedIn      Today I had a conversation with my boss, Soumya, someone I genuinely consider a mentor, and it led to a reflection that stayed with me. I asked him: Do you know who Dennis Ritchie is… without googling it? Do you know who Steve Jobs is? Then I told him: “I am Dennis Ritchie… and you are Steve Jobs.” And I meant it. Because there is a quiet truth in how impact is perceived. Not everyone remembers the inventor of the wrench. But everyone remembers the brand that made it known, usable, and desirable. Some of us operate deep in the foundations; building systems, architectures, and platforms that power everything else. Work that is critical, but often invisible. Others have the ability to translate that complexity, connect it to real human value, and bring it into the world in a way that people understand, adopt, and remember. The world doesn’t always reward creation alone. It rewards connection, storytelling, and adoption. But...