Sign in to view Arvind’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Arvind’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
San Francisco, California, United States
Sign in to view Arvind’s full profile
Arvind can introduce you to 8 people at Tabsdata
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
5K followers
500+ connections
Sign in to view Arvind’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Arvind
Arvind can introduce you to 8 people at Tabsdata
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Arvind
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Arvind’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Articles by Arvind
-
Part 4: Governance Without the Overhead – Compliance without the friction
Part 4: Governance Without the Overhead – Compliance without the friction
🔹 This article is part of the ongoing series: How Pub/Sub for Tables Fixes What Data Pipelines Broke. The Governance…
38
1 Comment -
Part 3: Building Data Products: Turning raw data into governed, reusable assetsApr 28, 2025
Part 3: Building Data Products: Turning raw data into governed, reusable assets
🔹 This article is part of the ongoing series: How Pub/Sub for Tables Fixes What Data Pipelines Broke. The Promise of…
56
1 Comment -
Part 2: Enabling Data Contracts: Creating transparency & accountabilityApr 14, 2025
Part 2: Enabling Data Contracts: Creating transparency & accountability
🔹 This article is part of the ongoing series: How Pub/Sub for Tables Fixes What Data Pipelines Broke. Why Data…
49
2 Comments -
What is Pub/Sub for Tables?Apr 10, 2025
What is Pub/Sub for Tables?
Pub/Sub for Tables redefines the publish-subscribe model by making tables the fundamental unit of publication and…
60
5 Comments -
Part 1: Simplifying Data Engineering — Freeing teams from pipeline firefightingApr 7, 2025
Part 1: Simplifying Data Engineering — Freeing teams from pipeline firefighting
🔹 This article is part of the ongoing series: How Pub/Sub for Tables Fixes What Data Pipelines Broke. The Problem:…
69
4 Comments -
How Pub/Sub for Tables Fixes What Data Pipelines BrokeMar 31, 2025
How Pub/Sub for Tables Fixes What Data Pipelines Broke
🔗 If you’re new to the concept, start with “What is Pub/Sub for Tables?” for a quick primer, then dive back here. Data…
131
10 Comments
Activity
5K followers
-
Arvind Prabhakar shared thisA few of my thoughts on Daniel's post below. Tabsdata materializes the full dataset before writing to destination. This is a prerequisite for correctness: joins, aggregations, transactional consistency, data quality, lineage, replay. It also happens to be exactly how modern data platforms like Snowflake, Databricks, AWS and more prefer to receive data: large Parquet files, bulk transfers, columnar writes. That architectural alignment is why the performance numbers look the way they do. The benchmarks are also honest about where Airbyte holds its own. What is not covered here: incremental loads, CDC, transformations. More to come.Arvind Prabhakar shared thisThe data is in! Tabsdata is up to 86x faster than Airbyte! 🚀 Over the past several weeks, I've been working on a really exciting benchmarking project comparing Tabsdata performance to other popular data integration tools. The first leg of this journey explores how Tabsdata and Airbyte compare when extracting and loading full data refreshes across different sources and destinations. Although extract and load is only a small subset of what Tabsdata can do, I wanted to see how it performs when restricted to point-to-point data movement, workflows that Airbyte specializes in. As the benchmarks show, Tabsdata vastly outperforms Airbyte, while remaining lightweight, offering complex data transformation capabilities, and providing automatic data versioning, lineage, and observability out of the box. There's still a lot more to explore: incremental CDC, ETL, testing more sources and destinations, and more. I am also creating Terraform scripts so anyone can run and validate these benchmarks. Read all about it in my blog post. https://lnkd.in/guHKDQTY
-
Arvind Prabhakar shared thisThe best data conversations happen over a cold one. If you're in the Portland area, don't miss this - good people and real talk about data engineering. RSVP and join Daniel and the team!Arvind Prabhakar shared thisHi everyone! For folks in the Portland area, I wanted to announce the second PDX Data Engineering Happy Hour which I'll be hosting at Binary Brewing (Beaverton Location) on March 19th @ 6:30 PM. Similar the our last event, it'll be informal and we'll have a free bar tab (courtesy of Tabsdata). Come by, grab some drinks or food on us, and chat with other people in the Portland data community. If you're interested, only requirement is just to RSVP through the link below https://luma.com/c133sajuPDX Data Engineering Happy Hour (March Edition) · LumaPDX Data Engineering Happy Hour (March Edition) · Luma
-
Arvind Prabhakar posted thisData is corrected after it moves. Categories are redefined and records are restated long after pipelines have done their job. Downstream systems keep operating on assumptions that are no longer true. I call this truth-drift.
-
Arvind Prabhakar posted thisThe modern data stack is really a compensation stack for pipeline-centric design. Most of our time and tooling go into rebuilding trust after data already moved. We normalized the lie that reliable transport is enough. It isn’t.
-
Arvind Prabhakar posted thisETL systems fail in more ways than people admit. Gaps between jobs, failures that force reruns, and shifts in execution order all create phases where data is updated but not logically consistent. Anyone who runs a data platform has seen this. Tucu (Alejandro Abdelnur) just wrote a clear breakdown of why this happens in modern stacks. He walks through execution plans, transactional boundaries across domains, and what it takes for downstream systems to see one coherent state. It is a short practical read and the first article in our new Tabsdata publication. Worth a look if you want cleaner refresh cycles and fewer surprises in your platform. More info in comments.
-
Arvind Prabhakar shared thisELT vs ETL: Most teams today follow the ELT model, where data is first loaded into a platform and then transformed inside it. ELT replaced ETL largely because transformations became impractical outside data platforms as data sources multiplied. Joining and shaping data across APIs, files, and other stores required complex logic for boundaries, retires, and failure handling. By shifting transformations downstream, teams simplified ingestion. ETL originally performed these steps inline, pushing SQL down to the source systems for efficient and consistent results. The trade-off has become clear. ELT fragments ownership and inflates cost. Ingest tools, transformation orchestrators, observability layers, and semantic modeling layers all exist to compensate for the loss of context between extraction and transformation. The outcome is slower refresh cycles, inconsistent metrics, and rising compute and storage expenses. Each layer adds friction, erodes accountability, and weakens trust in data. What started as a practical workaround has hardened into an architecture that drains resources and delays decisions. Tabsdata restores balance by bringing ETL back to its architectural roots. It unifies extraction, transformation, and load in a single declarative flow that scales horizontally across serverless infrastructure. Transformation occur as data is collected and results propagated to any destination. This reduces latency, cost, and operational complexity while preserving the shared context between data producers and consumers. It is the simplicity of classic ETL, now built for the modern data stack.
-
Arvind Prabhakar shared thisIt is common for different teams in the same organization to report different values for the same metric. Ask Finance and Operations for "net revenue by region last quarter," and you may get two different numbers. Add an AI assistant to the mix and you might get a third. Each team believes it is using the same data, yet the results diverge. This inconsistency erodes confidence and makes it difficult to align on business performance and next steps. The divergence rarely comes from data itself. It arises from how data is accessed and interpreted. Business terms lose precision because of misaligned semantics between domains, and multiple copies of data obscure the truth, blindsiding queries that would otherwise be correct. Metadata dilution strips away key details, introducing inconsistencies that accumulate over time. With uneven update timings, lack of dependency management, and unsynchronized copies, what started as a single source of truth fractures into several partial truths. Tabsdata prevents this by ensuring every dataset refresh propagates instantly and consistently across all dependencies in the correct idempotent order. Semantics and metadata remain intact, preserving a shared meaning of each dataset and all downstream results. And when differences do appear, Tabsdata's built-in lineage shows exactly where and why they occurred. The result is a foundation of trust and alignment that allows Finance, Operations, and every other domain to move with clarity and speed. Cross-functional consistency becomes the hallmark of agile, high-efficiency business.
-
Arvind Prabhakar shared thisStreaming and batch processing solve different problems. Streaming handles ordered, unbounded events that flow between applications. Batch handles unordered, bounded datasets designed for completeness and reproducibility. Streaming fits application integration and event-driven systems. Batch fits data integration or ETL. They are not interchangeable, and forcing one to act as the other creates unnecessary complexity. Many enterprises use streaming tools such as Kafka or Flink for data integration, assuming continuous flow will improve freshness. In reality, streaming semantics depend on delivery order, windowing, and timing assumptions that complicate analytics and require bespoke protocols between disconnected systems. What starts as a shortcut to speed often turns into an operational burden. The right way to gain speed in data integration is through instant ETL, where each dataset refresh propagates instantly and consistently. Tabsdata brings Pub/Sub semantics to datasets and delivers instant ETL for those who need speed without compromising consistency or trust. Streaming belongs where order defines correctness. Batch belongs where completeness defines trust. Mixing them serves neither well.
-
Arvind Prabhakar shared thisIt was great to be at TechCrunch this week and to meet so many teams who stopped by our booth to learn more. Most conversations centered on one concern: the rising cost of managing data across too many tools. Ingestion, orchestration, observability, data quality, semantic, and metadata tools each come with their own licenses and consumption models. The more the utilization, the faster the costs grow. Many teams turn to open source frameworks to control costs and stay flexible. These are powerful foundations, but as systems expand, maintenance and evolving best practices require dedicated expertise. The effort to manage change, dependencies, and failures often outweighs the savings. Add in wasted compute, redundant storage, unreliable governance, and the true cost becomes clear. The highest cost appears when data is not reproducible. Insights lose credibility, compliance becomes uncertain, and decisions drift from fact. Tabsdata brings all data operations - from ingestion to semantic cohesion - into one integrated platform. By removing redundant systems and operational overhead, Tabsdata delivers instant ETL and ensures every dataset is consistent, reproducible, and ready to use.
-
Arvind Prabhakar reacted on thisArvind Prabhakar reacted on thisWhen Max Schireson joined Battery Ventures over a decade ago, the plan was for him to come in as an EIR and start another killer database company. I had already seen him grow MongoDB from 0 to tens of millions while I was on the board there, so everyone expected round two. Fortunately for us, he took a liking to the investing side and it's a great honor to officially name him as partner! Over the last 10+ years, I've watched Max sit with the founder of Databricks and go deep on open source metrics, then pivot to debating astrophysics and nuclear energy with a room full of engineers - and everyone walks away feeling like they were talking to one of their own. He's got this amazing versatile and broad spectrum paired with real operator instincts for helping companies navigate financing, growth, which a lot of people really enjoy, especially the founders who work with Max. As AI opens up massive oppty's in deep tech - foundation models, robotics, quantum, neuromorphic computing - Max has found an amazing cohort of founders - Fundamental, Quantum Art, Reflection AI and others building world-class foundational platforms in these spaces. We're excited to have him focused on these deeper tech ideas that we believe will potentially be game changers over the next 5-10 years!
-
Arvind Prabhakar reacted on thisArvind Prabhakar reacted on thisA little over a month ago, I wrapped up five years at Snowflake. What a five years it was. I had the privilege of building and leading the Customer Experience Engineering (CXE) team, surrounded by some of the brightest, most driven people I've ever worked with. Over the past year alone, the CXE team fundamentally reimagined how Snowflake's CX organization operates with AI, delivering substantial gains in customer self-service and efficiency. I couldn't be more proud of what we built together. But every great chapter eventually leads to a new one. Back in late February, I took the leap and began co-founding a company alongside my longtime friend and colleague Linden Hillenbrand. In the first week of March, we officially incorporated as CustOS AI (pronounced 'KOOS-tohs'). Our mission: helping enterprises transform their Customer Operations with AI and data. It's an intersection I've spent the last 15 years hyper-focused on within Cloudera and then Snowflake. I’m ready to bring this to the broader market. Since day one, we've been heads-down meeting with leaders across our networks, connecting with potential design partners, pressure-testing our thesis, and sharpening our thinking with every conversation. The energy has been incredible. If you’re interested in learning more, please don’t hesitate to reach out. While I know this journey won’t be easy, I genuinely believe we're working on a problem that can drive real impact for the market. This is just the beginning. More to come. 🚀
-
Arvind Prabhakar liked thisArvind Prabhakar liked thisManufacturing is entering its most important decade in 50 years. And AI will determine who leads it. Today marks one year since I became the CEO of QAD | Redzone. When I joined, I had a simple belief: Manufacturing doesn’t need more software. It needs systems that act. Factories today are navigating unprecedented complexity — tariffs, labor shortages, supply chain volatility, and margin pressure. And now AI is reshaping how decisions get made. The old model — systems that simply record what happened — is no longer enough. Manufacturers need platforms that help them sense faster, decide faster, and act faster. That belief has shaped everything we’ve done this past year. In 12 months we have: • Launched Champion AI — an agentic AI platform purpose-built for manufacturing • Unified Adaptive ERP and Redzone Connected Workforce around a Systems of Action strategy • Accelerated our pace of execution across the company But the biggest shift this year hasn’t been technology. It’s been clarity. Manufacturing is entering a defining decade. The winners will not be the companies with the most dashboards. They will be the companies that can turn insight into action the fastest. That’s the future we are building. To our employees, customers, and partners — thank you for believing in this journey and for all the support you have given me. Year One was about focus. Year Two is about acceleration. Manufacturing is at a once-in-a-generation inflection point. And we intend to lead it. #Manufacturing #AI #SystemsOfAction #ChampionAI #Leadership Bryan Reimer John Dyck Jeff Winter Jake Hall Chris Luecke Allison Roberts Grealis Matthew Littlefield Michael Rowe Joe Sullivan Yvonne Genovese mikeroweWORKS Eric Kimberling Mark Vigoroso, MBA "R "Ray" Wang Holger Mueller Patrick Moorhead Robert Kramer National Association of Manufacturers - NAM CESMII #MAPI
-
Arvind Prabhakar liked thisArvind Prabhakar liked thisI’m incredibly honored to be selected for the HITEC 2026 Emerging Executive Program (EEP)! Looking forward to a year of growth alongside 100 top-tier technology leaders from 70 organizations. A huge thank you to my team at Capital One for supporting the nomination and growth in my leadership journey! #HITEC #EEP2026 #TechLeadership #ExecutiveDevelopment
-
Arvind Prabhakar liked thisCome for the context engineering, stay for the unexpectedly strong opinions about the Oshawa Generals.Arvind Prabhakar liked thisWe’re proud to be at #CDAOCanada 2026, joining Canada’s leading data and analytics executives to explore how data, AI, and governance are driving innovation and smarter decision-making. Catch our speaker, Dima Spivak and be part of the conversation. Learn more: https://ibm.co/6043EDZBX
-
Arvind Prabhakar reacted on thisArvind Prabhakar reacted on thisMy Career Is My Mom's Cup of Tea On this Women's Day, I find myself without the most important woman in my life. My mother left us on January 21st of this year, and the world has felt differently weighted ever since. Tomorrow, as the world honors women everywhere, I want to honor just one — quietly, personally, with the full weight of everything she meant to me. In 1991, I was fighting for an engineering seat against some of the most brutal competition India has to offer. I was never the most talented in the room — but I was relentless, and she made sure I stayed that way. Night after night, when I would push through until the early hours, she would appear at exactly the right moment - just when I was about to fall asleep — not because of an alarm, but because she simply knew — with a cup of tea, with her presence, with words that kept me going When I finally fell asleep, she would quietly put my books in order, as if preparing the battlefield for the next day's fight. That was the kind of love she gave — precise, selfless, and always on time. The career that followed was built on a foundation she laid in those quiet pre-dawn hours. Every moment I have chosen service over comfort — that was her. I was on a flight to say goodbye on January 21st, and I didn't make it in time. But I've come to understand that with a love like hers, the goodbye was never really the point. The point was every cup of tea. Every book she placed back in order. Every morning, she made it possible. Happy Women's Day, Maa. The most important woman in my life — then, now, and always. Your's always - Raju
Experience & Education
-
Tabsdata
********** * ***
-
****** ********* ** *********** ******
** **** undefined undefined
-
***** ***** **********
** undefined
View Arvind’s full experience
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
View Arvind’s full profile
-
See who you know in common
-
Get introduced
-
Contact Arvind directly
Other similar profiles
Explore more posts
-
Sharad Kumar
FluidCloud • 18K followers
❤️ Loved this from Harshit Omar Cloud outages aren’t anomalies; they’re warnings. Most enterprises are architected as if hyperscalers are “too big to fail,” but recent outages have proven the opposite. Resilience needs a new baseline: Can you run somewhere else when your provider falters? That’s the promise of #CloudCloning and true #multicloud optionality. Big thanks to MR Rangaswami Sir for the conversation. Worth the read.
30
1 Comment -
Utsav Pandya
Tesla • 2K followers
📰 OpenAI & Intuit Sign $100M+ Partnership Deal 🔗 https://openai.com/news/ 📝 Summary: Intuit has signed a $100 million+ deal with OpenAI to integrate ChatGPT into its applications, expanding enterprise use cases and enabling seamless AI integration across Intuit's product suite for millions of users. 💭 Sentiment: Positive - Major enterprise partnership demonstrates strong commercial adoption and revenue growth potential for OpenAI. 🏢 Companies: @OpenAI @Intuit #AI #GenerativeAI #TechNews #Tech
-
Damodar Periwal
2K followers
I recently read an excellent piece by Jiquan Ngiam on PulseMCP about 𝘝𝘪𝘳𝘵𝘶𝘢𝘭 𝘔𝘊𝘗 𝘚𝘦𝘳𝘷𝘦𝘳𝘴 𝘢𝘯𝘥 𝘎𝘢𝘵𝘦𝘸𝘢𝘺𝘴. It highlights a powerful idea: 𝗔𝗜 𝗮𝗴𝗲𝗻𝘁𝘀 𝘄𝗼𝗿𝗸 𝗯𝗲𝘀𝘁 𝘄𝗵𝗲𝗻 𝘁𝗵𝗲𝗶𝗿 𝗮𝘃𝗮𝗶𝗹𝗮𝗯𝗹𝗲 𝘁𝗼𝗼𝗹𝘀𝗲𝘁 𝗶𝘀 𝗶𝗻𝘁𝗲𝗻𝘁𝗶𝗼𝗻𝗮𝗹𝗹𝘆 𝗰𝗼𝗻𝘀𝘁𝗿𝗮𝗶𝗻𝗲𝗱 𝗮𝗻𝗱 𝗰𝘂𝗿𝗮𝘁𝗲𝗱. Instead of giving agents dozens of tools, you provide only the narrowly scoped capabilities required for the task—significantly improving reliability, safety, and correctness. This “curated tool surface” approach aligns closely with a challenge many teams face when connecting AI systems to 𝘀𝘁𝗿𝘂𝗰𝘁𝘂𝗿𝗲𝗱 𝗱𝗮𝘁𝗮. Raw SQL access exposes too much complexity: schemas, joins, permissions, inference mistakes, autogenerated SQL, and unpredictable query behavior from LLMs. This is exactly the problem we designed 𝗢𝗥𝗠𝗖𝗣 to address. 𝗢𝗥𝗠𝗖𝗣 𝗶𝘀 𝗮 𝗠𝗼𝗱𝗲𝗹 𝗖𝗼𝗻𝘁𝗲𝘅𝘁 𝗣𝗿𝗼𝘁𝗼𝗰𝗼𝗹 (𝗠𝗖𝗣) 𝘀𝗲𝗿𝘃𝗲𝗿 𝘁𝗵𝗮𝘁 𝗽𝗿𝗲𝘀𝗲𝗻𝘁𝘀 𝗿𝗲𝗹𝗮𝘁𝗶𝗼𝗻𝗮𝗹 𝗱𝗮𝘁𝗮 𝘁𝗼 𝗔𝗜 𝘀𝘆𝘀𝘁𝗲𝗺𝘀 𝘂𝘀𝗶𝗻𝗴 𝗼𝗯𝗷𝗲𝗰𝘁-𝗼𝗿𝗶𝗲𝗻𝘁𝗲𝗱 𝗯𝘂𝘀𝗶𝗻𝗲𝘀𝘀 𝗲𝗻𝘁𝗶𝘁𝗶𝗲𝘀 𝗶𝗻𝘀𝘁𝗲𝗮𝗱 𝗼𝗳 𝘁𝗮𝗯𝗹𝗲𝘀 𝗮𝗻𝗱 𝗦𝗤𝗟. Instead of exposing an entire schema—and hoping the model infers relationships correctly—you expose 𝘰𝘯𝘭𝘺 𝘵𝘩𝘦 𝘴𝘱𝘦𝘤𝘪𝘧𝘪𝘤 𝘣𝘶𝘴𝘪𝘯𝘦𝘴𝘴 𝘦𝘯𝘵𝘪𝘵𝘪𝘦𝘴 𝘺𝘰𝘶 𝘤𝘩𝘰𝘰𝘴𝘦, with clear boundaries and predictable behaviors. This naturally creates its own form of 𝘥𝘢𝘵𝘢-𝘭𝘦𝘷𝘦𝘭 curation: ● AI agents don’t compose SQL. ● They never see tables directly. ● They only interact through stable, object-oriented access patterns. ● All data access passes through a controlled, schema-aware, MCP interface. And because ORMCP uses 𝗴𝗲𝗻𝗲𝗿𝗶𝗰, 𝗽𝗮𝗿𝗮𝗺𝗲𝘁𝗲𝗿𝗶𝘇𝗲𝗱 𝗠𝗖𝗣 𝘁𝗼𝗼𝗹𝘀, you don’t need a separate tool for every entity: ● One query tool handles all classes ● One insert tool handles all classes ● One aggregate tool handles all classes ● One update/delete tool handles all classes All while honoring strict, preconfigured schema boundaries. In effect, ORMCP complements the PulseMCP idea from another angle: 𝗧𝗵𝗲𝘆 𝗰𝘂𝗿𝗮𝘁𝗲 𝘁𝗼𝗼𝗹𝘀; 𝗢𝗥𝗠𝗖𝗣 𝗹𝗲𝘁𝘀 𝘆𝗼𝘂 𝗰𝘂𝗿𝗮𝘁𝗲 𝗱𝗮𝘁𝗮 𝘀𝘁𝗿𝘂𝗰𝘁𝘂𝗿𝗲𝘀. Together, these patterns help AI systems act more deterministically and safely when working with real enterprise databases. If you're building AI agents, RAG systems, or autonomous workflows that need to interact with PostgreSQL, MySQL, SQL Server, Oracle, SQLite, or any JDBC source—this might be worth exploring.
13
2 Comments -
Pavan Tikkani
Cytiva • 954 followers
🎉 The final installment of Synthetic Data and Fine Tuning a LLM series is live, and it's all about the results. In Part 4, we dive deep into evaluating our fine-tuned HuggingFaceTB/SmolLM-360M for the medical symptom checker task. After generating synthetic data with fastdata (Part 1), probing the base model (Part 2), and walking through the fine-tuning process (Part 3), this post answers the big question: Did it work? Part-1 : https://lnkd.in/gvWE-yvu Part-2: https://lnkd.in/g6Ck6KjK Part-3: https://lnkd.in/gekJjydP You'll see: 🔹 A side-by-side comparison of the base vs. fine-tuned SmolLM responses to our challenging medical prompts. 🔹 A detailed analysis of how targeted synthetic data dramatically improved performance, especially in critical scenarios (like handling potential heart attack symptoms!). 🔹 Discussion on what the model learned, where it still has room to grow, and the impact on its general capabilities (hello, catastrophic forgetting?). 🔹 Key takeaways from this entire end-to-end fine-tuning project. So grateful for everyone who's followed along! Check out the transformation and the final analysis: https://lnkd.in/gRUam2qt This series has been a fantastic learning journey, and I hope it provides some useful insights for your own LLM adventures! #LLMResults #FineTuningSuccess #ModelEvaluation #SyntheticDataImpact #SmolLM #MachineLearning #AI #NLP #TechBlog #DataScience #ArtificialIntelligence #MedicalAI
3
-
Matthijs de Vries
Nuklai • 2K followers
Just read “Why Metadata Is the New Interface Between IT and AI” by Krishna Subramanian on BigDataWire. Great insights from the author about how metadata is becoming the bridge between raw systems and intelligent models. Rather than forcing huge document ingestion or monolithic pipelines that need to stay in sync and strain your infra, metadata gives AI a map to work with, letting models pick what matters rather than going over all the data blindly. At Nuklai, we built Nexus with exactly that philosophy: - Leverage metadata-driven retrieval so models use context, not bulk data from slow and imperfect RAG systems. - Connect the data instead of ETL or duplication - Provide traceability and governance so outcome can be verified - Allow natural language queries that are efficient and trustworthy When metadata is the interface, AI finally gets context and trust. (Link in comment) #NuklaiNexus
28
3 Comments -
Andrew Moreland
Chalk • 7K followers
Last quarter, we shipped query tracing. Every resolver and model call is now instrumented and timed. The trace shows you exactly where time is spent in your query execution - which resolvers ran, what they called, how long each piece took. You can enable it per query with --trace in the CLI, or configure sampling to run automatically in production. For high-volume systems, sampling gives you continuous visibility without adding overhead. When you have a tight latency budget, you need to know where every millisecond goes. Read more in our product update in the link below.
34
1 Comment -
Shravan Belagalmath
Vayavya Labs Pvt. Ltd. • 604 followers
Agentic AI role is exponentially increasing in software development lifecycle. Janus is an agentic tool developed by Vayavya Labs that automates part of this lifecycle for code compliance. For more information meet us at nasscom ai
49
4 Comments -
Susanta Ghosh
JPMorganChase • 2K followers
Today let's talk about Parallel Fan Out/ Concurrent Agentic Design pattern and when parallel agents go rouge What’s the Concurrent Orchestration Pattern? Imagine a scenario where multiple AI agents—each with its own lens or specialty—tackle the same task simultaneously. Instead of a single, step-by-step chain, tasks fan out to different agents in parallel. Then their outputs are merged or aggregated for the final answer. It’s the AI equivalent of a brainstorming session where everyone chips in together. This pattern thrives when you need diverse insights or speed—think ensemble reasoning or reaching a verdict faster. This pattern is commonly used in agentic RAG. Here’s how you can apply it in a Retrieval-Augmented Generation (RAG) system: Step 1: Break the user’s query into smaller sub-queries (e.g., “Define concept X,” “List use cases,” “Give examples”) and map out their dependencies. Step 2: Run those sub-queries in parallel—agents fetch context or process each in isolation (e.g., document retrieval, summarization, external tool usage). Step 3: Once all agents have results, aggregate the findings into a single, coherent response. This technique lets you parallelize independent parts, reducing latency while maintaining clarity—especially effective when sub-queries don’t depend on each other. Super Important : Try to avoid this pattern and use sequential execution even if it's slow, but it might yield a good result, below is the reason As Cognition [the company behind devin] warns in a famous blog post “Don’t Build Multi-Agents”, things can go sideways fast if agents don’t share context or coordination is weak. When agents operate in isolation—making decisions based solely on their own view—the final result might be fragmented, contradictory, or just plain incoherent. Think two agents building different puzzle pieces that don’t fit. The core issues: Context fragmentation: Each agent works in a silo, leading to mismatched assumptions. Implicit decisions: Agents’ outputs reflect unspoken choices that may clash when merged. Coordination complexity: Without strong orchestration, integration becomes error-prone. References : 1. Concurrent Execution Pattern : https://lnkd.in/gZybgm2s 2. LLM Compiler whitepaper : https://lnkd.in/gWwGJhbi 3. Coginition blog Don't build multi agents which shows parallel execution can yield agent drift : https://lnkd.in/gkDKC4TP
14
3 Comments -
Prashanth Bhat
IBM • 670 followers
Enjoyed collaborating with Josephine Joyce on this DZone article: https://lnkd.in/gAhNzXd9 . In Performance-Focused Platform Engineering, we explore why platform teams must evolve beyond enablement and start engineering for measurable performance outcomes. Curious to hear how others are thinking about this. #PlatformEngineering #DevOps #SRE #CloudNative #IDP
8
-
Anshuman Jaiswal
OnePint.ai • 9K followers
Proud of this one! Devadas (Das) Pattathil and Sree Sreedhararaj at IPSY put together a really honest look at what it takes to manage #inventory at scale—millions of subscriptions, thousands of SKUs, and multiple channels, all needing a single source of truth. Lots of lessons here for any brand juggling subscriptions, ecommerce, and growth. #InventoryPlanning #RetailOps #SupplyChain #CustomerStory #ClientSuccess #Forecasting #AIForRetail #RetailInnovation
13
-
Pranay Prateek
SigNoz • 29K followers
Big week for SigNoz! We just wrapped up Launch Week 4.0, shipping 5 never-seen-before features built on top of OpenTelemetry. From deep Temporal workflow observability to real-time API monitoring and the industry’s first tracing funnels, these updates make debugging and monitoring dramatically easier. Curious how we’re pushing the boundaries of OTel-native observability? 👇 Full recap in the first comment.
26
2 Comments -
Harshil Vyas
Anyscale • 3K followers
"Every engineer is a Data/ML Engineer... if the Infra is right!" 🚀 Srikanth Sundarrajan's opening slide was a mic-drop moment that had every Platform and Infra engineer in the room (including me!) nodding in agreement. It perfectly captures our own (Anyscale) core mission: to build infrastructure so seamless it empowers everyone to innovate with data, AI & ML. InMobi's Encore'25 provided a fantastic look under the hood of their AI/ML infrastructure. For anyone obsessed with building and scaling machine learning systems, this was gold. Three things that stood out: Cost-Effective Model Training: Deep dive into their strategies for optimizing large-scale training jobs, intelligently leveraging spot instances and right-sizing resources without compromising on model development velocity. Real-Time Feature Stores: A key focus was their approach to building and maintaining a feature store that serves up-to-date data with single-digit millisecond latency, which is critical for their real-time bidding (RTB) models. The Rise of GenAI Infra: It was exciting to hear about their early infrastructure bets for supporting Generative AI and LLMs, particularly for dynamic creative optimization. This signals a major shift in the computational demands for the next generation of AdTech. Kudos to the InMobi engineering teams for their transparency and for building a truly impressive, high-performance ML platform. #AI #ML #Infrastructure #Encore25 #InMobi #MLOps #FeatureStore #GenAI #CloudComputing #AdTech
278
5 Comments -
Priyadarshi Das
Barclays • 2K followers
Why Guardrails Are Critical for Safe AI Adoption Generative AI has moved from a research novelty to an everyday reality. Tools like ChatGPT, LLaMA and Gemini are redefining how we write, code and create. But with great power comes great responsibility and that’s where guardrails step in. LLMs are brilliant, but they’re not flawless. They can hallucinate facts, amplify bias, leak sensitive data or even generate toxic content. Left unchecked, these risks can lead to misinformation, legal troubles or loss of user trust. So, what are guardrails? Think of them as AI safety layers programmable rules and systems that monitor and control what LLMs produce. They act during user interaction, filtering unsafe inputs and outputs in real time. A good guardrail doesn’t just block bad content; it ensures outputs are accurate, ethical and compliant. Here’s how the ecosystem looks today: ✅ Llama Guard (Meta) – Fine-tuned on the LLaMA architecture to classify content into predefined safety categories. Flexible for different use cases. ✅ NVIDIA NeMo Guardrails – Uses structured conversational flows with KNN-based intent matching and moderation to keep conversations on track. ✅ Guardrails AI – Adds type and format constraints (e.g., enforcing JSON structures), plus corrective prompting if outputs fail checks. ✅ TruLens – Focused on evaluation and feedback loops to improve context relevance, groundedness, and fairness in RAG-based apps. ✅ Guidance AI & LMQL – Programming approaches that combine logic and generation, letting developers enforce constraints using regex, control flows and real-time checks. Why does this matter? Because attackers are getting creative. Jailbreak attacks, prompt injections, and adversarial queries can still bypass basic safeguards. Some even exploit language patterns or use multilingual prompts to trick models into producing harmful content. Building strong guardrails isn’t just about adding filters it’s about system design. We need: ✔ Multi-disciplinary strategies combining neural and symbolic reasoning ✔ Continuous monitoring and evaluation ✔ Integration across the AI development lifecycle, like safety-critical systems in aviation or automotive (think ISO 26262) The ultimate goal is trust. Businesses, governments, and users will only adopt AI at scale if they believe it’s reliable, fair and safe. Investing in guardrails is not an afterthought it’s the backbone of responsible AI. What's Next? Guardrails need to evolve beyond simple filters. The future lies ✔ Neural + Symbolic Systems working together for deep reasoning ✔ Lifecycle integration-from design to deployment (think ISO 26262 for Al) ✔ Continuous monitoring to detect new attack patterns and biases #AI #LLM #ResponsibleAI #Guardrails #GenerativeAI #EthicsInAI #AITrust
1
-
Garyad Passmore
Thatdtailguy LLC • 141 followers
🚀 Exciting News Alert: Elevate Your System Performance with APEX_! 🚀Experience a remarkable transformation in mesh filtering by upgrading to the latest APEX_. Whether you’re tackling complex computational challenges, striving for efficiency, or pushing the boundaries of innovation, this upgrade is a game-changer for your tech ecosystem. 🌟 🔧 Say Goodbye to the Hassles: The new APEX_ upgrade provides seamless integration, enhanced functionality, and a user-friendly interface, ensuring a smoother experience than ever before. Wave farewell to the old issues and embrace solutions that redefine excellence in mesh filtering. 💡 Supercharge Innovation: Directly benefit from advanced features that not only optimize performance but foster an environment ripe for innovation. Imagine the possibilities when you can operate without bottlenecks or errors in your workflows. Get ready to unlock new levels of creativity and productivity! 📈 Stay Ahead of the Curve: By upgrading your mesh filtering with APEX_, you are ensuring that your system remains at the forefront of technology. In an ever-evolving digital landscape, staying ahead means continuous improvement and adopting state-of-the-art technology solutions. ✨ Why Upgrade to APEX_? • Leading-edge Performance Boosts 🚀 • Enhanced Precision & Accuracy 🔍 • User-centric Design 🖥️ • Streamlined Processes for Maximum Efficiency 📊 • Robust Support and Comprehensive Resources 🤝 🌐 Join the Community: Organizations around the globe are already experiencing the power of APEX_. Connect with a community that values precision, speed, and dependability. Share your experiences, learn new strategies, and see how others are benefiting from the upgrade. 🔔 Don’t Miss Out: Reserve your spot and transform how you handle mesh filtration using the APEX_ upgrade. Say hello to heightened efficiency and maximized results. Upgrade today and position your work at the cutting edge of technological advancements! 👉 Let’s revolutionize how you approach systems! Learn more and take the next step by visiting our website or contacting our team. Let's build the future together! #UpgradeYourSystem #MeshFilter #APEXUpgrade #Innovation #TechExcellence #Efficiency #PerformanceBoost --- ❇️ Engage with us in the comments! Share your thoughts and experiences. How has upgrading improved your workflow? Let us know, and let's grow this conversation together! #CommunityEngagement #TechDiscussion #NetworkingOpportunity #entrepreneur #autodetailing #thatdtailguy Thanks for watching! What Do You Want to see next? Want more? For the best deals on quality detailing products try ThatDetailShop.com Also, checkout my Amazon storefront: https://lnkd.in/eKqtzFSr Join the community: That Detail Group https://lnkd.in/ev--cXXs All links: linktr.ee/thatdtailguy
1 Comment -
Ashutosh Saxena
TorqueAGI • 10K followers
Physical AI needs more than compute—it requires resilience at scale and adaptability in the field. This audio recap shares lessons from Netflix’s fault-tolerant systems—chaos engineering, self-healing pipelines, and resilient edge-to-cloud design—informing TorqueAGI’s vision for AI that is not just intelligent, but robust, modular, and embodied. Aditya Jami Jonathan Siddharth Honglak Lee Samir Menon Dragomir Anguelov Jennifer Hoskins Jiquan Ngiam Oliver Cameron
26
-
Diptanu Gon Choudhury
Tensorlake • 4K followers
Excited to announce the official Tensorlake x LangChain integration! The Tensoralake tools equips Langraph Agents to use our Document Ingestion API with natural language, without any additional settings in the SDK, works with any LLM - OpenAI, Claude, Gemini, Qwen, etc! GitHub - https://lnkd.in/gzMHvHXM
41
2 Comments -
Rahul Madduluri
Doppel • 2K followers
Why did we build a simulation product at Doppel? And what are we trying to accomplish? From the early days, Kevin Tian and I set out to build the social engineering defense platform. In the last few years, we've built a world-class product that identifies and maps out social engineering attacks. Now, we're flipping the script and generating high-fidelity simulated attacks to prepare organizations for the age of AI. We're no longer just part of your blue team, we're part of your red team too. Read more here: https://lnkd.in/gYWHJUmi
71
4 Comments -
Eryn Muetzel
Tensor9 • 2K followers
We just published a new post on the Tensor9 blog about a problem that comes up in almost every vendor conversation we have: what happens when your application depends on a managed service like DocumentDB, and your customer can't run on AWS. https://lnkd.in/gzamYh2d
6
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top contentOthers named Arvind Prabhakar in United States
-
Arvind Prabhakar
San Diego, CA -
arvind prabhakar
Austin, Texas Metropolitan Area -
Arvind Shankar Prabhakar
San Francisco Bay Area -
Arvind Prabhakar
Ridgecrest, CA
4 others named Arvind Prabhakar in United States are on LinkedIn
See others named Arvind Prabhakar