NEW: Scale AI Case Study — ~1,900 data requests per week across 4 business units Read now →

NEW: Scale AI Case Study — ~1,900 data requests per week across 4 business units Read now →

Informatica

Informatica is the legacy enterprise leader in data integration. Founded in 1993, the company built PowerCenter into the dominant ETL platform of the 1990s and 2000s and is now repositioning for the cloud era with IDMC.

Informatica is the legacy elephant of data integration. Founded in 1993 by Gaurav Dhillon and Diaz Nesamoney, Informatica built PowerCenter — the visual, server-based ETL tool that became the dominant way to move data between enterprise systems for two decades. If you worked in a Fortune 500 data team between 1998 and 2015, you almost certainly touched PowerCenter, and you almost certainly had opinions about it.

Plain English: Informatica is the platform that taught a generation of enterprises how to do ETL. It was the Oracle of pipelines: heavyweight, expensive, capable of anything, hated by analysts, loved by procurement departments, and embedded so deeply in corporate IT that ripping it out has historically cost more than keeping it. In 2026, Informatica is in the middle of an existential transition from its on-prem PowerCenter past to its cloud-native IDMC future, and as of May 2025 it is being acquired by Salesforce for roughly $8 billion — the most consequential M&A event in data integration in years.

Origin Story: Visual Pipelines for the Data Warehouse Era

In 1993, the modern data warehouse was a brand-new idea. Bill Inmon had just published his seminal book defining the term. Teradata was selling million-dollar appliances. Companies were beginning to ask the obvious next question: how do we get data from our SAP system, our mainframe, our AS/400, and our Oracle database into one place where we can analyze it?

The answer at the time was hand-written code — COBOL, PL/SQL, shell scripts — and it was a disaster. Every pipeline was a snowflake, undocumented, fragile, and owned by whichever engineer happened to write it. Gaurav Dhillon and Diaz Nesamoney saw the opportunity and founded Informatica in Silicon Valley with a simple thesis: make ETL a visual product, not a coding exercise.

PowerCenter, released in the late 1990s, was the product that delivered on the thesis. You designed mappings in a Windows GUI by dragging icons onto a canvas, connecting source tables to target tables through transformations (filters, joins, aggregators, lookups). The mapping was stored in a metadata repository. A scheduling server ran the jobs. A monitoring console showed you what failed.

This was a revelation in 1999. For the first time, ETL was something you could see and manage without reading code. Banks adopted it. Insurance companies adopted it. Government agencies adopted it. Informatica went public on NASDAQ in 1999, became the standard pick in every Gartner Magic Quadrant, and rode the data warehouse wave through the 2000s. In 2015, the company was taken private by Permira and CPP Investment Board for $5.3B. It went public again in October 2021 on NYSE under the ticker INFA.

Then in May 2025, Salesforce announced it would acquire Informatica for approximately $8B in an all-cash deal. The stated rationale: bolt enterprise data integration, governance, and metadata onto Salesforce's Data Cloud platform, particularly to feed Agentforce and the broader AI strategy. The acquisition closed with surprisingly little fanfare in the data community, partly because most modern data teams had already mentally written Informatica off as a legacy concern.

What's Actually in the Box

Informatica's product line is enormous. The two things that matter:

PowerCenter (legacy on-prem). The classic ETL platform. Visual mapping designer. Metadata repository. Workflow orchestrator. Adapters for everything — SAP, Oracle, DB2, Teradata, mainframes, files, message queues. Still installed in thousands of large enterprises. Still generating substantial maintenance revenue. Still hated by every analyst who has to wait two weeks for an ETL team to add a column.

Intelligent Data Management Cloud (IDMC). Informatica's cloud-native rewrite, branded as a unified platform that includes:

  • Cloud Data Integration (CDI) — the modern equivalent of PowerCenter, running as a managed cloud service.
  • Cloud Data Quality — profiling, standardization, and rule-based cleansing.
  • MDM (Master Data Management) — the canonical "single source of truth for customers/products/etc." product, where Informatica is genuinely a category leader.
  • Cloud Data Governance and Catalog — metadata management, data lineage, business glossary.
  • CLAIRE — Informatica's metadata-driven AI engine, used for automated mapping suggestions, data quality recommendations, and now LLM-style natural language interfaces.
  • API & Application Integration — the iPaaS pieces, competing with MuleSoft and Boomi.

The IDMC pitch is completeness: integration, quality, governance, MDM, cataloging, and APIs in one platform. The pitch is also Informatica's biggest weakness, which we'll get to.

The Honest Take

The conventional wisdom on Informatica in the modern data community is brutal: it is old, expensive, complicated, and being eaten by Fivetran + dbt. Most of this is true, but it is not the whole story.

Where Informatica is genuinely strong:

  • Master Data Management. Informatica MDM is one of two or three credible products in the world for this. If your enterprise needs a centralized customer master across 40 systems with deterministic and probabilistic matching, this is the category leader.
  • Data governance and lineage in regulated industries. Banks, insurers, and healthcare companies that have to prove to regulators where every data point came from buy Informatica because it has been doing this for 25 years.
  • The PowerCenter installed base. Tens of thousands of organizations have PowerCenter mappings in production that nobody fully understands. Informatica's path to keep collecting maintenance revenue on these is essentially monopolistic.
  • CLAIRE-driven AI features. Whatever you think of the marketing, Informatica's metadata-rich foundation gives them a real head start at automated mapping and natural-language data integration — and the Salesforce acquisition is meant to amplify this.

Where Informatica is genuinely weak:

  • Greenfield wins. Almost no startup or scaleup chooses Informatica voluntarily. It is not on the modern data stack consideration set.
  • Pricing. Informatica deals are six or seven figures, negotiated like Oracle contracts, with the same opacity and the same multi-year commitments.
  • Velocity. Adding a new data source in PowerCenter is a project. Adding one in Fivetran is a click. The productivity gap is real.
  • The cultural gap. Informatica is sold to CIOs and bought by IT. The modern data stack is sold to data engineers and bought by analytics leaders. These are different buyers with different value systems, and Informatica has not won the second one.

What the Salesforce Acquisition Means

Salesforce's acquisition of Informatica in 2025 is best understood as a bet on enterprise data fabric for AI. Salesforce already owns MuleSoft (application integration) and Tableau (BI). Adding Informatica gives them ETL/ELT, MDM, governance, catalog, and data quality — effectively a complete enterprise data control plane to feed Data Cloud and Agentforce. Whether Salesforce can integrate Informatica without the usual post-acquisition product stagnation is the multi-billion-dollar question.

For practitioners, the practical implication is that Informatica's roadmap is now governed by Salesforce's priorities. Expect deeper Data Cloud integration, more emphasis on AI/agent use cases, and — almost certainly — a slower pace of innovation in the parts of the product that don't serve those goals.

How TextQL Works with Informatica

TextQL Ana connects to the warehouses and lakes that Informatica loads into, and reads metadata from Informatica's catalog where customers expose it. In organizations with Informatica MDM and Axon Governance, TextQL can use the business glossary and lineage as semantic context for natural language queries — particularly valuable in regulated industries where the meaning of "customer" or "policy" is defined in Informatica and not anywhere else. TextQL is complementary to Informatica, not competitive: Informatica governs and moves the data; TextQL lets business users ask questions of it.

See TextQL in action

See TextQL in action

Informatica
Founded 1993
Founders Gaurav Dhillon, Diaz Nesamoney
HQ Redwood City, CA
Status Acquired by Salesforce (announced May 2025, ~$8B)
Flagship PowerCenter (legacy), IDMC (cloud)
Category ETL / Integration
Monthly mindshare ~300K · legacy enterprise; ~5K customers but huge per-seat counts; declining but entrenched