Change data capture (CDC)

Database change data capture (CDC) identifies and tracks changes to a database and sends them as updates in real time, so that downstream processes and systems can act on each change.

A common use case for CDC is keeping a downstream database in sync with an upstream database, or keeping two databases that both receive updates in sync with each other. Debezium is an open-source distributed platform for CDC.
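As a rough illustration of the downstream-sync use case, the sketch below applies a stream of change events to an in-memory replica. The event shape (an `op` code plus `before` and `after` row images) is a simplified assumption, loosely modeled on the envelope style used by CDC tools such as Debezium. The `apply_change` helper, the `id` primary key column, and the sample rows are hypothetical.

```python
from typing import Any, Dict, List

Row = Dict[str, Any]


def apply_change(replica: Dict[int, Row], event: Dict[str, Any]) -> None:
    """Apply one change event to a replica keyed by primary key.

    Assumed event shape (hypothetical, for illustration only):
      op     -- "c" (create), "u" (update), or "d" (delete)
      before -- row image before the change, or None
      after  -- row image after the change, or None
    """
    op = event["op"]
    if op in ("c", "u"):
        # Creates and updates both carry the new row image in "after".
        row = event["after"]
        replica[row["id"]] = dict(row)
    elif op == "d":
        # Deletes only carry the old row image in "before".
        replica.pop(event["before"]["id"], None)
    else:
        raise ValueError(f"unknown op: {op!r}")


# Changes captured from the upstream database, in commit order.
events: List[Dict[str, Any]] = [
    {"op": "c", "before": None, "after": {"id": 1, "name": "Ada"}},
    {"op": "c", "before": None, "after": {"id": 2, "name": "Bob"}},
    {"op": "u", "before": {"id": 1, "name": "Ada"},
     "after": {"id": 1, "name": "Ada L."}},
    {"op": "d", "before": {"id": 2, "name": "Bob"}, "after": None},
]

replica: Dict[int, Row] = {}
for event in events:
    apply_change(replica, event)

print(replica)
```

Events must be applied in the order they were committed upstream. A real pipeline would also need to handle delivery retries and schema changes, which this sketch leaves out.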

A data lakehouse is often the target for incremental updates delivered by CDC, for example when a transactional database is replicated to an analytics database. Data lakehouse projects differ in how well they perform for incremental updates.
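To show what an incremental update involves, here is a minimal sketch of merging a batch of captured changes into an analytics table without reloading it. It does not reflect any particular lakehouse project's API. The table is a plain dictionary, and the change fields (`key`, `seq`, `deleted`, `row`) and both helper functions are assumptions made for illustration.

```python
from typing import Any, Dict, Iterable

Change = Dict[str, Any]


def compact_changes(changes: Iterable[Change]) -> Dict[int, Change]:
    """Keep only the most recent change per key, ordered by sequence number."""
    latest: Dict[int, Change] = {}
    for change in changes:
        key = change["key"]
        if key not in latest or change["seq"] > latest[key]["seq"]:
            latest[key] = change
    return latest


def merge_batch(table: Dict[int, Dict[str, Any]],
                changes: Iterable[Change]) -> int:
    """Upsert or delete only the keys touched by the batch.

    Returns the number of distinct keys that were touched.
    """
    latest = compact_changes(changes)
    for key, change in latest.items():
        if change["deleted"]:
            table.pop(key, None)
        else:
            table[key] = change["row"]
    return len(latest)


# Existing analytics table and a batch of changes from the source database.
table = {1: {"total": 10}, 2: {"total": 20}}
batch = [
    {"key": 1, "seq": 5, "deleted": False, "row": {"total": 15}},
    {"key": 1, "seq": 6, "deleted": False, "row": {"total": 18}},
    {"key": 2, "seq": 7, "deleted": True, "row": None},
    {"key": 3, "seq": 8, "deleted": False, "row": {"total": 5}},
]

touched = merge_batch(table, batch)
print(touched, table)
```

Compacting the batch first means each key is written at most once, even when the source row changed several times between batches. The sketch assumes every change in the batch is newer than the state already in the table.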

