Replicator is a Go package that replicates data between multiple data sources using change streams. It can replicate data between any of its supported sources, currently including MySQL, MongoDB, Kafka, and Elasticsearch, with more planned.
Replicator reads the MySQL change stream by connecting as a replica via the MySQL replication protocol (this also works with AWS RDS).
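Conceptually, tailing the binlog as a replica in Go looks like the sketch below, here using the go-mysql library; the library choice, credentials, and binlog position are illustrative and not necessarily what Replicator uses internally:

```go
package main

import (
	"context"
	"log"

	"github.com/go-mysql-org/go-mysql/mysql"
	"github.com/go-mysql-org/go-mysql/replication"
)

func main() {
	// Connect to MySQL as a replica; the server ID must be unique in the topology.
	syncer := replication.NewBinlogSyncer(replication.BinlogSyncerConfig{
		ServerID: 1001,
		Flavor:   "mysql",
		Host:     "127.0.0.1",
		Port:     3306,
		User:     "repl",
		Password: "secret",
	})

	// Start streaming from a known binlog position (GTID-based sync is also possible).
	streamer, err := syncer.StartSync(mysql.Position{Name: "mysql-bin.000001", Pos: 4})
	if err != nil {
		log.Fatal(err)
	}

	for {
		ev, err := streamer.GetEvent(context.Background())
		if err != nil {
			log.Fatal(err)
		}
		// Row events carry the actual inserted/updated/deleted values.
		if rows, ok := ev.Event.(*replication.RowsEvent); ok {
			log.Printf("table=%s rows=%v", rows.Table.Table, rows.Rows)
		}
	}
}
```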
MongoDB has native Change Streams, so consuming changes from it is straightforward.
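For illustration, a change stream consumer with the official mongo-go-driver might look like this (URI, database, and collection names are placeholders):

```go
package main

import (
	"context"
	"log"

	"go.mongodb.org/mongo-driver/bson"
	"go.mongodb.org/mongo-driver/mongo"
	"go.mongodb.org/mongo-driver/mongo/options"
)

func main() {
	ctx := context.Background()
	client, err := mongo.Connect(ctx, options.Client().ApplyURI("mongodb://localhost:27017"))
	if err != nil {
		log.Fatal(err)
	}
	defer client.Disconnect(ctx)

	// Watch a single collection; an empty pipeline means "all changes".
	stream, err := client.Database("shop").Collection("orders").Watch(ctx, mongo.Pipeline{})
	if err != nil {
		log.Fatal(err)
	}
	defer stream.Close(ctx)

	for stream.Next(ctx) {
		var event bson.M
		if err := stream.Decode(&event); err != nil {
			log.Fatal(err)
		}
		// operationType is insert/update/delete/replace; fullDocument holds the new record.
		log.Printf("op=%v doc=%v", event["operationType"], event["fullDocument"])
	}
}
```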
For Kafka, Replicator uses sarama. Kafka does not have a change stream of its own; instead, it serves as a bus to distribute change events across data centres.
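As a sketch, reading those change events off a topic with sarama could look like this (broker, topic, and partition are placeholders, not Replicator's actual wiring):

```go
package main

import (
	"log"

	"github.com/Shopify/sarama"
)

func main() {
	config := sarama.NewConfig()
	consumer, err := sarama.NewConsumer([]string{"kafka-1:9092"}, config)
	if err != nil {
		log.Fatal(err)
	}
	defer consumer.Close()

	// Consume the partition that carries serialized change events.
	pc, err := consumer.ConsumePartition("replicator-events", 0, sarama.OffsetNewest)
	if err != nil {
		log.Fatal(err)
	}
	defer pc.Close()

	for msg := range pc.Messages() {
		// msg.Value holds one serialized change event to apply downstream.
		log.Printf("offset=%d value=%s", msg.Offset, msg.Value)
	}
}
```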
PostgreSQL exposes changes through its write-ahead log (WAL), so replication from it is technically feasible, but not yet implemented.
AWS DynamoDB provides a change stream API, an official aws-sdk-go, and even example code, so it is a candidate for future support.
Once Replicator receives a record change event (insert, update, or delete), it transforms the record using kazaam and propagates the change to the registered database endpoints. Field mapping, field filtering, and transformations are supported; for example, you can rename columns or fields during replication.
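For instance, a kazaam shift spec can rename a field as the event passes through; the spec and field names below are purely illustrative:

```go
package main

import (
	"fmt"
	"log"

	"github.com/qntfy/kazaam"
)

func main() {
	// Rename "user_name" to "userName" and keep "id" in every change event.
	spec := `[{"operation": "shift", "spec": {"userName": "user_name", "id": "id"}}]`
	k, err := kazaam.NewKazaam(spec)
	if err != nil {
		log.Fatal(err)
	}

	out, err := k.TransformJSONStringToString(`{"id": 42, "user_name": "jo"}`)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(out) // e.g. {"userName":"jo","id":42}
}
```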
Metrics on input/output records are exposed using Prometheus.
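Counters of that kind are typically exposed with the Prometheus Go client; here is a minimal sketch (the metric names are examples, not necessarily the ones Replicator registers):

```go
package main

import (
	"log"
	"net/http"

	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/promhttp"
)

var recordsIn = prometheus.NewCounterVec(
	prometheus.CounterOpts{
		Name: "replicator_records_in_total",
		Help: "Change events read from source streams.",
	},
	[]string{"stream"},
)

func main() {
	prometheus.MustRegister(recordsIn)

	// Increment the counter whenever an event is read from a stream.
	recordsIn.WithLabelValues("mysql").Inc()

	// Expose /metrics for Prometheus to scrape.
	http.Handle("/metrics", promhttp.Handler())
	log.Fatal(http.ListenAndServe(":9100", nil))
}
```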
For complete step-by-step instructions to run Replicator locally with working examples, see the Local Setup Guide.
The guide includes:
- ✅ MySQL to Elasticsearch - Real-time binlog replication with search indexing
- ✅ MongoDB to MongoDB - Change stream replication between MongoDB instances
- Docker setup, configuration, and testing procedures
- Troubleshooting and monitoring
go get -u github.com/cohenjo/replicator

Generate a configuration file containing input streams, output estuaries, and the transformations you want to perform on the records. You can define multiple input/output paths. Note: transformations are performed using kazaam, so its features and limitations apply.
The schema must exist before you start Replicator. Note that Replicator does not replicate schema change events.
Each record should have a unique ID field named id.
Replicator now supports Azure Entra authentication for Azure Cosmos DB for MongoDB using workload identity! This provides enterprise-grade security without managing secrets.
- ✅ Workload Identity: No secrets in configuration
- ✅ Token Management: Automatic refresh and caching
- ✅ Scope Validation: Prevents configuration mistakes
- ✅ Backwards Compatible: Existing connections unchanged
streams:
  - name: "cosmos-stream"
    source:
      type: "mongodb"
      uri: "mongodb://cosmos-cluster.mongo.cosmos.azure.com:10255/"
      database: "production"
      options:
        auth_method: "entra"                              # Enable Entra auth
        tenant_id: "12345678-1234-1234-1234-123456789012" # Azure tenant
        client_id: "87654321-4321-4321-4321-210987654321" # App registration
        scopes: ["https://cosmos.azure.com/.default"]     # Cosmos DB scope
        refresh_before_expiry: "5m"                       # Token refresh buffer

- AKS Cluster: With workload identity enabled
- App Registration: Azure Entra application with MongoDB permissions
- Cosmos DB: Azure Cosmos DB for MongoDB vCore with AAD authentication
- Role Assignment: Cosmos DB Data Contributor role for the application
For complete setup instructions, see MongoDB Entra Implementation Guide.
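With workload identity, token acquisition in Go usually goes through the azidentity package; the sketch below shows the general shape (it is not Replicator's internal code, and wiring the token into the MongoDB connection is omitted):

```go
package main

import (
	"context"
	"log"

	"github.com/Azure/azure-sdk-for-go/sdk/azcore/policy"
	"github.com/Azure/azure-sdk-for-go/sdk/azidentity"
)

func main() {
	// Reads AZURE_CLIENT_ID, AZURE_TENANT_ID and the projected service
	// account token file injected by AKS workload identity.
	cred, err := azidentity.NewWorkloadIdentityCredential(nil)
	if err != nil {
		log.Fatal(err)
	}

	token, err := cred.GetToken(context.Background(), policy.TokenRequestOptions{
		Scopes: []string{"https://cosmos.azure.com/.default"},
	})
	if err != nil {
		log.Fatal(err)
	}

	// The bearer token is then used to authenticate the MongoDB connection;
	// it must be refreshed before token.ExpiresOn.
	log.Printf("token acquired, expires at %s", token.ExpiresOn)
}
```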
The current implementation is rather "local" in design - it reads from the source streams, transforms, and writes to endpoints. If the deployment has remote endpoints, it might be better to use a replicated Kafka topic with snappy compression or a similar algorithm.
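For such a remote-topic setup, snappy compression is a one-line sarama producer setting; a rough sketch (broker and topic names are placeholders):

```go
package main

import (
	"log"

	"github.com/Shopify/sarama"
)

func main() {
	config := sarama.NewConfig()
	config.Producer.Compression = sarama.CompressionSnappy // compress batches across the WAN
	config.Producer.RequiredAcks = sarama.WaitForAll
	config.Producer.Return.Successes = true // required by SyncProducer

	producer, err := sarama.NewSyncProducer([]string{"kafka-remote:9092"}, config)
	if err != nil {
		log.Fatal(err)
	}
	defer producer.Close()

	// Publish one serialized change event to the replicated topic.
	_, _, err = producer.SendMessage(&sarama.ProducerMessage{
		Topic: "replicator-events",
		Value: sarama.StringEncoder(`{"op":"insert","table":"orders","id":42}`),
	})
	if err != nil {
		log.Fatal(err)
	}
}
```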
- MySQL - input/output
- MongoDB - input/output
- KAFKA - input/output
- ElasticSearch - output
- Metrics - expose metrics to Prometheus
- Support for all CRUD ops
- Grafana Dashboard - extend dashboard
- Load tool
- Demo with all functionality
gollum - a very robust system, but lacking DB support.
debezium - currently focused more on traditional database systems (MySQL, Oracle, SQL Server, MongoDB and PostgreSQL).
Replicator is licensed under the MIT License.
Some of the components used are licensed under the Apache License, Version 2.0.
Please review before using in commercial environments.

