Apache Doris offers several data ingestion paths. For Kafka, it has the built-in Routine Load, and Flink can read from Kafka and write to Doris. A third option is the Doris Kafka Connector [1], an extension component of the Kafka Connect ecosystem. It imports Kafka data into Doris and also draws on the wider Kafka Connect ecosystem to provide the following features [2]:
Rich Format Support
- Natively parses complex formats such as Avro/Protobuf.
- Automatically registers and converts schemas.
- Processes binary data streams efficiently.
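
As a rough illustration of the format support above, the following sketch builds the JSON payload that could be posted to the Kafka Connect REST endpoint (`POST /connectors`) to register a Doris sink that reads Avro records. It rests on several assumptions: the `doris.*` property names and the connector class follow the Doris Kafka Connector documentation as best recalled and should be checked against the version in use; the Avro converter is Confluent's `io.confluent.connect.avro.AvroConverter`, which ships separately from Doris; and every host name, port, credential, topic, and table name is a hypothetical placeholder.

```python
import json


def build_doris_sink_payload(topic, table, schema_registry_url):
    """Build a Kafka Connect registration payload for a Doris sink reading Avro.

    All hosts, ports, and credentials below are placeholders, and the
    doris.* property names are assumptions to verify against the docs.
    """
    config = {
        # Sink connector class (assumed from the Doris Kafka Connector docs).
        "connector.class": "org.apache.doris.kafka.connector.DorisSinkConnector",
        "topics": topic,
        # Maps each Kafka topic to a Doris table as "topic:table".
        "doris.topic2table.map": f"{topic}:{table}",
        # Placeholder Doris frontend address and credentials.
        "doris.urls": "doris-fe.example.internal",
        "doris.http.port": "8030",
        "doris.query.port": "9030",
        "doris.user": "example_user",
        "doris.password": "example_password",
        "doris.database": "example_db",
        # Keys are plain strings; values are Avro, decoded by the converter
        # using schemas fetched from the schema registry.
        "key.converter": "org.apache.kafka.connect.storage.StringConverter",
        "value.converter": "io.confluent.connect.avro.AvroConverter",
        "value.converter.schema.registry.url": schema_registry_url,
    }
    return {"name": f"doris-sink-{topic}", "config": config}


payload = build_doris_sink_payload(
    topic="orders_avro",
    table="orders",
    schema_registry_url="http://schema-registry.example.internal:8081",
)
print(json.dumps(payload, indent=2))
```

Format handling sits in the `value.converter` settings rather than in the sink itself, so under these assumptions switching to another format such as Protobuf would mean swapping the converter class while leaving the `doris.*` settings unchanged.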
Heterogeneous Integration of Multiple Data Sources
- Relational databases: MySQL, Oracle, SQL Server, DB2, Informix, etc.
- NoSQL databases: MongoDB, Cassandra, etc.
- Message queue systems: ActiveMQ, IBM MQ, RabbitMQ, etc.
- Cloud data warehouses: Snowflake, Google BigQuery, Amazon Redshift, etc.