Basic Principles

Introduction to Doris

Doris is a high-performance, real-time analytical database based on MPP architecture, known for its extreme speed and ease of use. It can return query results of mass data in sub-seconds and can support high-concurrency point queries and high-throughput complex analysis. All this makes Apache Doris an ideal tool for report analysis, ad-hoc query, unified data warehouse, and data lake query acceleration. On Doris, users can build various applications, such as user behavior analysis, AB test platform, log retrieval analysis, user portrait analysis, and order analysis. For more information, see Apache Doris.

Doris Architecture

The following figure shows the overall architecture of Doris. The frontend (FE) and backend (BE) nodes can be expanded horizontally and infinitely.

Figure 1 Doris architecture
Table 1 Description

Parameter

Description

MySQL Tools

Doris is fully compatible with MySQL syntax and can be accessed by various client tools. It also supports standard SQL statements and can seamlessly connect to BI tools.

FE

Frontend nodes process user access requests, plan query parsing, and manage metadata and nodes.

BE

Backend nodes store data, execute query plans, and balance load among copies.

Leader

Leader is a role elected from Follower nodes.

Follower

Follower nodes receive metadata logs, which must be written successfully in most nodes.

Doris uses the MPP model for inter-node and intra-node parallel execution, making it suitable for distributed joins of large tables.

It also supports vectorized query execution engines, adaptive query execution (AQE) technology, optimization strategies that combine CBO and RBO, and hot data cache queries.

Basic Concepts

In Doris, data is logically described in the form of tables.