FAQ

Is VictoriaLogs ready for production use? #

Yes. VictoriaLogs is ready for production use starting from v1.0.0.

What is the difference between VictoriaLogs and Elasticsearch (OpenSearch)? #

Both Elasticsearch and VictoriaLogs allow ingesting structured and unstructured logs and performing fast full-text search over the ingested logs.

Elasticsearch and OpenSearch are designed as general-purpose databases for fast full-text search over large sets of documents. They aren’t optimized specifically for logs. This results in the following issues, which are resolved by VictoriaLogs:

High RAM usage
High disk space usage
Non-trivial index setup
Inability to select more than 10K matching log lines in a single query with default configs

VictoriaLogs is optimized specifically for logs. So it provides the following features useful for logs, which are missing in Elasticsearch:

Easy to set up and operate. There is no need to tune the configuration for optimal performance or to create indexes for various log types. Just run VictoriaLogs on the most suitable hardware, ingest logs into it via the supported data ingestion protocols and get the best available performance out of the box.
Up to 30x less RAM usage than Elasticsearch for the same workload. See the post from a user who replaced a 27-node Elasticsearch cluster with a single-node VictoriaLogs. See also this article for technical details.
Up to 15x less disk space usage than Elasticsearch for the same amount of stored logs.
Ability to work efficiently with hundreds of terabytes of logs on a single node.
Easy-to-use query language optimized for typical log analysis tasks: LogsQL.
Fast full-text search over all the log fields out of the box.
Good integration with traditional command-line tools for log analysis. See these docs.

What is the difference between VictoriaLogs and Grafana Loki? #

Both Grafana Loki and VictoriaLogs are designed for log management and processing. Both systems support the log stream concept.

VictoriaLogs and Grafana Loki have the following differences:

VictoriaLogs is much easier to set up and operate than Grafana Loki. There is no need for non-trivial tuning: it works well with the default configuration.
VictoriaLogs performs typical full-text search queries up to 1000x faster than Grafana Loki.
Grafana Loki doesn’t support log fields with many unique values (aka high cardinality labels) such as user_id, trace_id or ip. It consumes huge amounts of RAM and slows down significantly when logs with high-cardinality fields are ingested into it. See these docs for details.
VictoriaLogs supports high-cardinality log fields out of the box without any additional configuration. It automatically indexes all the ingested log fields, so fast full-text search over any log field works without issues.
Grafana Loki provides a very inconvenient query language, LogQL. This query language is hard to use for typical log analysis tasks.
VictoriaLogs provides an easy-to-use query language for typical log analysis tasks, LogsQL.
See how to convert LogQL to LogsQL.
VictoriaLogs usually needs less RAM and storage space than Grafana Loki for the same amount of logs.

See this article for more details.

What is the difference between VictoriaLogs and ClickHouse? #

ClickHouse is an extremely fast and efficient analytical database. It can be used for log storage, analysis and processing. VictoriaLogs is designed solely for logs. VictoriaLogs uses design ideas similar to those of ClickHouse for achieving high performance.

ClickHouse is good for logs if you know the set of log fields and the expected query types beforehand. Then you can create a table with a column per log field and use optimal settings for the table (sort order, partitioning and indexing) to achieve the maximum possible storage efficiency and query performance.
If the expected log fields or the expected query types aren’t known beforehand, or if they may change over time, then ClickHouse can still be used, but its efficiency may suffer significantly depending on how you design the database schema for log storage.
VictoriaLogs works optimally with any log types out of the box: structured, unstructured and mixed. It works optimally with any sets of log fields, which can change in any way across different log sources.
ClickHouse provides an SQL dialect with additional analytical functionality. It allows performing arbitrarily complex analytical queries over the stored logs.
VictoriaLogs provides an easy-to-use query language with full-text search specifically optimized for log analysis, LogsQL. LogsQL is usually easier to use than SQL for typical log analysis tasks. See these docs.
VictoriaLogs accepts logs from popular log shippers out of the box. See these docs.
ClickHouse needs an intermediate application for converting the ingested logs into INSERT SQL statements for the particular database schema. This may increase the complexity of the system and, subsequently, its maintenance costs.
VictoriaLogs provides a built-in web UI for log exploration.

How does VictoriaLogs work? #

VictoriaLogs accepts logs as JSON entries. Then it stores log fields in distinct data blocks. That is, values for the same log field across multiple log entries are stored in a single data block. This allows reading data blocks only for the needed fields during querying.
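The per-field layout can be illustrated with a minimal Python sketch. It is not the actual VictoriaLogs storage code: the function names `to_columns` and `read_fields` are hypothetical, and the "blocks" are plain Python lists. Storing a missing field as an empty string is an assumption made here to keep rows aligned.

```python
def to_columns(entries):
    """Group values of the same field across log entries into one list per field."""
    field_names = sorted({name for entry in entries for name in entry})
    columns = {name: [] for name in field_names}
    for entry in entries:
        for name in field_names:
            # Assumption: a missing field is stored as an empty string,
            # so every column has one value per log entry.
            columns[name].append(entry.get(name, ""))
    return columns


def read_fields(columns, needed):
    """Return only the columns a query needs; the other columns are never touched."""
    return {name: columns[name] for name in needed if name in columns}


entries = [
    {"_msg": "user logged in", "level": "info", "user_id": "42"},
    {"_msg": "disk is full", "level": "error"},
]
columns = to_columns(entries)
levels = read_fields(columns, ["level"])
```

A query that filters only on `level` reads a single list here, regardless of how many other fields the entries contain.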

Data blocks are compressed before being saved to persistent storage. This allows saving disk space and improving query performance when it is limited by disk read IO bandwidth.

Smaller data blocks are merged into bigger blocks in the background. Data blocks are limited in size. If the size of a data block exceeds the limit, then it is split into multiple smaller blocks.

Every data block is processed in an atomic manner during querying. For example, if the data block contains at least a single value, which needs to be processed, then the whole data block is unpacked and read at once. Data blocks are processed in parallel on all the available CPU cores during querying. This allows scaling query performance with the number of available CPU cores.

This architecture is inspired by the ClickHouse architecture.

On top of this, VictoriaLogs employs additional optimizations for achieving high query performance:

It uses bloom filters for skipping blocks without the given word or phrase.
It uses custom encoding and compression for fields with different data types. For example, it encodes IP addresses into 4 bytes. Custom field encoding reduces data size on disk and improves query performance.
It physically groups logs for the same log stream close to each other in the storage. This improves the compression ratio, which helps reduce disk space usage. This also improves query performance by skipping blocks for unneeded streams when a stream filter is used.
It maintains a sparse index for log timestamps, which improves query performance when a time filter is used.
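The bloom-filter optimization from the list above can be sketched in Python as follows. This is an illustration of the general technique only, not VictoriaLogs internals: the filter size, the number of hashes, the SHA-256-based hashing and the whitespace tokenization are all assumptions chosen for brevity.

```python
import hashlib


class BloomFilter:
    """A tiny bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits=1024, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, word):
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{word}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, word):
        for pos in self._positions(word):
            self.bits |= 1 << pos

    def may_contain(self, word):
        return all((self.bits >> pos) & 1 for pos in self._positions(word))


def build_block(messages):
    """Build a block of messages together with a bloom filter over its words."""
    bloom = BloomFilter()
    for msg in messages:
        for word in msg.lower().split():
            bloom.add(word)
    return {"messages": messages, "bloom": bloom}


def search(blocks, word):
    """Return messages containing the word, skipping blocks the filter rules out."""
    word = word.lower()
    hits = []
    for block in blocks:
        if not block["bloom"].may_contain(word):
            continue  # the block is skipped without scanning its messages
        hits.extend(m for m in block["messages"] if word in m.lower().split())
    return hits


blocks = [
    build_block(["connection refused", "timeout reached"]),
    build_block(["user logged in"]),
]
matches = search(blocks, "timeout")
```

The exact per-message check after the filter is still needed, because a bloom filter may report a word as present when it is not.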

How to export logs from VictoriaLogs? #

Just send the query with the needed filters to /select/logsql/query. VictoriaLogs will return the requested logs as a stream of JSON lines. It is recommended to specify a time filter in order to limit the amount of exported logs.
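A minimal Python sketch of such an export is shown below. It assumes a VictoriaLogs instance reachable at a base URL such as `http://localhost:9428` (an assumption, adjust it to your setup) and passes the LogsQL query in the `query` form field. The helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request


def build_export_request(base_url, query):
    """Build a POST request to /select/logsql/query carrying the LogsQL query."""
    body = urllib.parse.urlencode({"query": query}).encode()
    url = base_url.rstrip("/") + "/select/logsql/query"
    return urllib.request.Request(url, data=body)


def parse_json_lines(lines):
    """Decode a stream of JSON lines, skipping empty lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def export_logs(base_url, query):
    """Stream the matching log entries one by one without buffering the response."""
    with urllib.request.urlopen(build_export_request(base_url, query)) as resp:
        yield from parse_json_lines(resp)


# The time filter (_time:1h) keeps the amount of exported logs bounded.
request = build_export_request("http://localhost:9428", "_time:1h error")
```

Iterating over `export_logs("http://localhost:9428", "_time:1h error")` would then yield one dictionary per exported log entry.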

I want to ingest logs without message field, is that possible? #

VictoriaLogs accepts logs without the _msg field. In this case the _msg field is set to the default value, which can be configured via the -defaultMsgValue command-line flag.

What if my logs have multiple message field candidates? #

If you ingest logs into VictoriaLogs without the _msg field, then this field is filled according to the _msg_field HTTP query arg and/or the VL-Msg-Field HTTP header. See these docs for details. If the _msg_field HTTP query arg and/or the VL-Msg-Field HTTP header contains a list of comma-separated field names, then the first non-empty field from this list is used as the _msg field.

For example, if the following log entry is ingested into VictoriaLogs with _msg_field=message,body:

    {
      "message": "foo bar in message",
      "body": "foo bar in body"
    }

Then the _msg field is set to foo bar in message.

If the following log entry is ingested into VictoriaLogs with _msg_field=message,body:

    {
      "body": "foo bar in body"
    }

Then the _msg field is set to foo bar in body.
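The selection rule from the two examples above can be modelled with a short Python sketch. The function `pick_msg` and its `default` parameter are hypothetical names used for illustration. The `default` parameter stands in for the value configured via -defaultMsgValue.

```python
def pick_msg(entry, msg_fields, default=""):
    """Return the first non-empty field from the comma-separated msg_fields list."""
    for name in msg_fields.split(","):
        value = entry.get(name.strip(), "")
        if value != "":
            return value
    # None of the candidate fields is present or non-empty.
    return default


first = pick_msg(
    {"message": "foo bar in message", "body": "foo bar in body"},
    "message,body",
)
second = pick_msg({"body": "foo bar in body"}, "message,body")
```

Here `first` is taken from the message field and `second` falls back to the body field, matching the two cases above.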

What length is a log record expected to have? #

VictoriaLogs works optimally with log records of up to 10KB. It works OK with log records of up to 100KB. It works less efficiently with log records exceeding 100KB.

The maximum size of a log record VictoriaLogs can accept during data ingestion is 2MB, because log records are stored in blocks of up to 2MB in size. Blocks of this size fit the L2 cache of a typical CPU, which gives optimal performance during data ingestion and querying.

Note that log records with sizes close to 2MB aren’t handled efficiently by VictoriaLogs, because the per-block overhead then applies to a single log record, and this overhead is big.

The 2MB limit is hardcoded and is unlikely to increase.

The limit can be set to a lower value during data ingestion via the -insert.maxLineSizeBytes command-line flag.

What is the maximum supported field name length? #

VictoriaLogs limits the log field name length to 128 bytes. Log entries with longer field names are ignored during data ingestion.

The maximum length of a field name is hardcoded and is unlikely to increase, since this may increase RAM and CPU usage.

How many fields may a single log entry contain? #

A single log entry may contain up to 2000 fields. This is enough for the majority of use cases for structured logs and for wide events.

The maximum number of fields per log entry is hardcoded and is unlikely to increase, since this may increase RAM and CPU usage.

The limit can be set to a lower value during data ingestion via the -insert.maxFieldsPerLine command-line flag.
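The documented limits (2MB per log record, 128 bytes per field name, 2000 fields per entry) can be checked on the client side before sending logs. The following Python sketch is a rough pre-check, not a reproduction of the server-side validation: reading "2MB" as 2 MiB and measuring the record size as the length of its JSON encoding are both assumptions, and `check_entry` is a hypothetical helper name.

```python
import json

MAX_FIELD_NAME_BYTES = 128
MAX_FIELDS_PER_ENTRY = 2000
MAX_RECORD_BYTES = 2 * 1024 * 1024  # assumption: "2MB" is read as 2 MiB


def check_entry(entry):
    """Return a list of problems found in an already flattened log entry."""
    problems = []
    if len(entry) > MAX_FIELDS_PER_ENTRY:
        problems.append(f"too many fields: {len(entry)}")
    for name in entry:
        if len(name.encode("utf-8")) > MAX_FIELD_NAME_BYTES:
            problems.append(f"field name longer than {MAX_FIELD_NAME_BYTES} bytes: {name[:32]}...")
    # Assumption: the size of the JSON encoding approximates the record size.
    size = len(json.dumps(entry).encode("utf-8"))
    if size > MAX_RECORD_BYTES:
        problems.append(f"record too big: {size} bytes")
    return problems


ok = check_entry({"_msg": "request served", "level": "info"})
```

An empty result means the entry passes this approximate check; it does not guarantee that the server accepts the entry.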

The most frequent source of an excessive number of unique log fields is JSON logs with many unique keys. For example:

    {
      "level":"info",
      "_msg":"foo bar",
      "items":{
        "item-1":{...},
        ...
        "item-N":{...}
      }
    }

This JSON contains many unique keys: item-*. They are flattened into the following keys according to the VictoriaLogs data model:

“items.item-1”
…
“items.item-N”

Here N can be arbitrarily large. Do not ingest such logs into VictoriaLogs.

It is possible to instruct VictoriaLogs to preserve the items field value without flattening it, by passing the preserve_json_keys=items query arg to the HTTP data ingestion endpoints that accept JSON-encoded logs. This will result in a single items field with the following string value:

    {
      "item-1":{...},
      ...
      "item-N":{...}
    }

See these docs for details.
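The effect of flattening, with and without preserved keys, can be illustrated with a small Python sketch. It models only nested objects joined with a dot, as in the keys listed above. The function `flatten` is hypothetical, and details such as how arrays or non-string values are handled are deliberately left out.

```python
import json


def flatten(obj, preserve_keys=(), prefix=""):
    """Flatten nested JSON objects into dot-separated keys.

    Keys listed in preserve_keys keep their nested value as one JSON string.
    """
    flat = {}
    for key, value in obj.items():
        name = prefix + key
        if isinstance(value, dict) and name in preserve_keys:
            flat[name] = json.dumps(value, separators=(",", ":"))
        elif isinstance(value, dict):
            flat.update(flatten(value, preserve_keys, name + "."))
        else:
            flat[name] = value
    return flat


entry = {
    "level": "info",
    "_msg": "foo bar",
    "items": {"item-1": "a", "item-2": "b"},
}
flattened = flatten(entry)
preserved = flatten(entry, preserve_keys={"items"})
```

In `flattened` every item becomes its own field (items.item-1, items.item-2), so the number of fields grows with the number of items. In `preserved` there is a single items field regardless of how many items it holds.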

How to determine which log fields occupy the most disk space? #

Run the following LogsQL query based on the block_stats pipe:

    _time:1d
      | block_stats
      | stats by (field)
          sum(values_bytes) as values_bytes,
          sum(bloom_bytes) as bloom_bytes,
          sum(rows) as rows
      | math
          (values_bytes+bloom_bytes) as total_bytes,
          round(total_bytes / rows, 0.01) as bytes_per_row
      | first 10 (total_bytes desc)

This query returns the top 10 log fields that occupy the most disk space across the logs ingested during the last day. The occupied disk space is returned in the total_bytes field.

If you use the VictoriaLogs web UI or the Grafana plugin for VictoriaLogs, then make sure the selected time range covers the last day. Otherwise, the query above returns results for the intersection of the last day and the selected time range.

How to determine which log streams occupy the most disk space? #

Run the following LogsQL query based on the block_stats pipe:

    _time:1d
      | block_stats
      | stats by (_stream)
          sum(values_bytes) as values_bytes,
          sum(bloom_bytes) as bloom_bytes
      | math
          (values_bytes+bloom_bytes) as total_bytes
      | first 10 (total_bytes desc)

This query returns the top 10 log streams that occupy the most disk space across the logs ingested during the last day. The occupied disk space is returned in the total_bytes field.

Why does a log field occupy a lot of disk space? #

See how to determine which log fields occupy the most disk space. A log field may occupy a lot of disk space if it contains values with many unique parts (aka “random” values). Such values do not compress well, so they occupy a lot of disk space. If you want to reduce the amount of occupied disk space, then either remove the given log field from the ingested logs or remove the unique parts from the log field before ingesting it into VictoriaLogs.

How to detect the most frequently seen logs? #

Use the collapse_nums pipe. For example, the following LogsQL query returns the top 10 most frequently seen log messages over the last hour:

      _time:1h | collapse_nums prettify | top 10 (_msg)

Add the _stream field to the top (...) list in order to get the top 10 most frequently seen logs together with the _stream field:

      _time:1h | collapse_nums prettify | top 10 (_stream, _msg)

How to get field names seen in the selected logs? #

Use the field_names pipe. For example, the following LogsQL query returns all the field names seen across all the logs during the last hour:

      _time:1h | field_names | sort by (name)

The hits field in the returned results contains an estimated number of logs with the given log field.

How to get unique field values seen in the selected logs? #

Use the field_values pipe. For example, the following LogsQL query returns all the values for the level field across all the logs seen during the last hour:

      _time:1h | field_values level

The hits field in the returned results contains an estimated number of logs with the given value for the level field.

How to get the number of unique log streams on the given time range? #

Use the count_uniq(...) stats function over the _stream field. For example, the following LogsQL query returns the number of unique log streams across all the logs over the last day:

      _time:1d | count_uniq(_stream)

Does LogsQL support subqueries? #

Yes. See these docs. For example, the following query returns the total number of unique values for the user_id field across the top 3 log streams with the biggest number of logs during the last hour:

      _time:1h _stream_id:in(_time:1h | top 3 (_stream_id) | keep _stream_id) | count_uniq(user_id)

The query works in the following way:

It selects the top 3 log streams with the biggest number of logs during the last hour with the following subquery:
```
_time:1h | top 3 (_stream_id) | keep _stream_id
```
This subquery uses the top and keep pipes.
Then it selects all the logs across the selected log streams over the last hour with the help of the _stream_id:... filter.

How to estimate the needed compute resources for the given workload? #

The needed storage space depends on the following factors:

Data compressibility. VictoriaLogs compresses the ingested logs before storing them to disk. The compression ratio depends on the “randomness” of the ingested logs. Less “random” logs with many repeated field values and small differences between log messages compress the best (up to 100x and more). More “random” logs with many unique field values may have a very low compression ratio.
Data retention. For example, a year-long retention needs 52x more storage space than a week-long retention.
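The two factors above combine into a simple back-of-the-envelope formula. The Python sketch below assumes a constant daily ingestion volume and a constant compression ratio, which is a simplification. The function name and the sample numbers are made up for illustration.

```python
def estimate_storage_bytes(daily_ingest_bytes, compression_ratio, retention_days):
    """Estimate on-disk size: raw daily volume, compressed, kept for retention_days."""
    return daily_ingest_bytes / compression_ratio * retention_days


# Hypothetical workload: 100 GB of raw logs per day with a 10x compression ratio.
daily = 100 * 10**9
week = estimate_storage_bytes(daily, compression_ratio=10, retention_days=7)
year = estimate_storage_bytes(daily, compression_ratio=10, retention_days=52 * 7)
```

With these inputs `year / week` equals 52, which matches the retention example above. The real compression ratio has to be measured on your own logs.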

The needed RAM, CPU, storage IO and network bandwidth depend on the type and the rate of queries over the ingested logs.

“Lightweight” queries over the recently ingested logs with very narrow log stream filters require very few compute resources, even if they are executed at 1000 rps.
“Heavy” queries over a long time range, which do not contain log stream filters or which perform heavy pipe processing such as analytical calculations or sorting over billions of rows, may require hundreds of CPU cores and terabytes of RAM for fast execution. It is OK to execute such queries on machines with a few CPU cores and a few GiB of RAM; they will just take more time to execute.

The best approach to estimating the needed compute resources for the given workload is to start a VictoriaLogs instance, ingest a share (1%-10%) of your production logs into it, and execute typical queries on it while measuring the consumed compute resources. Then you can extrapolate the needed compute resources for the full production workload.
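The extrapolation step can be sketched in Python as follows. It assumes that resource usage scales linearly with the ingested share, which is only a first approximation. The function name, the resource names and the measured values are hypothetical.

```python
def extrapolate(measured, share_percent):
    """Scale resources measured on a share of the logs up to the full workload.

    Assumption: resource usage grows linearly with the amount of ingested logs.
    """
    return {name: value * 100 / share_percent for name, value in measured.items()}


# Hypothetical measurements taken while ingesting 5% of the production logs.
measured = {"cpu_cores": 2, "ram_gib": 4, "disk_gib": 50}
full = extrapolate(measured, share_percent=5)
```

Treat the result as a starting point and add headroom, since query load and data compressibility may not scale linearly.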
