vmctl
can migrate historical data from InfluxDB (v1) to VictoriaMetrics. See ./vmctl influx --help
for details and
full list of flags. Also see migrating data from InfluxDB to VictoriaMetrics
article.
To start migration, specify the InfluxDB address --influx-addr
, database --influx-database
and VictoriaMetrics address --vm-addr
:
./vmctl influx --influx-addr=http://<influx-addr>:8086 \
--influx-database=benchmark \
--vm-addr=http://<victoriametrics-addr>:8428
InfluxDB import mode
2020/01/18 20:47:11 Exploring scheme for database "benchmark"
2020/01/18 20:47:11 fetching fields: command: "show field keys"; database: "benchmark"; retention: "autogen"
2020/01/18 20:47:11 found 10 fields
2020/01/18 20:47:11 fetching series: command: "show series "; database: "benchmark"; retention: "autogen"
Found 40000 timeseries to import. Continue? [Y/n] y
40000 / 40000 [----------------------------------------------------------------------------------------] 100.00% 21 p/s
2020/01/18 21:19:00 Import finished!
2020/01/18 21:19:00 VictoriaMetrics importer stats:
idle duration: 13m51.461434876s;
time spent while importing: 17m56.923899847s;
total samples: 345600000;
samples/s: 320914.04;
total bytes: 5.9 GB;
bytes/s: 5.4 MB;
import requests: 40001;
2020/01/18 21:19:00 Total time: 31m48.467044016s
See how to configure –vm-addr .
Data mapping #
vmctl modifies InfluxDB data by using the following rules:
- Field values are mapped to time series values.
- Tags are mapped to labels format as-is.
influx-database
is mapped intodb
label value, unlessdb
tag already exists in the InfluxDB line. To skip this mapping, enable flaginflux-skip-database-label
.- Field names are mapped to time series names prefixed with
{measurement}{separator}
value, where{separator}
equals to_
by default. It can be changed with--influx-measurement-field-separator
cmd-line flag.
For example, the following InfluxDB line:
foo,tag1=value1,tag2=value2 field1=12,field2=40
is converted into the following format in VictoriaMetrics:
foo_field1{tag1="value1", tag2="value2"} 12
foo_field2{tag1="value1", tag2="value2"} 40
See more about data model differences between VictoriaMetrics and InfluxDB.
Filtering #
Additional filtering for exported data from InfluxDB can be applied via --influx-filter-series
flag. For example:
./vmctl influx --influx-database benchmark \
--influx-filter-series "on benchmark from cpu where hostname='host_1703'"
InfluxDB import mode
2020/01/26 14:23:29 Exploring scheme for database "benchmark"
2020/01/26 14:23:29 fetching fields: command: "show field keys"; database: "benchmark"; retention: "autogen"
2020/01/26 14:23:29 found 12 fields
2020/01/26 14:23:29 fetching series: command: "show series on benchmark from cpu where hostname='host_1703'"; database: "benchmark"; retention: "autogen"
Found 10 timeseries to import. Continue? [Y/n]
The timeseries select query would be following:
fetching series: command: "show series on benchmark from cpu where hostname='host_1703'"; database: "benchmark"; retention: "autogen"
To filter by time specify the following flags:
--influx-filter-time-start
--influx-filter-time-end
Here’s an example of importing timeseries for one day only:
./vmctl influx --influx-database benchmark \
--influx-filter-time-start "2020-01-01T10:07:00Z" \
--influx-filter-time-end "2020-01-01T15:07:00Z"
See more about time filtering in InfluxDB .
InfluxDB v2 #
Migrating data from InfluxDB v2.x is not supported yet (#32 ). You may find useful a 3rd party solution for this - https://github.com/jonppe/influx_to_victoriametrics .
Configuration #
In Influx mode, vmctl fetches data from the InfluxDB by executing read queries. The speed of migration is mostly limited
by capabilities of InfluxDB to respond to these queries fast enough. vmctl executes one read request at a time by default.
Increase --influx-concurrency
to execute more read requests concurrently. But make sure to not overwhelm InfluxDB
during migration.
The flag --influx-chunk-size
controls the max amount of datapoints to return in single chunk from fetch requests.
Please see more details here
.
The chunk size is used to control InfluxDB memory usage, so it won’t OOM on processing large timeseries with
billions of datapoints.
See general vmctl migration tips .
See ./vmctl influx --help
for details and full list of flags:
--influx-addr value InfluxDB server addr (default: "http://localhost:8086")
--influx-user value InfluxDB user [$INFLUX_USERNAME]
--influx-password value InfluxDB user password [$INFLUX_PASSWORD]
--influx-database value InfluxDB database
--influx-retention-policy value InfluxDB retention policy (default: "autogen")
--influx-chunk-size value The chunkSize defines max amount of series to be returned in one chunk (default: 10000)
--influx-concurrency value Number of concurrently running fetch queries to InfluxDB (default: 1)
--influx-filter-series value InfluxDB filter expression to select series. E.g. "from cpu where arch='x86' AND hostname='host_2753'".
See for details https://docs.influxdata.com/influxdb/v1.7/query_language/schema_exploration#show-series
--influx-filter-time-start value The time filter to select timeseries with timestamp equal or higher than provided value. E.g. '2020-01-01T20:07:00Z'
--influx-filter-time-end value The time filter to select timeseries with timestamp equal or lower than provided value. E.g. '2020-01-01T20:07:00Z'
--influx-measurement-field-separator value The {separator} symbol used to concatenate {measurement} and {field} names into series name {measurement}{separator}{field}. (default: "_")
--influx-skip-database-label Whether to skip adding the label 'db' to timeseries. (default: false)
--influx-prometheus-mode Whether to restore the original timeseries name previously written from Prometheus to InfluxDB v1 via remote_write. (default: false)
--influx-cert-file value Optional path to client-side TLS certificate file to use when connecting to -influx-addr
--influx-key-file value Optional path to client-side TLS key to use when connecting to -influx-addr
--influx-CA-file value Optional path to TLS CA file to use for verifying connections to -influx-addr. By default, system CA is used
--influx-server-name value Optional TLS server name to use for connections to -influx-addr. By default, the server name from -influx-addr is used
--influx-insecure-skip-verify Whether to skip tls verification when connecting to -influx-addr (default: false)