CHANGELOG #

Please find the changelog for VictoriaMetrics Anomaly Detection below.

v1.24.1 #

Released: 2025-06-20

BUGFIX: Resolved the issue first seen in v1.23.0 where some fit and infer jobs were silently skipped at task submission time (due to a bug in the new background scheduler behind PeriodicScheduler ) followed by similar warnings in the logs later on, such as:
```
      2025-06-19 14:32:50,568 - apscheduler.executors.default - WARNING - Run time of job "{job_name}" (trigger: interval[1 day, 0:00:00], next run at: 2025-06-20 14:32:50 UTC)" was missed by 0:00:01.024753
    
```
BUGFIX: Resolved the issue where vmanomaly service on PeriodicScheduler where start_from argument was set and state restoration was enabled, didn’t resume infer jobs after respective fitted models were restored from the previous run. This could lead to a situation where the service, if restore happened in-between fit calls, would not produce any anomaly scores and stay idle until the next fit_every happens, which is expected in stateless mode, but not in stateful mode with restore_state enabled.

v1.24.0 #

Released: 2025-06-18

FEATURE: Introduced stateful vmanomaly service with job persistence and state restoration capabilities. Added a new restore_state setting that enables the service to persist and restore its state between runs, including anomaly detection model instances and training data. This prevents unnecessary model refitting when restarting the service, significantly reducing startup time and computational overhead.
IMPROVEMENT: More informative log messages for fit and infer stages and for sub-optimal configurations used in the sharded mode .
BUGFIX: Now system interrupt signals are properly handled and lead to expected graceful shutdown if for some reason new background scheduler, introduced in v1.23.0 was already stopped in the middle of the fit or infer call. Previously, this could lead to a service crash with an unhandled exception.

v1.23.3 #

Released: 2025-06-13

IMPROVEMENT: Added backward-compatible single-dashed form support for vmanomaly’s command-line arguments to enhance compatibility with VictoriaMetrics ecosystem and ease devOps experience. For example, -license.forceOffline can now be used in addition to --license.forceOffline - for the users who prefer the single-dash format or are accustomed to it from other VictoriaMetrics tools.

v1.23.2 #

Released: 2025-06-09

IMPROVEMENT: Increased convergence speed for OnlineZScoreModel , ZScoreModel , MADModel , and OnlineMADModel models. Now it works better for tight optimization budgets (n_trials < 10, timeout < 1s)
BUGFIX: Now mean and variance of OnlineZScoreModel with exponential decay < 1 arg are properly calculated for unbiased predictions.

v1.23.1 #

Released: 2025-06-08

BUGFIX: In sharding mode the corner case when shard number (VMANOMALY_MEMBER_NUM) is greater than the number of configured shards (VMANOMALY_MEMBERS_COUNT) is now properly handled.
BUGFIX: In sharding mode , the corner case when the number of produced sub-configurations is less than the number of configured shards (VMANOMALY_MEMBERS_COUNT) is now properly handled. Until config hot-reload is supported, such “idle” shards will be turned off with exit code 1 and respective critical message logged.

v1.23.0 #

Released: 2025-06-05

Note

There is a known bug that can cause some fit and infer jobs to be silently skipped at task submission time (due to a bug in the new background scheduler behind PeriodicScheduler ) followed by similar warnings in the logs later on, such as:

      2025-06-19 14:32:50,568 - apscheduler.executors.default - WARNING - Run time of job "{job_name}" (trigger: interval[1 day, 0:00:00], next run at: 2025-06-20 14:32:50 UTC)" was missed by 0:00:01.024753

Releases affected: v1.23.0 - v1.23.3 . The issue has been resolved in patch v1.24.1 , upgrade is recommended.

FEATURE: Added decay argument to online models . This parameters allows for newer data to be weighted more heavily in online models. By default this is set to 1 which means all data points are weighted the same to maintain backward compatibility with existing configs. The closer this value is to 0 the more important new data is.
IMPROVEMENT: Restored back parallelization in the read/fit/infer pipeline, previously disabled in v1.22.0 due to deadlock issues. The new implementation prevents deadlocks, allowing to control the parallelization level via n_workers in settings section . It’s suggested to upgrade from v1.22.0 - v1.22.1 to this version to regain the performance benefits of parallel processing.
IMPROVEMENT: Added --dryRun argument to vmanomaly to enable dry run mode. This mode allows to validate configuration without executing any actual operations and doesn’t require a license. It is particularly useful to test the configurations before deploying them in a production environment.
IMPROVEMENT: Enhanced task scheduling to reduce locks between anomaly detection models’ fit and inference calls, improving their concurrent performance.
IMPROVEMENT: min_dev_from_expected model common argument is now bi-directional, allowing you to set different thresholds for peaks and drops.
BUGFIX: Now clip_predictions model common arg is properly used with online models , ensuring that the predictions are clipped to the respective query’s data_range values even if the model saw less datapoints than required min_n_samples_seen_ to produce anomaly scores (e.g., when a new model instance was created during infer call for new timeseries not seen at training time).

v1.22.1 #

Released: 2025-05-11

FEATURE: Introduced a simplified backtesting mode for the BacktestingScheduler that treats your entire [from, to] (or [from_iso, to_iso]) range as an inference window and automatically generates the corresponding fit windows based on your fit_window setting. To enable it, set the inference_only: true flag in your BacktestingScheduler configuration.
BUGFIX: Resolved a crash when running the BacktestingScheduler with n_jobs greater than 1.
BUGFIX: Corrected the start_from logic in the PeriodicScheduler so that the first job now fires exactly at start_from (instead of occasionally adding fit_every to that time).

v1.22.0-experimental #

Released: 2025-04-11

(Experimental Patch Release)

Note

Important Notice - this patch disables parallelization to resolve rate but critical deadlock issue that completely halted the fit/infer pipeline (resulting in no anomaly scores, no model refits, and no log output) on multicore systems. Although this change improves resource usage by reducing peak-to-average RAM consumption, it incurs a 2–4x slowdown in fit/infer routines. We recommend upgrading only if your current deployments are experiencing deadlock-related outages. Please upgrade to v1.23.0 or newer for restored parallelization.

BUGFIX: Resolved an intermittent deadlock in the fit/infer process that previously caused the service to freeze indefinitely, thereby preventing anomaly score production and model refits on multicore systems.
BUGFIX: Fixed incorrect propagation of the scale model common argument from the old format (single float) to the new format (list of 2 floats).
IMPROVEMENT: Reduced the peak-to-average RAM usage for fit/infer calls from 2–2.5x to 1.1–1.3x, significantly lowering the risk of out-of-memory errors at startup.

v1.21.0 #

Released: 2025-03-19

FEATURE: Introduced horizontal scalability and high availability in vmanomaly service. Dedicated page can be found here .

v1.20.1 #

Released: 2025-03-16

BUGFIX: Resolved an issue in PeriodicScheduler introduced in v1.18.7 when inference is incorrectly skipped due to an outdated fit validation. The check mistakenly caused inference call skips in configurations where fit_every > fit_window. Affected releases: v1.18.7 - v1.20.0 . For fit_every > fit_window configurations we recommend upgrading to this patch release.

v1.20.0 #

Released: 2025-03-03

Note

This release contains a bug introduced in v1.18.7 - PeriodicScheduler where configurations with fit_every > fit_window could cause inference to be skipped for |fit_every - fit_window| time, until the next fit_every call happens. For fit_every > fit_window configurations we recommend upgrading to v1.20.1 , which resolves this issue.

FEATURE: The scale argument is now a common argument , previously supported only by ProphetModel and OnlineQuantileModel . Additionally, scale is now two-sided, represented as [scale_lb, scale_ub]. The previous format (scale: x) remains supported and will be automatically converted to scale: [x, x].
FEATURE: Introduced a post-processing step to clip yhat, yhat_lower, and yhat_upper to the configured data_range values in VmReader, if defined. This feature is disabled by default for backward compatibility. It can be enabled for models that generate predictions and estimates, such as ProphetModel , by setting the common argument clip_predictions to True.
IMPROVEMENT: Introduced the anomaly_score_outside_data_range parameter to allow overriding the default anomaly score (1.01) assigned when input values (y) fall outside the defined data_range (data domain violation). It improves flexibility for alerting rules and enables clearer visual distinction between different anomaly scenarios. Override can be configured at the service level (settings) or per model instance (models.model_xxx), with model-level values taking priority. If not explicitly set, the default anomaly score remains 1.01 for backward compatibility.

v1.19.2 #

Released: 2025-01-27

Note

IMPROVEMENT: Added the complete option to the --splitBy argument in config_splitter.py util . This allows splitting a parent configuration into the smallest possible sub-configurations, each containing exactly one scheduler, one model, and either one or multiple queries (depending on whether the model is multivariate or not).
BUGFIX: Resolved an issue where duplicate log messages were generated during sub-config validation of the parent configuration.
BUGFIX: Corrected usage of AccountID and ProjectID extracted from tenant_id, which are appended as labels vm_account_id and vm_project_id, respectively (previously swapped) by VmReader when using the per-query tenant_id feature. This issue affected versions v1.19.0 and v1.19.1 .
BUGFIX: Resolved an issue with the VmReader instance string representation that caused errors when vmanomaly was run with --loggerLevel DEBUG.

v1.19.1 #

Released: 2025-01-21

Note

There is a known bug in v1.19.0 - the AccountID and ProjectID are swapped when they are extracted from the tenant_id argument in VMReader. This can cause correctly read results being written to the wrong tenant when using the per-query tenant_id feature with AccountID != ProjectID. Please update to patch v1.19.2 , which resolves this issue.

Note

BUGFIX: Resolved writer warnings for configurations where reader.tenant_id equals writer.tenant_id and is not multitenant, as this is a valid setup. Enhanced tenant_id-related log messages across config validation, reader, and writer for improved clarity.

v1.19.0 #

Released: 2025-01-20

Note

FEATURE: Added support for per-query tenant_id in the VmReader . This allows overriding the reader-level tenant_id within a single global vmanomaly configuration on a per-query basis, enabling isolation of data for different tenants in separate queries when querying the VictoriaMetrics cluster version . For details, see the documentation .
IMPROVEMEMT: Speedup the model infer stage on multicore systems.
IMPROVEMEMT: Speedup the model fitting stage by 1.25–3x, depending on configuration complexity.
IMPROVEMENT: Reduced service RAM usage by 5-10%, depending on configuration complexity.
BUGFIX: Now VmReader properly handles the cases where the number of queries processed in parallel (up to reader.queries cardinality) exceeds the default limit of 10 HTTP(S) connections, preventing potential data loss from discarded queries. The pool limit will automatically adjust to match reader.queries cardinality.
BUGFIX: Corrected the construction of write endpoints for cluster VictoriaMetrics urls (tenant_id arg is set) in monitoring.push section configurations .

v1.18.8 #

Released: 2024-12-03

Note

IMPROVEMENT: Added a scale parameter to ProphetModel . It allows for proportional scaling of the confidence intervals generated by interval_width. If set > 1, it may help reducing false positives in scenarios where the data contains many sharp but expected seasonal peaks that may not be well captured by Prophet’s seasonal Fourier terms .
BUGFIX: Corrected an issue in ProphetModel when using tz-aware mode with tz_seasonalities including dow (day of the week). Previously, Sundays were incorrectly handled due to a mismatch between the weekday indices. This caused Sundays to lack weekly seasonality features, defaulting to just averaged trends.

v1.18.7 #

Released: 2024-12-02

Note

This release introduced a bug in PeriodicScheduler where configurations with fit_every > fit_window could cause inference to be skipped for |fit_every - fit_window| time, until the next fit_every call happens. For fit_every > fit_window configurations we recommend upgrading to v1.20.1 , which resolves this issue.

IMPROVEMENT: Introduced a new push_frequency parameter for the monitoring.push component , with a default value of 15m. This enhancement ensures better alignment with pull-based monitoring behavior and improves self-monitoring experience of vmanomaly in setups with infrequent schedules (e.g., rare fit_every or infer_every intervals) to deal with data staleness.
BUGFIX: Fixed a bug, introduced in v1.18.5 , that prevented the monitoring.push component from properly instantiating and pushing self-monitoring metrics .

v1.18.6 #

Released: 2024-12-01

Note

Release v1.18.5 contained an issue that prevented the monitoring.push component from properly instantiating and pushing self-monitoring metrics . This issue has been resolved in patch v1.18.7 , please update to apply the fix.

BUGFIX: Assure proper validation of BacktestingScheduler arguments, if specified in ISO-8601 format, preventing service crashes due to validation errors.

v1.18.5 #

Released: 2024-11-27

Note

This release contained an issue that prevented the monitoring.push component from properly instantiating and pushing self-monitoring metrics . This issue has been resolved in patch v1.18.7 , please update to apply the fix.

IMPROVEMENT: Introduced the ability to run vmanomaly using a configuration directory. This enhancement allows users to recursively merge multiple full configuration files (previously limited to merging specific sections, such as reader) and execute a single instance of the service with the combined configuration.
IMPROVEMENT: Added a new utility, config_splitter.py, to streamline the process of splitting a single configuration file into multiple standalone configurations. The configurations are split by specified entities like schedulers, models, queries or extra_filters. The split configurations can be saved to a designated directory. It simplifies scaling vmanomaly and enhances user experience by automating the process of separating config files so they can be run on separate instances of vmanomaly. For more details, refer to this section .
IMPROVEMENT: Introduced the ability to configure the PeriodicScheduler to start at a specific time using the start_from and tz parameters. The start_from parameter accepts either HH:MM or ISO 8601 formats , with tz defaulting to UTC. If start_from is in the past, the next valid start time is automatically calculated based on the fit_every interval.

v1.18.4 #

Released: 2024-11-18

IMPROVEMENT: Introduced self-monitoring guide for vmanomaly. Added metrics for total RAM vmanomaly_available_memory_bytes and the number of logical CPU cores vmanomaly_cpu_cores_available to the self-monitoring metrics .

v1.18.3 #

Released: 2024-11-14

BUGFIX: This patch release resolves an issue that could cause a service crash when parallelizing data processing with VmReader. Affected releases: v1.18.1 - v1.18.2 .

v1.18.2 #

Released: 2024-11-13

Note

In release v1.18.1 , an issue was identified that could lead to a service crash during parallelized data processing with VmReader . Please update to patch v1.18.3 , which resolves this issue.

IMPROVEMENT: Enhanced the flexibility of the ProphetModel for tz-aware data (tz_aware = True). The tz_seasonalities argument has been reformatted to align with the structure of the existing seasonalities argument. For more details, refer to the model section here . Additionally, tz-aware support for ProphetModel has been added to AutoTuned model wrapper. This feature is automatically enabled if the data is timezone-aware and its timezone is not set to the default (‘UTC’), otherwise default timezone-free optimization flow will be used.

v1.18.1 #

Released: 2024-11-12

Note

In release v1.18.1 , an issue was identified that could lead to a service crash during parallelized data processing with VmReader . Please update to patch v1.18.3 , which resolves this issue.

IMPROVEMENT: Added a reader-level data_range argument, allowing users to define a default valid data range for all input queries in queries. Individual queries can still override this default with their own data_range if needed.
IMPROVEMENT: Added the url label to enhance labelset consistency across self-monitoring metrics in both reader and writer components. Metrics affected:
- vmanomaly_reader_received_bytes
- vmanomaly_reader_response_parsing_seconds
- vmanomaly_reader_timeseries_received
- vmanomaly_reader_datapoints_received
- vmanomaly_writer_request_serialize_seconds
- vmanomaly_writer_datapoints_sent
- vmanomaly_writer_timeseries_sent
BUGFIX: Resolved an issue where rolling models incorrectly set their last seen infer timestamp during first fit_infer call, resulting in output being produced for every datapoint within the fit_window on its first invocation.
BUGFIX: Resolved an issue in multi-scheduler configurations where self-monitoring metric values were overwriting each other.
BUGFIX: Resolved an issue causing incorrect query_key label values in the vmanomaly_model_datapoints_produced self-monitoring metric for univariate models .
BUGFIX: Resolved an issue that caused the vmanomaly_model_runs self-monitoring metric to miss increments for rolling models .
BUGFIX: Aligned the calculations of vmanomaly_model_datapoints_accepted and vmanomaly_model_datapoints_produced self-monitoring model metrics across all stages (fit, infer, and fit_infer) for consistency.

v1.18.0 #

Released: 2024-10-28

FEATURE: Introduced timezone-aware support in VmReader for accurate seasonality modeling, especially during DST shifts. A new tz argument enables timezone offset management at both global and query-specific levels .
- Enhanced ProphetModel with a tz_aware argument (combined with tz_seasonalities and tz_use_cyclical_encoding) for timezone-aware timestamps. This addresses a limitation in Prophet’s native design that doesn’t allow timezone-aware and DST-aware seasonality.
IMPROVEMENT: Enhanced error handling in VmReader to provide clearer diagnostics and broader coverage.
BUGFIX: Updated vmanomaly_version_info and vmanomaly_ui_version_info gauges to correctly set the version label value based on image tags.
BUGFIX: The n_samples_seen_ attribute now properly resets to 0 with each new fit call in online model classes (OnlineMADModel and OnlineQuantileModel ), ensuring accurate tracking of processed sample count.

v1.17.2 #

Released: 2024-10-22

IMPROVEMENT: Added vmanomaly_version_info (service) and vmanomaly_ui_version_info (vmui) gauges to self-monitoring metrics.
IMPROVEMENT: Added instance and job labels to pushed metrics so they have the same labels as vmanomaly metrics that are pulled /scraped. Metric labels can be customized via the extra_labels argument . By default job label will be vmanomaly and the instance label will be f'{hostname}:{vmanomaly_port}. See monitoring.push for examples and details.
IMPROVEMENT: Added a subsection to monitoring page with detailed per-component service logs, including reader and writer logs, error handling, metrics updates, and multi-tenancy warnings.
IMPROVEMENT: Added a new Command-line arguments subsection to the Quickstart guide , providing details on available options for configuring vmanomaly.

v1.17.1 #

Released: 2024-10-18

BUGFIX: Prophet models no longer fail to train on constant data, data consisting of the same value and no variation across time. The bug prevented the fit stage from completing successfully, resulting in the model instance not being stored in the model registry, after automated model cleanup was added in v1.17.0 .

v1.17.0 #

Released: 2024-10-17

FEATURE: Added max_points_per_query (global and query-specific ) VmReader arg to control query chunking. This overrides how search.maxPointsPerTimeseries flag (introduced in v1.14.1 ) is used in vmanomaly for splitting long fit_window queries into smaller sub-intervals. This helps users avoid hitting the search.maxQueryDuration limit for individual queries by distributing initial query across multiple subquery requests with minimal overhead.
IMPROVEMENT: Enhanced the self-monitoring metrics for consistency across the components. Key changes include:
- Converted several self-monitoring metrics from Summary to Histogram to enable quantile calculation. This addresses the limitation of the prometheus_client’s Summary implementation, which does not support quantiles. The change ensures metrics are more informative for performance analysis. Affected metrics are:
  - vmanomaly_reader_request_duration_seconds (VmReader )
  - vmanomaly_reader_response_parsing_seconds (VmReader )
  - vmanomaly_writer_request_duration_seconds (VmWriter )
  - vmanomaly_writer_request_serialize_seconds (VmWriter )
- Added a query_key label to the vmanomaly_reader_response_parsing_seconds metric to provide finer granularity in tracking the performance of individual queries. This metric has also been switched from Summary to Histogram to align with the other metrics and support quantile calculations.
- Added preset and scheduler_alias keys to VmReader and VmWriter metrics for consistency in multi-scheduler setups.
- Renamed Counters vmanomaly_reader_response_count to vmanomaly_reader_responses and vmanomaly_writer_response_count to vmanomaly_writer_responses.
- Updated docs for better clarity.
IMPROVEMENT: Accelerated performance of model fitting stages on multicore systems.
IMPROVEMENT: Optimized query handling in multi-scheduler setups by filtering queries for each scheduler based on model requirements. This reduces unnecessary data fetching from VictoriaMetrics, ensuring only relevant queries are processed by the VmReader , leading to better performance and efficiency of configs with multiple active schedulers.
IMPROVEMENT: Implemented automatic cleanup of files in subdirectories within /tmp (generated by the Stan backend when utilizing Prophet models) after each fit operation. This prevents the accumulation of unused data over time in /tmp, addressing a potential issue where these files would only be deleted upon termination of the current Python session or service, leading to uncontrolled disk growth.
BUGFIX: Re-enable the vmanomaly_reader_response_count (now called vmanomaly_reader_responses) self-monitoring metric for the VmReader , which was unintentionally disabled in previous releases and now updates correctly as intended.

v1.16.3 #

Released: 2024-10-08

IMPROVEMENT: Added tls_cert_file and tls_key_file arguments to support mTLS (mutual TLS) in vmanomaly components. This enhancement applies to the following components: VmReader , VmWriter , and Monitoring/Push . You can also use these arguments in conjunction with verify_tls when it is set as a path to a custom CA certificate file.

v1.16.2 #

Released: 2024-10-06

FEATURE: Added support for multitenant value in tenant_id arg to enable querying across multiple tenants in VictoriaMetrics cluster (option available from v1.104.0 ):
- Applied when reading input data from vmselect via the VmReader .
- Applied when writing generated results through vminsert via the VmWriter .
- For more details, refer to the tenant_id arg description in the documentation of the mentioned components.
BUGFIX: Resolved an issue with handling an empty preset value (e.g., preset: "") that was preventing the default helm chart from being deployed.

v1.16.1 #

Released: 2024-10-02

BUGFIX: This patch release prevents the service from crashing by rolling back the version of a third-party dependency. Affected releases: v1.16.0 .

v1.16.0 #

Released: 2024-10-01

Note

A bug was discovered in this release that causes the service to crash. Please use the patch v1.16.1 to resolve this issue.

FEATURE: Introduced data dumps to a host filesystem for VmReader . Resource-intensive setups (multiple queries returning many metrics, bigger fit_window arg) will have RAM consumption reduced during fit calls.
IMPROVEMENT: Added a groupby argument for logical grouping in multivariate models . When specified, a separate multivariate model is trained for each unique combination of label values in the groupby columns. For example, to perform multivariate anomaly detection on metrics at the machine level without cross-entity interference, you can use groupby: [host] or groupby: [instance], ensuring one model per entity being trained (e.g., per host). Please find more details here .
IMPROVEMENT: Improved performance of VmReader on multicore instances for reading and data processing.
IMPROVEMENT: Introduced new CLI argument aliases to enhance compatibility with Helm charts (i.e. using secrets) and better align with VictoriaMetrics flags :
- --licenseFile as an alias for --license-file
- --license.forceOffline as an alias for --license-verify-offline
- --loggerLevel as an alias for --log-level
- The previous argument format is retained for backward compatibility.
BUGFIX: The provide_series common argument now correctly filters the written time series in the IsolationForestMultivariate model.

v1.15.9 #

Released: 2024-08-27

IMPROVEMENT: Added support for bearer token authentication in push mode within the self-monitoring configuration section .

v1.15.8 #

Released: 2024-08-27

BUGFIX: Made minor adjustments to how the reader and writer handle bearer tokens across different modes.

v1.15.7 #

Released: 2024-08-27

BUGFIX: Made minor adjustments to how the reader and writer handle bearer tokens across different modes.

v1.15.6 #

Released: 2024-08-26

IMPROVEMENT: Introduced the bearer_token_file argument to the reader and writer components to enhance secret management.

v1.15.5 #

Released: 2024-08-19

BUGFIX: following v1.15.2 online model enhancement, now data_range parameter is correctly initialized for online models, created (for new time series returned by particular query) during infer calls.

v1.15.4 #

Released: 2024-08-15

IMPROVEMENT: better config handling of writer and monitoring sections if using vmanomaly with helm charts .

v1.15.3 #

Released: 2024-08-14

IMPROVEMENT: better config handling of reader section if using vmanomaly with helm charts .

v1.15.2 #

Released: 2024-08-13

IMPROVEMENT: Enhanced online models (e.g., OnlineQuantileModel ) to automatically create model instances for unseen time series during infer calls, eliminating the need to wait for the next fit call. This ensures no inferences are skipped when using online models.
BUGFIX: Corrected an issue with the OnlineMADModel to ensure proper functionality when used in combination with on-disk model dump mode .
BUGFIX: Addressed numerical instability in the OnlineQuantileModel when use_transform is set to True.
BUGFIX: Resolved a logging issue that could cause a RuntimeError: reentrant call inside <_io.BufferedWriter name='<stderr>'> when a termination event was received.

v1.15.1 #

Released: 2024-08-10

FEATURE: Introduced backward-compatible data_range query-specific parameter to the VmReader . It enables the definition of valid data ranges for input per individual query in queries, resulting in:
- High anomaly scores (>1) when the data falls outside the expected range, indicating a data constraint violation.
- Lowest anomaly scores (=0) when the model’s predictions (yhat) fall outside the expected range, signaling uncertain predictions.
- For more details, please refer to the documentation .
IMPROVEMENT: Added latency_offset argument to the VmReader to override the default -search.latencyOffset flag of VictoriaMetrics (30s). The default value is set to 1ms, which should help in cases where sampling_frequency is low (10-60s) and sampling_frequency equals infer_every in the PeriodicScheduler . This prevents users from receiving service - WARNING - [Scheduler [scheduler_alias]] No data available for inference. warnings in logs and allows for consecutive infer calls without gaps. To restore the backward compatible behavior, set it equal to your -search.latencyOffset value in VmReader config section.
BUGFIX: Ensure the use_transform argument of the OnlineQuantileModel functions as intended.
BUGFIX: Add a docstring for query_from_last_seen_timestamp arg of VmReader .

v1.15.0 #

Released: 2024-08-06

FEATURE: Introduced models that support online learning for stream-like input. These models significantly reduce the amount of data required for the initial fit stage. For example, they enable reducing fit_every from weeks to hours and increasing fit_every from hours to weeks in the PeriodicScheduler , significantly reducing the peak amount of data queried from VictoriaMetrics during fit stages. The next models were added:
- OnlineZscoreModel - online version of existing Z-score implementation with the same exact behavior.
- OnlineMADModel - online version of existing MADModel implementation with approximate behavior, based on t-digests for online quantile estimation.
- OnlineQuantileModel - online quantile model, that supports custom ranges for seasonality estimation to cover more complex data patterns.
- Find out more about online models specifics in correspondent section .
FEATURE: Introduced the optimized_business_params key (list of strings) to the AutoTuned optimization_params. This allows particular business-specific parameters such as detection_direction and min_dev_from_expected to remain unchanged during optimizations, retaining their default values.
IMPROVEMENT: Optimized the AutoTuned model logic to minimize deviations from the expected anomaly_percentage specified in the configuration and the detected percentage in the data, while also reducing discrepancies between the actual values (y) and the predictions (yhat).
IMPROVEMENT: Allow ProphetModel to fit with multiple seasonalities when used in AutoTuned mode.

v1.14.2 #

Released: 2024-07-26

BUGFIX: Patch a bug introduced in v1.14.1 , causing vmanomaly to crash in preset mode .

v1.14.1 #

Released: 2024-07-26

FEATURE: Allow to process larger data chunks in VmReader that exceed -search.maxPointsPerTimeseries constraint in VictoriaMetrics by splitting the range and sending multiple requests. A warning is printed in logs, suggesting reducing the range or step, or increasing search.maxPointsPerTimeseries constraint in VictoriaMetrics, which is still a recommended option.
FEATURE: Backward-compatible redesign of queries arg of VmReader . Old format of {q_alias1: q_expr1, q_alias2: q_expr2, ...} will be implicitly converted to a new one with a warning raised in logs. New format allows to specify per-query parameters, like step to reduce amount of data read from VictoriaMetrics TSDB and to allow config flexibility. Find out more in Per-query parameters section of VmReader .
IMPROVEMENT: Added multi-platform builds for linux/amd64 and linux/arm64 architectures.

v1.13.3 #

Released: 2024-07-17

BUGFIX: now validation of args argument for HoltWinters model works properly.

v1.13.2 #

Released: 2024-07-15

IMPROVEMENT: update node-exporter preset to reduce false positives
BUGFIX: add verify_tls arg for push monitoring section. Also, verify_tls is now correctly used in VmWriter .
BUGFIX: now AutoTuned model wrapper works correctly in on-disk model storage mode .
BUGFIX: now rolling models , like RollingQuantile are properly handled in One-off scheduler , when wrapped in AutoTuned

v1.13.0 #

Released: 2024-06-11

FEATURE: Introduced preset mode to run vmanomaly service with minimal user input and on widely-known metrics, like those produced by node_exporter .
FEATURE: Introduced min_dev_from_expected model common arg , aimed at reducing false positives in scenarios where deviations between the real value y and the expected value yhat are relatively high and may cause models to generate high anomaly scores . However, these deviations are not significant enough in absolute values to be considered anomalies based on domain knowledge.
FEATURE: Introduced detection_direction model common arg , enabling domain-driven anomaly detection strategies. Configure models to identify anomalies occurring above, below, or in both directions relative to the expected values.
FEATURE: add n_jobs arg to BacktestingScheduler to allow proportionally faster (yet more resource-intensive) evaluations of a config on historical data. Default value is 1, that implies sequential execution.
FEATURE: allow anomaly detection models to be dumped to a host filesystem after fit stage (instead of in-memory). Resource-intensive setups (many models, many metrics, bigger fit_window arg ) and/or 3rd-party models that store fit data (like ProphetModel or HoltWinters ) will have RAM consumption greatly reduced at a cost of slightly slower infer stage. Please find how to enable it here
IMPROVEMENT: Reduced the resource used for each fitted ProphetModel by up to 6 times. This includes both RAM for in-memory models and disk space for on-disk models storage. For more details, refer to this discussion on Facebook’s Prophet .
IMPROVEMENT: now config components class can be referenced by a short alias instead of a full class path - i.e. model.zscore.ZscoreModel becomes zscore, reader.vm.VmReader becomes vm, scheduler.periodic.PeriodicScheduler becomes periodic, etc.
BUGFIX: if using multi-scheduler setup (introduced in v1.11.0 ), prevent schedulers (and correspondent services) that are not attached to any model (so neither found in ‘schedulers’ arg nor left blank in model section) from being spawn, causing resource overhead and slight interference with existing ones.
BUGFIX: set random seed for ProphetModel to assure uncertainty estimates (like yhat_lower, yhat_upper) and dependent series (like anomaly_score), produced during .infer() calls are always deterministic given the same input. See initial issue for the details.
BUGFIX: prevent orphan queries (that are not attached to any model or scheduler) found in queries arg of Reader config section to be fetched from VictoriaMetrics TSDB, avoiding redundant data processing. A warning will be logged, if such queries exist in a parsed config.

v1.12.0 #

Released: 2024-03-31

FEATURE: Introduction of AutoTunedModel model class to optimize any built-in model on data during fit phase. Specify as little as anomaly_percentage param from (0, 0.5) interval and tuned_model_class (i.e. model.zscore.ZscoreModel ) to get it working with best settings that match your data. See details here .

IMPROVEMENT: Better logging of model lifecycle (fit/infer stages).
IMPROVEMENT: Introduce provide_series arg to all the built-in models to define what output fields to generate for writing (i.e. provide_series: ['anomaly_score'] means only scores are being produced)
BUGFIX: Self-monitoring metrics are now aggregated to queries aliases level (not to label sets of individual timeseries) and aligned with reader, writer and model sections description , so /metrics endpoint holds only necessary information for scraping.
BUGFIX: Self-monitoring metric vmanomaly_models_active now has additional labels model_alias, scheduler_alias, preset to align with model-centric self-monitoring .
IMPROVEMENT: Add possibility to use temporal information in IsolationForest models via cyclical encoding . This is particularly helpful to detect multivariate seasonality -dependant anomalies.
BREAKING CHANGE: ARIMA model is removed from built-in models . For affected users, it is suggested to replace ARIMA by Prophet or Holt-Winters .

v1.11.0 #

Released: 2024-02-22

FEATURE: Multi-scheduler support. Now users can use multiple model specs in a single config (via aliasing), each spec can be run with its own (even multiple) schedulers .
- Introduction of schedulers arg in model spec:
  - It allows each model to be managed by 1 (or more) schedulers, so overall resource usage is optimized and flexibility is preserved.
  - Passing an empty list or not specifying this param implies that each model is run in all the schedulers, which is a backward-compatible behavior.
  - Please find more details in docs on Model section
DEPRECATION: slight refactor of a scheduler config section
- Now schedulers are passed as a mapping of scheduler_alias: scheduler_spec under scheduler sections. Using old format (< 1.11.0 ) will produce warnings for now and will be removed in future versions.
DEPRECATION: The --watch CLI option for config file reloads is deprecated and will be ignored in the future.

v1.10.0 #

Released: 2024-02-15

FEATURE: Multi-model support. Now users can specify multiple model specs in a single config (via aliasing), as well as to reference what queries from VmReader it should be run on.
- Introduction of queries arg in model spec:
  - It allows the model to be executed only on a particular query subset from reader section.
  - Passing an empty list or not specifying this param implies that each model is run on results from all queries, which is a backward-compatible behavior.
  - Please find more details in docs on Model section
DEPRECATION: slight refactor of a model config section
- Now models are passed as a mapping of model_alias: model_spec under model sections. Using old format (<= 1.9.2 ) will produce warnings for now and will be removed in future versions.
- Please find more details in docs on Model section
IMPROVEMENT: now logs from monitoring.pull GET requests to /metrics endpoint are shown only in DEBUG mode
IMPROVEMENT: labelset for multivariate models is deduplicated and cleaned, resulting in better UX

Note

These updates support more flexible setup and effective resource management in service, as now it’s not longer needed to spawn several instances of vmanomaly to split queries/models context across.

v1.9.2 #

Released: 2024-01-29

BUGFIX: now multivariate models (like IsolationForestMultivariateModel ) are properly handled throughout fit/infer phases.

v1.9.1 #

Released: 2024-01-27

IMPROVEMENT: Updated the offline license verification backbone to mitigate a critical vulnerability identified in the ecdsa library, ensuring enhanced security despite initial non-impact.
IMPROVEMENT: bump 3rd-party dependencies for Python 3.12.1

v1.9.0 #

Released: 2024-01-26

BUGFIX: The query_from_last_seen_timestamp internal logic in VmReader , first introduced in v1.5.1 , now functions correctly. This fix ensures that the input data shape remains consistent for subsequent fit-based model calls in the service.
BREAKING CHANGE: The sampling_period parameter is now mandatory in VmReader . This change aims to clarify and standardize the frequency of input/output in vmanomaly, thereby reducing uncertainty and aligning with user expectations.

Note

The majority of users, who have been proactively specifying the sampling_period parameter in their configurations, will experience no disruption from this update. This transition formalizes a practice that was already prevalent and expected among our user base.

v1.8.0 #

Released: 2024-01-15

FEATURE: Added Univariate MAD (median absolute deviation) model support.
IMPROVEMENT: Update Python to 3.12.1 and all the dependencies.
IMPROVEMENT: Don’t check /health endpoint, check the real /query_range or /import endpoints directly. Users kept getting problems with /health.
DEPRECATION: “health_path” param is deprecated and doesn’t do anything in config (reader , writer , monitoring.push ).

v1.7.2 #

Released: 2023-12-21

BUGFIX: fit/infer calls are now skipped if we have insufficient valid data to run on.
BUGFIX: proper handling of inf and NaN in fit/infer calls.
FEATURE: add counter of skipped model runs vmanomaly_model_runs_skipped to healthcheck metrics.
FEATURE: add exponential retries wrapper to VmReader’s read_metrics().
FEATURE: add BacktestingScheduler for consecutive retrospective fit/infer calls.
FEATURE: add improved & numerically stable anomaly scores.
IMPROVEMENT: add full config validation. The probability of getting errors in later stages (say, model fit) is greatly reduced now. All the config validation errors that needs to be fixed are now a part of logging.
Note
This is an backward-incompatible change, as model config section now expects key-value args for internal model defined in nested args.
IMPROVEMENT: add explicit support of gzip-ed responses from vmselect in VmReader.

v1.6.0 #

Released: 2023-10-30

IMPROVEMENT:
- now all the produced healthcheck metrics have vmanomaly_ prefix for easier accessing.
- updated docs for monitoring.
Note
This is an backward-incompatible change, as metric names will be changed, resulting in new metrics creation, i.e. model_datapoints_produced will become vmanomaly_model_datapoints_produced
IMPROVEMENT: Set default value for --log_level from DEBUG to INFO to reduce logs verbosity.
IMPROVEMENT: Add alias --log-level to --log_level.
FEATURE: Added extra_filters parameter to reader. It allows to apply global filters to all queries.
FEATURE: Added verify_tls parameter to reader and writer. It allows to disable TLS verification for remote endpoint.
FEATURE: Added bearer_token parameter to reader and writer. It allows to pass bearer token for remote endpoint for authentication.
BUGFIX: Fixed passing workers parameter for reader. Previously it would throw a runtime error if workers was specified.

v1.5.1 #

Released: 2023-09-18

IMPROVEMENT: Infer from the latest seen datapoint for each query. Handles the case datapoints arrive late.

v1.5.0 #

Released: 2023-08-11

FEATURE: add --license and --license-file command-line flags for license code verification.
IMPROVEMENT: Updated Python to 3.11.4 and updated dependencies.
IMPROVEMENT: Guide documentation for Custom Model usage.

v1.4.2 #

Released: 2023-06-09

BUGFIX: Fix case with received metric labels overriding generated.

v1.4.1 #

Released: 2023-06-09

IMPROVEMENT: Update dependencies.

v1.4.0 #

Released: 2023-05-06

FEATURE: Reworked self-monitoring grafana dashboard for vmanomaly.
IMPROVEMENT: Update python version and dependencies.

v1.3.0 #

Released: 2023-03-21

FEATURE: Parallelized queries. See reader.workers param to control parallelism. By default it’s value is equal to number of queries (sends all the queries at once).
IMPROVEMENT: Updated self-monitoring dashboard.
IMPROVEMENT: Reverted back default bind address for /metrics server to 0.0.0.0, as vmanomaly is distributed in Docker images.
IMPROVEMENT: Silenced Prophet INFO logs about yearly seasonality.

v1.2.2 #

Released: 2023-03-19

BUGFIX: Fix for metric label to pass QUERY_KEY.
FEATURE: Added timeout config param to reader, writer, monitoring.push.
BUGFIX: Don’t hang if scheduler-model thread exits.
FEATURE: Now reader, writer and monitoring.push will not halt the process if endpoint is inaccessible or times out, instead they will increment metrics *_response_count{code=~"timeout|connection_error"}.

v1.2.1 #

Released: 2023-02-18

BUGFIX: Fixed scheduler thread starting.
BUGFIX: Fix rolling model fit+infer.
BREAKING CHANGE: monitoring.pull server now binds by default on 127.0.0.1 instead of 0.0.0.0. Please specify explicitly in monitoring.pull.addr what IP address it should bind to for serving /metrics.

v1.2.0 #

Released: 2023-02-04

FEATURE: With arg --watch watches for config(s) changes and reloads the service automatically.
IMPROVEMENT: Remove “provide_series” from HoltWinters model. Only Prophet model now has it, because it may produce a lot of series if “holidays” is on.
IMPROVEMENT: if Prophet’s “provide_series” is omitted, then all series are returned.
DEPRECATION: Config monitoring.endpoint_url is deprecated in favor of monitoring.url.
DEPRECATION: Remove ’enable’ param from config monitoring.pull. Now /metrics server is started whenever monitoring.pull is present.
IMPROVEMENT: include example configs into the docker image at /vmanomaly/config/*
IMPROVEMENT: include self-monitoring grafana dashboard into the docker image under /vmanomaly/dashboard/vmanomaly_grafana_dashboard.json

v1.1.0 #

Released: 2023-01-23

IMPROVEMENT: update Python dependencies
FEATURE: Add multivariate IsolationForest model.

v1.0.1 #

Released: 2023-01-06

BUGFIX: prophet model incorrectly predicted two points in case of only one

v1.0.0-beta #

Released: 2022-12-08

First public release is available

Previous FAQ Next