Ultralytics YOLO supports a wide range of models, from early versions like YOLOv3 to the latest YOLO11.
This component has undergone limited testing. In addition to partial functional testing, only the following models have been confirmed to work: yolov5mu.pt, yolov8n.pt, and yolo11s.pt.
The yolo component uses the official ultralytics Python package. A GPU is used when available.
Models are not installed by default. See below for steps to obtain a model, make it available to the container, and configure Viseron to use it.
Configuration
Configuration example
/config/config.yaml
yolo:
  object_detector:
    model_path: /detectors/models/yolo/my_model.pt
    cameras:
      viseron_camera1:
        fps: 1
        scan_on_motion_only: true
        log_all_objects: false
        labels:
          - label: dog
            confidence: 0.7
            trigger_event_recording: false
          - label: cat
            confidence: 0.8
Object detector domain config.
Camera-specific configuration. Each subordinate key corresponds to the camera_identifier of a configured camera.
Camera identifier. Valid characters are lowercase a-z, numbers and underscores.
The FPS at which the object detector runs.
Higher values will result in more scanning, which uses more resources.
When set to true and a motion_detector is configured, the object detector will only scan while motion is detected.
A list of labels (objects) to track. A combined example of the label options is shown after the option descriptions below.
Lowest confidence allowed for detected objects. The lower the value, the more sensitive the detector will be, and the greater the risk of false positives.
Lowest value: 0
Highest value: 1
Minimum height allowed for detected objects, relative to stream height.
Lowest value: 0
Highest value: 1
Maximum height allowed for detected objects, relative to stream height.
Lowest value: 0
Highest value: 1
Minimum width allowed for detected objects, relative to stream width.
Lowest value: 0
Highest value: 1
Maximum width allowed for detected objects, relative to stream width.
Lowest value: 0
Highest value: 1
DEPRECATED. Use trigger_event_recording instead. If set to true, objects matching this filter will start the recorder.
If set to true, objects matching this filter will trigger an event recording.
If set to true, objects matching this filter will be stored in the database, as well as having a snapshot saved. Labels with trigger_event_recording set to true will always be stored when a recording starts, regardless of this setting.
The interval at which the label should be stored in the database, in seconds. If set to 0, the label will be stored every time it is detected.
If set to true, the recorder will stop as soon as motion is no longer detected, even if the object is still detected. This is useful to avoid never-ending recordings of stationary objects, such as a car parked on a driveway.
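Combining the options above, a fuller labels entry might look like the sketch below. The confidence and trigger_event_recording keys appear in the example at the top of this page; the remaining option names (height_min, height_max, width_min, width_max, store, store_interval, require_motion) are assumptions matched to the descriptions above, so verify them against the component reference.
labels:
  - label: person
    confidence: 0.8
    # Option names below are assumptions matched to the descriptions above
    height_min: 0.1    # relative to stream height
    height_max: 0.9
    width_min: 0.05    # relative to stream width
    width_max: 0.9
    trigger_event_recording: true
    store: true
    store_interval: 60   # store at most once every 60 seconds
    require_motion: true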
Drop frames that are older than the given number of seconds.
When set to true and the log level is DEBUG, all found objects will be logged, including the ones not tracked by labels.
A mask is used to exclude certain areas in the image from object detection.
List of X and Y coordinates to form a polygon.
X-coordinate (horizontal axis).
Y-coordinate (vertical axis).
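As a sketch, assuming the mask option nests the coordinates list described above, a mask that excludes a square in the top-left corner of the image could look like this (the coordinate values are placeholders):
mask:
  - coordinates:
      - x: 0
        y: 0
      - x: 250
        y: 0
      - x: 250
        y: 250
      - x: 0
        y: 250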
Zones are used to define areas in the camera's field of view where you want to look for certain objects (labels). A complete zone example follows the option descriptions below.
Name of the zone. Has to be unique per camera.
List of X and Y coordinates to form a polygon.
X-coordinate (horizontal axis).
Y-coordinate (vertical axis).
A list of labels (objects) to track.
Lowest confidence allowed for detected objects. The lower the value, the more sensitive the detector will be, and the greater the risk of false positives.
Lowest value: 0
Highest value: 1
Minimum height allowed for detected objects, relative to stream height.
Lowest value: 0
Highest value: 1
Maximum height allowed for detected objects, relative to stream height.
Lowest value: 0
Highest value: 1
Minimum width allowed for detected objects, relative to stream width.
Lowest value: 0
Highest value: 1
Maximum width allowed for detected objects, relative to stream width.
Lowest value: 0
Highest value: 1
DEPRECATED. Use trigger_event_recording instead. If set to true, objects matching this filter will start the recorder.
If set to true, objects matching this filter will trigger an event recording.
If set to true, objects matching this filter will be stored in the database, as well as having a snapshot saved. Labels with trigger_event_recording set to true will always be stored when a recording starts, regardless of this setting.
The interval at which the label should be stored in the database, in seconds. If set to 0, the label will be stored every time it is detected.
If set to true, the recorder will stop as soon as motion is no longer detected, even if the object is still detected. This is useful to avoid never-ending recordings of stationary objects, such as a car parked on a driveway.
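Putting the zone options together, a zone that only tracks people in the lower half of a 1920x1080 stream might look like the sketch below; the zone name and coordinates are placeholders:
zones:
  - name: driveway   # placeholder name, must be unique per camera
    coordinates:
      - x: 0
        y: 500
      - x: 1920
        y: 500
      - x: 1920
        y: 1080
      - x: 0
        y: 1080
    labels:
      - label: person
        confidence: 0.8
        trigger_event_recording: true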
Path to a YOLO model. More information can be found in the Pre-trained models section below.
Minimum confidence to consider a detection. This minimum is enforced during inference, before detections are filtered by the values in labels.
Lowest value: 0
Highest value: 1
Intersection Over Union (IoU) threshold for Non-Maximum Suppression (NMS).
Lowest value: 0
Highest value: 1
Enable/disable half precision accuracy.
If your GPU supports FP16, enabling this might give you a performance increase.
Specifies the device for inference (e.g., cpu, cuda:0 or 0).
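For illustration, the detector-level options described above could be combined as follows. Only model_path is confirmed by the example at the top of this page; the other key names (confidence, iou, half, device) are assumptions inferred from the descriptions and the underlying ultralytics parameters, so check the component reference before using them.
yolo:
  object_detector:
    model_path: /detectors/models/yolo/my_model.pt
    # Key names below are assumptions; verify against the component reference
    confidence: 0.3    # minimum confidence enforced during inference
    iou: 0.45          # IoU threshold for NMS
    half: true         # FP16 inference, if the GPU supports it
    device: "cuda:0"   # cpu, cuda:0 or 0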
Pre-trained models
These steps should assist in locating models, configuring your container to access them, and configuring Viseron to use them.
Finding models
Pre-trained YOLO models can be found online, for example from Ultralytics, or you can train them yourself.
There are models for many different tasks, including object detection. If you are unsure whether a problem lies with Viseron, first confirm your Viseron environment with a stock YOLO model from Ultralytics, for example yolov8n.pt.
This component does not provide any training capabilities. See the Ultralytics training documentation for more information.
Where to place models
Place your YOLO models in a directory of your choice.
There will be a later step to map the directory into the container. Therefore, choose a location supported by Docker Compose volume mounts. If in doubt, do not use an SMB or NFS share.
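As an illustration, a host directory holding models could look like this (the path and file names are placeholders):
/home/user/viseron/models/
  yolov8n.pt
  my_model.pt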
Configuring Docker to make models available to Viseron
The following docker-compose.yaml snippet shows how to map the directory above into the container:
/docker-compose.yaml
volumes:
  - {models path}:/detectors/models/yolo
This is the only change to docker-compose.yaml required for this component.
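For example, assuming the models are stored in /home/user/viseron/models on the host and the service is named viseron (both placeholders), the relevant part of docker-compose.yaml would look like:
services:
  viseron:
    volumes:
      - /home/user/viseron/models:/detectors/models/yolo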
Configuring Viseron to use a model
Modify the model_path setting in your Viseron config.yaml to point to one of the models you installed. See the example above.
Only one model can be used at a time.
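Continuing the example above, a model stored on the host as my_model.pt (a placeholder name) is referenced by its path inside the container:
yolo:
  object_detector:
    model_path: /detectors/models/yolo/my_model.pt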
Image resizing
Images inferenced by the component are resized by the underlying ultralytics package to match the model's input size. There is no functionality in the yolo component configuration to resize the image before inference.
Labels
When Viseron loads the model, it will print that model's labels to the log.
cd {location of Viseron docker-compose.yaml}
docker compose logs | grep "Labels"
viseron | 2025-05-29 08:19:04.943 [INFO ] [viseron.components.yolo.object_detector] - Labels: {0: 'bicycle', 1: 'bird', 2: 'bus', 3: 'car', 4: 'cat', 5: 'dog', 6: 'motorcycle', 7: 'person', 8: 'truck', 9: 'squirrel', 10: 'car-light', 11: 'rabbit', 12: 'fox', 13: 'opossum', 14: 'skunk', 15: 'racoon'}
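The label names printed in the log are the values to use in the labels configuration. For example, to track the person and dog classes from the log output above:
labels:
  - label: person
    confidence: 0.7
  - label: dog
    confidence: 0.7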
Troubleshooting
To enable debug logging for yolo, add the following to your config.yaml:
/config/config.yaml
logger:
  logs:
    viseron.components.yolo: debug