When to Stop Training

Training doesn’t need to run “forever”. In real projects, the best results come from stopping at the right moment:

not too early (model hasn’t learned yet)
not too late (model starts to overfit / memorize)

In AugeLab Studio, training usually ends when it reaches the configured max iterations, or when you click Stop Training. The Training Chart helps you decide whether it’s worth continuing.

If this is your first training, start with the Starter Checklist.

Monitor Training Progress

During training, monitor the progress of the model and watch the relationship between:

Loss
mAP
IoU
Iterations

Loss and mAP are shown together on the Training Chart.

All metrics can vary widely depending on:

Data variety
Data size
Annotation accuracy
Model size

The numbers below are rough starting points for newcomers, not targets.

Quick Rule (what usually works)

If you only remember one rule:

Stop when validation mAP stops improving for a long time, or when it starts going down while loss keeps going down.

That second case is the classic sign of overfitting.
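The rule above can be sketched as a small check over the values you read off the Training Chart. This is a minimal illustration, not a feature of AugeLab Studio: the function name `should_stop` and the `patience` and `min_delta` defaults are assumptions chosen for the example, and the inputs are assumed to be one mAP and one loss value per validation checkpoint.

```python
def should_stop(map_history, loss_history, patience=5, min_delta=0.002):
    """Return "overfitting", "plateau", or None (keep training).

    map_history / loss_history: one value per validation checkpoint, oldest first.
    patience and min_delta are illustrative defaults, not AugeLab Studio settings.
    """
    if len(map_history) <= patience or len(loss_history) != len(map_history):
        return None  # not enough evidence yet

    best_before = max(map_history[:-patience])  # best mAP before the recent window
    recent_map = map_history[-patience:]
    recent_loss = loss_history[-patience:]

    map_dropping = recent_map[-1] < best_before - min_delta
    loss_dropping = recent_loss[-1] < recent_loss[0]
    if map_dropping and loss_dropping:
        return "overfitting"  # mAP falls while loss keeps falling

    if max(recent_map) <= best_before + min_delta:
        return "plateau"  # no meaningful mAP gain for `patience` checkpoints

    return None
```

How long "a long time" is depends on how often mAP is evaluated, so treat `patience` as something to tune per project.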

Common Training Patterns (cheat sheet)

These patterns are common in real use. For each one, look at the chart first, then read the explanation.

These example charts are generated for training/documentation purposes. In your repository, place them under the .assets/ folder next to this page.

Insufficient data

Explanation:

What it means: you don’t have enough signal yet to trust the trend.
Likely causes: too few images, too short run, weak/too small validation split.
What to do: train longer; add data; ensure validation exists and includes real variety.

Low variance

Explanation:

What it means: the model learns the “easy repetition” quickly, then stops getting new information.
Likely causes: repetitive dataset (same background/angle/light), missing negatives, missing edge cases.
What to do: add variety (angles, backgrounds, lighting), add negatives, capture hard cases on purpose.

Overtraining

Overtraining is not always catastrophic, but it usually indicates memorization rather than generalization. In strictly controlled environments (fixed camera, fixed lighting), it can be acceptable.

What it means: the model is getting better at the training set, but worse at validation (memorization).
Likely causes: not enough variety, too-small validation, duplicates/near-duplicates.
What to do: stop and keep best weights; add more variety; increase validation split; remove duplicates.
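Removing duplicates can start with a simple scan for byte-identical files. The sketch below is one possible approach using only the Python standard library; the function name and the list of image extensions are assumptions for illustration. It finds exact copies only; near-duplicates (for example, consecutive video frames) need a different technique, such as perceptual hashing, which is not shown here.

```python
import hashlib
from collections import defaultdict
from pathlib import Path

# Assumed set of image extensions; adjust to match your dataset.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp"}

def find_exact_duplicates(image_dir):
    """Return groups (lists of paths) of image files with byte-identical content."""
    groups = defaultdict(list)
    for path in sorted(Path(image_dir).rglob("*")):
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            groups[digest].append(path)
    # Keep only hashes shared by more than one file
    return [paths for paths in groups.values() if len(paths) > 1]
```

Pay particular attention to groups that contain files from both the training and the validation split, since those inflate mAP.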

Model not learning

Explanation:

What it means: training is not progressing in a meaningful way.
Likely causes: wrong labels/classes, class IDs mismatch, broken annotation format, incorrect config/settings.
What to do: verify .names order vs label IDs; spot-check labels; confirm YOLO format; adjust training settings.
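A quick way to verify the .names order against label IDs is to count how often each class ID appears in the label files. The sketch below assumes a .names file with one class name per line and a folder that contains only YOLO label .txt files; the function name and return shape are hypothetical.

```python
from collections import Counter
from pathlib import Path

def class_id_report(names_file, labels_dir):
    """Count label rows per class ID and flag mismatches with the .names file.

    Returns (counts, unknown_ids, unused_names).
    """
    names = [n.strip() for n in Path(names_file).read_text().splitlines() if n.strip()]
    counts = Counter()
    for label_path in Path(labels_dir).rglob("*.txt"):
        for line in label_path.read_text().splitlines():
            fields = line.split()
            if not fields:
                continue  # skip blank lines
            token = fields[0]
            # Keep malformed IDs as strings so they show up in the report
            counts[int(token) if token.isdecimal() else token] += 1

    # IDs that are not integers or fall outside the .names range
    unknown_ids = sorted(
        (k for k in counts if not (isinstance(k, int) and k < len(names))), key=str
    )
    # Class names that never appear in any label file
    unused_names = [name for i, name in enumerate(names) if counts[i] == 0]
    return counts, unknown_ids, unused_names
```

Unknown IDs usually point to a class mapping problem; unused names may simply mean a class has no examples yet.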

Corrupted dataset

Explanation:

What it means: training is being disrupted by inconsistent or broken data.
Likely causes: corrupted image files, invalid labels, mixed sources/resolutions, “empty-but-contains-objects” images.
What to do: run dataset checks; remove corrupted data; fix label format; re-export a clean set.

Good training

Explanation:

What it means: healthy learning and generalization.
Likely causes: consistent labels + enough variety.
What to do: stop when mAP plateaus; validate on real footage / a “golden set”; deploy best weights.

Loss

Loss is a training-fit signal. It represents how well the model is fitting the training batches.

Loss is useful, but it can be misleading:

Loss can keep decreasing even when the model is already overfitting.
Loss does not guarantee “real-world performance”.

Loss alone is not enough to judge accuracy. Use mAP to understand generalization on validation data.

2.0 ≥ Loss

Often indicates “learning has started”, but quality may still be poor. Use it as a sign that the pipeline works, not as a finish line.

As shown in the graph above, loss values around 2.0 may not produce accurate models.

1.0 ≥ Loss

Commonly a usable baseline on many focused datasets.

0.5 ≥ Loss

Often indicates a well-fit model on a clean, consistent dataset. After this point improvements can be slow, and overfitting risk increases.

Loss thresholds are not universal (why)

Loss values depend on model architecture, image size, classes, label noise, augmentation, and dataset complexity. Use loss thresholds to build intuition, not as a universal pass/fail.

mAP

The mAP (mean average precision) metric combines both precision and recall to provide a comprehensive evaluation of the model's accuracy in detecting objects in an image.

It is calculated by evaluating predictions against ground-truth labels at specific IoU thresholds (exact details depend on the training backend/settings).
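To build intuition for how precision and recall combine, here is a minimal sketch of average precision for a single class at a single IoU threshold, using all-point interpolation. This is one common variant and an assumption for illustration; the training backend may use a different interpolation or average over several thresholds. It also assumes each detection has already been marked as a true or false positive.

```python
def average_precision(detections, num_ground_truth):
    """Area under the precision-recall curve (all-point interpolation).

    detections: list of (confidence, is_true_positive) for ONE class
    at ONE IoU threshold. num_ground_truth: labeled objects of that class.
    """
    if num_ground_truth == 0:
        return 0.0
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)

    precisions, recalls = [], []
    tp = fp = 0
    for _, is_tp in ranked:
        if is_tp:
            tp += 1
        else:
            fp += 1
        precisions.append(tp / (tp + fp))
        recalls.append(tp / num_ground_truth)

    # Precision envelope: each point takes the best precision at equal or higher recall
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap

def mean_average_precision(per_class_ap):
    """mAP is the plain mean of the per-class AP values."""
    return sum(per_class_ap) / len(per_class_ap) if per_class_ap else 0.0
```

For example, three detections ranked as true, false, true against two labeled objects give an AP of about 0.83: the first detection earns full precision up to 50% recall, and the remaining recall is earned at a precision of 2/3.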

mAP is only as good as your validation set. If validation images are too few, too “clean”, too similar to training, or mislabeled, mAP can look great while the model fails in production.

Practical interpretation:

A stable plateau is often more important than chasing the last +1%.
Very high mAP (like 95–99%) on a small or repetitive dataset is a common overfitting trap.
If mAP peaks then drops, see Over-Fitting.

IoU

IoU (Intersection over Union) measures the overlap between a predicted bounding box and the true bounding box for a single detection. mAP, by contrast, evaluates the overall performance of the model across all object categories, considering both precision and recall.

The higher the IoU value, the more closely the predicted box matches the true box.
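As a worked illustration of the definition, the sketch below computes IoU for two axis-aligned boxes. The corner-based box format (x_min, y_min, x_max, y_max) is an assumption for this example; YOLO label files store center and size instead, so convert first if needed.

```python
def iou(box_a, box_b):
    """Intersection over Union of two boxes given as (x_min, y_min, x_max, y_max)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlapping rectangle (zero if the boxes do not overlap)
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h

    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - intersection
    return intersection / union if union > 0 else 0.0
```

Two 2×2 boxes offset by one unit in each direction overlap in a 1×1 square, so their IoU is 1 / (4 + 4 − 1) = 1/7, roughly 0.14.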

You can track IoU values in the Training Window logs.

Fine Tuning

Training Time

Define a maximum training time budget based on available computational resources and project constraints. If the model does not reach satisfactory performance within that budget, consider stopping training and trying other approaches, such as:

Manually reviewing annotation accuracy
Checking class variety
Trying different model sizes and batch sizes
Increasing dataset size

Over-Fitting

Avoid overfitting by monitoring how mAP behaves over time.

The most reliable “real life” overfit signal is:

loss decreases, but mAP peaks and then gets worse.

Overfitting is not always “catastrophic” on very constrained, fixed-camera setups. But if you care about robustness (different lighting, different shifts, different backgrounds), overfitting will show up quickly.

What usually helps:

Add more variety (new days, new lighting, new backgrounds)
Add negatives that look like your real environment
Tighten label consistency (same style across labelers)
Increase validation split so mAP is harder to “cheat”
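If you prepare the validation split yourself, a reproducible random split is a reasonable starting point. The sketch below is an assumption-laden example, not how AugeLab Studio splits data internally: the function name, the 20% default, and the fixed seed are all illustrative.

```python
import random

def split_dataset(image_paths, val_fraction=0.2, seed=42):
    """Return (train_paths, val_paths) using a reproducible shuffle."""
    paths = sorted(image_paths)          # stable starting order
    random.Random(seed).shuffle(paths)   # same seed gives the same split
    val_count = max(1, round(len(paths) * val_fraction)) if paths else 0
    return paths[val_count:], paths[:val_count]
```

A purely random split can still leak near-identical frames from the same recording into both sets. When images come from video, splitting by recording session or day is usually a fairer test.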

Balancing Time and Performance

Balance the training time with the desired model performance. In some cases, additional training iterations may improve performance, but the returns may diminish over time. Weigh the benefits against the computational cost and the urgency of the project.

Depending on the number of classes and the dataset size, training typically takes anywhere from a day to a week.

Starter Checklist

Dataset:

Labels are consistent (box style + class meaning)
Dataset has real-world variety (lighting, angles, backgrounds)
You have enough examples per class to learn (more is better; start small, then improve)
(Optional) Augmentation is enabled after labels are correct

Model:

Model size meets your FPS requirements
Model fits the target system (CUDA compatibility, GPU memory)
Batch size fits GPU memory (use subdivisions to avoid OOM)
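The relationship between batch size and subdivisions can be shown with simple arithmetic. This sketch assumes Darknet-style semantics, where `batch` images contribute to each weight update and are processed in `subdivisions` smaller chunks; whether your training backend behaves exactly this way is an assumption to verify. Both function names are hypothetical.

```python
def images_per_pass(batch, subdivisions):
    """Images loaded onto the GPU at once (integer division, Darknet-style)."""
    return batch // subdivisions

def subdivisions_for(batch, max_images_per_pass):
    """Smallest divisor of `batch` that keeps each pass within the given limit."""
    if max_images_per_pass < 1:
        raise ValueError("max_images_per_pass must be at least 1")
    for s in range(1, batch + 1):
        if batch % s == 0 and batch // s <= max_images_per_pass:
            return s
```

With a batch of 64 and 16 subdivisions, only 4 images are in GPU memory at a time, while the effective batch for each weight update stays at 64.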

Training (stop if):

mAP plateaus for a long time (diminishing returns)
mAP falls while Loss continues falling (overfitting)
You hit your time budget and results are “good enough” to test on real footage

Fast debugging checklist (when things look wrong)

Spot-check 20–50 images across the dataset (not just the first page)
Confirm class mapping:

.names file order matches label IDs
no missing/extra classes

Spot-check label files:

YOLO format: class x_center y_center width height (normalized)
boxes are in-bounds and not zero-sized

If mAP looks “too good to be true”:

validation split may be too small or too similar to training
you may have duplicates / near-duplicates

If training is unstable or OOM:

increase subdivisions or reduce batch
temporarily reduce input resolution to debug
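The label spot-check above can be partly automated. The sketch below checks one line of a YOLO label file against the format listed in the checklist (class x_center y_center width height, normalized). The function name, the messages, and the small tolerance for rounding are illustrative assumptions.

```python
def check_label_line(line, tolerance=1e-6):
    """Return a list of problems found in one YOLO label line (empty list = OK)."""
    fields = line.split()
    if len(fields) != 5:
        return [f"expected 5 fields, got {len(fields)}"]

    class_id, *coords = fields
    problems = []
    if not class_id.isdecimal():
        problems.append("class ID is not a non-negative integer")

    try:
        x, y, w, h = (float(v) for v in coords)
    except ValueError:
        return problems + ["coordinates are not numeric"]

    if w <= 0 or h <= 0:
        problems.append("zero-sized or negative box")
    # Normalized boxes must stay within [0, 1] on both axes
    if (x - w / 2 < -tolerance or x + w / 2 > 1 + tolerance
            or y - h / 2 < -tolerance or y + h / 2 > 1 + tolerance):
        problems.append("box extends outside the image")
    return problems
```

Running this over every line of every label file and printing only the non-empty results gives a short list of files to inspect by hand.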


Last updated 1 month ago
