CVAT fails health check using >90% disk

See original GitHub issue

My actions before raising this issue

Clone latest develop. docker-compose -f docker-compose.yml -f docker-compose.dev.yml build docker-compose up -d

Expected Behaviour

Startup and run application.

Current Behaviour

Computer runs very slowly as CVAT uses significant resources. When trying to login, the following occurs:

get http://10.0.253.33:9001/api/server/health/?format=json&org= {"Cache backend: default": "working", "DatabaseBackend": "working", "DiskUsage": "warning: 86c1cd71866c 92.4% disk usage exceeds 90%", "MemoryUsage": "working", "MigrationsHealthCheck": "working"}

Server logs:

2022-12-12 04:54:32,063 DEBG 'runserver' stderr output:
[Mon Dec 12 04:54:32.063061 2022] [wsgi:error] [pid 178:tid 139765320791808] ERROR:health-check:warning: 35a75041c750 92.4% disk usage exceeds 90%
[Mon Dec 12 04:54:32.063076 2022] [wsgi:error] [pid 178:tid 139765320791808] Traceback (most recent call last):
[Mon Dec 12 04:54:32.063085 2022] [wsgi:error] [pid 178:tid 139765320791808]   File "/opt/venv/lib/python3.8/site-packages/health_check/backends.py", line 30, in run_check

2022-12-12 04:54:32,063 DEBG 'runserver' stderr output:
[Mon Dec 12 04:54:32.063125 2022] [wsgi:error] [pid 178:tid 139765320791808]     self.check_status()

2022-12-12 04:54:32,063 DEBG 'runserver' stderr output:
[Mon Dec 12 04:54:32.063420 2022] [wsgi:error] [pid 178:tid 139765320791808]   File "/opt/venv/lib/python3.8/site-packages/health_check/contrib/psutil/backends.py", line 21, in check_status
[Mon Dec 12 04:54:32.063422 2022] [wsgi:error] [pid 178:tid 139765320791808]     raise ServiceWarning(
[Mon Dec 12 04:54:32.063422 2022] [wsgi:error] [pid 178:tid 139765320791808] health_check.exceptions.ServiceWarning: warning: 35a75041c750 92.4% disk usage exceeds 90%

2022-12-12 04:54:32,082 DEBG 'runserver' stderr output:
[Mon Dec 12 04:54:32.082022 2022] [wsgi:error] [pid 178:tid 139767587452672] [remote 172.27.0.4:39110] [2022-12-12 04:54:32,081] ERROR django.request: Internal Server Error: /api/server/health/
[Mon Dec 12 04:54:32.082094 2022] [wsgi:error] [pid 178:tid 139767587452672] [remote 172.27.0.4:39110] ERROR:django.request:Internal Server Error: /api/server/health/

Issue Analytics

  • State:open
  • Created 9 months ago
  • Comments:9 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
zhiltsov-maxcommented, Dec 21, 2022

Is there a way to turn off use chunks after a task is created? I’v already created a number of tasks with this option enabled.

Unfortunately, no. I can suggest you to export the annotations without images, and then you can download images separately using SDK or CLI: cvat-cli frames --quality original --outdir task_<N>_images <task_id> (modify the command to fit your case).

1reaction
nmanoviccommented, Dec 13, 2022

@harrystuart , you can adjust health check limits (https://github.com/revsys/django-health-check/blob/master/health_check/conf.py)

Please try to define in cvat/settings/base.py the following variable:

HEALTH_CHECK = {
    "DISK_USAGE_MAX": 99
}
Read more comments on GitHub >

github_iconTop Results From Across the Web

CVAT fails health check using >90% disk - GitHub
Used and trusted by teams at any scale, for data of any scale. - CVAT fails health check using >90% disk · opencv/cvat@760f40d....
Read more >
NCC Health Check: pcvm_disk_usage_check - Nutanix Support
The NCC health check pcvm_disk_usage_check verifies that the amount of disk ... is more than 90% for several hours, a FAIL is returned...
Read more >
Fix common cluster issues | Elasticsearch Guide [7.17] | Elastic
This guide describes how to fix common errors and problems with Elasticsearch clusters. Error: disk usage exceeded flood-stage watermark, ...
Read more >
How to Adjust an Elasticsearch Cluster's Disk Allocation ...
This step-by-step tutorial explains how to adjust an Elasticsearch cluster's disk allocation watermark.
Read more >
Chapter 12. Troubleshooting Logging OpenShift Container ...
Continue freeing up and monitoring the disk space until the used disk space drops below 90%. Then, unblock write to this particular node....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found