python
HuggingFace Model Too Large for GPU
torch\.cuda\.OutOfMemoryError.*loading.*model
Fixes
- 1.Load with device_map='auto' for automatic sharding
- 2.Use load_in_8bit=True or load_in_4bit=True with bitsandbytes
- 3.Load to CPU first then move specific layers to GPU
huggingfacemodeloom
Related Errors
python3 fixes
Asyncio event loop already running
RuntimeError: This event loop is already running
- •Use nest_asyncio.apply() to allow nested event loops
- •Use asyncio.run_coroutine_threadsafe() instead of asyncio.run()
python3 fixes
Coroutine never awaited
RuntimeWarning: coroutine '.*' was never awaited
- •Add 'await' before the coroutine call
- •Use asyncio.create_task() to schedule the coroutine
python3 fixes
Asyncio task was cancelled
asyncio\.CancelledError
- •Handle CancelledError in try/except within the task
- •Use asyncio.shield() to protect critical sections from cancellation