>>46208670NTA but
1000 series perform fp16 1/64th as slow as fp32
2000 series perform fp16 twice as fast as fp32
3000 series and up perform fp16 the same rate as fp32
Webui autocasts up if it detects that it wants to otherwise it runs at fp16
I think the relevant code is in modules/
devices.py