>>65658611Quantized models have less precision so they will be a bit dumber but the effect is not significant. It is not cumbersome to quantize them as The Bloke already does it for most models as soon as they appear, but I have no idea how to implement it in the script provided by pixart as my knowledge of LLMs, directly accessing diffusion models and stuff is limited. But I can probably handle API requests and saving the result to file.
The main issue is to get it to work in the first place. I managed to squish down all the errors but now it seems to be hallucinating and describing images even when I don't give it an image.