>>59847680
Depends on the model. If we're serving 13B, each GPU should be able to handle around 50 users prompting **simultaneously**. But if the requests are properly paced we can do hundreds to thousands of users per GPU (rough sketch of that pacing math at the bottom of the post). Keep in mind these are estimations based on benchmarks; not sure how they'd apply to real-world cases. Here's a chart I made a couple of months ago, but by now we should be around 2.5x more efficient (too lazy to update it):
13B model:
- # GPUs is the number of GPUs the model is running on
- N is how many prompts each user requests (think of it like the pre-loaded swipes from the early days of C.AI)
- Requests/s is how many requests we'll be able to serve every second
- Tokens/s is the total throughput per second
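Rough sketch of how I read those columns fitting together, assuming each request fans out into N completions of roughly the same length (the average response length below is a placeholder, not a number from the chart):

```python
# Rough sketch of how the chart columns relate. Assumes each request
# fans out into N completions of roughly equal length;
# avg_tokens_per_completion is a placeholder, not a chart value.

def total_tokens_per_s(requests_per_s: float,
                       n_prompts_per_request: int,
                       avg_tokens_per_completion: float) -> float:
    """Total generation throughput implied by the other two columns."""
    return requests_per_s * n_prompts_per_request * avg_tokens_per_completion

# e.g. 2 requests/s, N = 4 swipes each, ~250 tokens per swipe
# -> ~2000 tokens/s total throughput
print(total_tokens_per_s(2.0, 4, 250.0))
```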
Now that we natively support Exllama and AWQ, and I've improved efficiency by at least 60%, we're likely doing much better than those chart numbers.
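And for the "hundreds to thousands per GPU if properly paced" bit above, here's the back-of-the-envelope version; the generation time and the gap between a user's prompts are placeholder guesses, not measured numbers:

```python
# Back-of-the-envelope for the "properly paced" estimate above.
# concurrent_slots is the ~50 simultaneous prompts per GPU from the post;
# the timing numbers are placeholder guesses, not benchmark results.

def users_per_gpu(concurrent_slots: int,
                  gen_seconds_per_response: float,
                  idle_seconds_between_prompts: float) -> float:
    """Users one GPU can serve when prompts are spread out in time."""
    # A slot is only busy for gen_seconds out of each user's full
    # prompt cycle, so one slot can be time-shared across several users.
    cycle = gen_seconds_per_response + idle_seconds_between_prompts
    return concurrent_slots * cycle / gen_seconds_per_response

# e.g. 50 slots, ~10 s per response, a user prompting every ~90 s:
# 50 * 100 / 10 = 500 users on one GPU, i.e. "hundreds per GPU"
print(users_per_gpu(50, 10.0, 90.0))
```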