/vt/ - Virtual YouTubers » Searching for posts with the image hash ‘XOYXI75CzCHabS0AeOJ+Zw==’.

Archives: [ bant / c / con / e / i / n / news / out / p / pw / qst / toy / vip / vp / vt / w / wg / wsr ] Boards: [ meta ]

Domain changed to archive.palanq.win . Feb 14-25 still awaits import.

Searching for posts with the image hash ‘XOYXI75CzCHabS0AeOJ+Zw==’. 1 results found.

Anonymous

View Same Google ImgOps iqdb SauceNAO file.png, 19KiB, 702x203

Anonymous Sat 14 Sep 2024 09:10:03 No.85171889 View View Report

Quoted By:

>>85170453
if chatGPT is a decent representation of current multimodal model capabilities (haven't used one locally myself) they can't directly do coordinates of objects in images. i've heard of people working around it by overlaying a labeled grid on the image that it can reference, but if you try to ask chatgpt for coordinates of where to click next in an osu screenshot even if it gives you the right circle it has to write some quick python code to basically eyeball it without nearly enough precision to play the game
so besides the actual speed of the whole thing and whether or not the LLM knows how the game is played i think there's a fundamental issue with how image analysis works that'll be a blocker

Capcode	All Only User Posts Only Verified Posts Only Moderator Posts Only Manager Posts Only Admin Posts Only Developer Posts Only Founder Posts
Show Posts	All Only With Images Only Without Images Only Spoiler Images Only Non-Spoiler Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

Searching for posts with the image hash ‘XOYXI75CzCHabS0AeOJ+Zw==’. 1 results found.

On these archives

Your latest searches