Hacker Newsnew | past | comments | ask | show | jobs | submit | raus22's commentslogin

With models like these, when multilingual is not mentioned it will perform really bad on real life non-english pdfs.


The model was primarily trained on English documents, which is why English is listed as the main language. However, the training data did include a smaller proportion of Chinese and various European languages. Additionally, the base model (Qwen-2.5-VL-3B) is multilingual. Someone on Reddit mentioned it worked on Chinese: https://www.reddit.com/r/LocalLLaMA/comments/1l9p54x/comment...


We built yet another eink art frame.


> I found the level of synthetic bubbliness in the voice extremely off-putting.

My thought exactly, it was to the extreme in its, as you say, bubbliness. I would not be able to use a tool that had this behavior.


Douglas Adams was onto something when he decided the superintelligent servant in Hitchhikers Guide was to loudly complain about its endless depression. Maybe then we’ll only ask things of it when we actually need it and otherwise avoid interaction.


Here in Denmark I can't pay for the deposit because we have extra security steps when paying with credit card... and they did not implement it.



But why post that on twitter? That makes no sense.


Why not go the other way and decrease the actions per minute so you learn the overall point of the game , And with each game the actions per minute increases.


Or maybe extend the traditional categories of macro and micro with another one, call it 'nano'... the micro agent indicates where each unit ought to be in 9 frames, and the nano agent figures out how to take them there. Since the timescale is so short, the agent could brute-force enumerate possible moves to some extent and figure out which is optimal, like chess AI. Or use a separate network.

I guess that's inelegant when a deep network already has its own concept of fine-grained versus coarse-grained layers, and should be able to do this on its own with the right training method.


That sounds like an interesting research angle. The thing about AI research is there are so many open ends there are essentially unlimited research options. If you can pose it as a problem and identify a reasonable programming approach then you have an avenue for AI research. Deep Learning isn't the end of AI research. It is the beginning.


and with humour: > Git's internal date-formatting code can now correctly show dates past the year 2100. Phew, fixed with only 84 years to spare.


Related video: Humans need not apply https://youtu.be/7Pq-S557XQU


Do you know why it puts a lot of things on surfboards?

It seems from the comments that that is one of the most represented misclassifications.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: