Hacker Newsnew | past | comments | ask | show | jobs | submit | famouswaffles's submissionslogin
1.SPy: An interpreter and compiler for a fast statically typed variant of Python (antocuni.eu)
276 points by famouswaffles 47 days ago | past | 131 comments
2.Emergent Introspective Awareness in Large Language Models (transformer-circuits.pub)
30 points by famouswaffles 48 days ago | past | 4 comments
3.Quantifying the algorithmic improvement from reasoning models (epoch.ai)
1 point by famouswaffles 4 months ago | past
4.Evidence of interrelated cognitive-like capabilities in large language models (sciencedirect.com)
1 point by famouswaffles 6 months ago | past
5.Atlas: Learning to Optimally Memorize the Context at Test Time (arxiv.org)
43 points by famouswaffles 6 months ago | past | 4 comments
6.Gemini Diffusion (deepmind.google)
61 points by famouswaffles 7 months ago | past | 7 comments
7.Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names (arxiv.org)
2 points by famouswaffles 10 months ago | past | 1 comment
8.Over-Tokenized Transformer: Vocabulary Is Generally Worth Scaling (arxiv.org)
2 points by famouswaffles 10 months ago | past
9.LLMs struggle with perception, not reasoning, in ARC-AGI (anokas.substack.com)
2 points by famouswaffles 10 months ago | past
10.EvaByte: Efficient Byte-Level Language Models at Scale (hkunlp.github.io)
3 points by famouswaffles 10 months ago | past
11.Tell me about yourself: LLMs are aware of their learned behaviors (arxiv.org)
2 points by famouswaffles 10 months ago | past
12.Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (arxiv.org)
2 points by famouswaffles 11 months ago | past
13.LLMs struggle with perception, not reasoning, in ARC-AGI (anokas.substack.com)
1 point by famouswaffles 11 months ago | past
14.Byte Latent Transformer: Patches Scale Better Than Tokens (meta.com)
6 points by famouswaffles on Dec 13, 2024 | past
15.Mastering Board Games by External and Internal Planning with Language Models (deepmind.google)
1 point by famouswaffles on Dec 6, 2024 | past
16.Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space (arxiv.org)
2 points by famouswaffles on Nov 14, 2024 | past
17.GameGen-X: Open-World Video Game Generation (gamegen-x.github.io)
4 points by famouswaffles on Nov 5, 2024 | past
18.TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (arxiv.org)
174 points by famouswaffles on Nov 1, 2024 | past | 33 comments
19.Kurzgesagt: We Fell for the Oldest Lie on the Internet [video] (youtube.com)
1 point by famouswaffles on Oct 31, 2024 | past | 3 comments
20.Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-Wise LoRA (arxiv.org)
1 point by famouswaffles on Oct 29, 2024 | past
21.Solving Global Lyapunov functions: open problem in mathematics with transformers (arxiv.org)
3 points by famouswaffles on Oct 27, 2024 | past
22.ChatGPT Topped 3B Visits in September (similarweb.com)
2 points by famouswaffles on Oct 18, 2024 | past
23.Tx-LLM: Supporting therapeutic development with large language models (research.google)
2 points by famouswaffles on Oct 14, 2024 | past
24.Tx-LLM: Supporting therapeutic development with large language models (research.google)
2 points by famouswaffles on Oct 9, 2024 | past
25.Visual Autoregressive Modeling: Image Generation via Next-Resolution Prediction (arxiv.org)
1 point by famouswaffles on Oct 5, 2024 | past | 1 comment
26.xAI's Colossus (100k H100 cluster) has begun training (twitter.com/elonmusk)
7 points by famouswaffles on Sept 7, 2024 | past | 1 comment
27.Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon (arxiv.org)
1 point by famouswaffles on June 28, 2024 | past
28.GPT-4o's image generation capabilities (twitter.com/gdb)
1 point by famouswaffles on May 15, 2024 | past
29.LLMs for few-shot low level robot control by representing trajectories as tokens (twitter.com/ed__johns)
1 point by famouswaffles on April 18, 2024 | past
30.Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics (robot-learning.uk)
1 point by famouswaffles on April 17, 2024 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: