Archive - Rohan's Bytes

🔥 GPT-4.5 arrives: OpenAI’s largest LLM

OpenAI drops GPT-4.5, Microsoft unveils Phi-4 multimodal, AllenAI’s olmOCR slashes OCR costs, and DeepSeek AI cracks the code to fix pipeline slowdowns.

2 hrs ago

🚨 Alibaba's Wan2.1: Text to Video in 4 Minutes, 8.19GB VRAM on RTX 4090

Alibaba’s Wan2.1 generates videos in 4 minutes, OpenAI publishes its system card, and AI players shake up coding, automation, audiobook royalties, and…

Feb 26

🥉 Claude 3.7 Sonnet debuts with “extended thinking”

Claude 3.7 Sonnet lands with “extended thinking,” Google rolls out free Gemini Code Assist worldwide, and Google AI Studio introduces conversation…

Feb 25

DeepSeek Open Sources FlashMLA, reduces memory usage by up to 93.3% and improves throughput up to 5.76x

DeepSeek's FlashMLA unlocks extreme speed on Hopper GPUs, while Uncensored.ai brings unrestricted ansewrs from LLMs and world’s smallest video language…

Feb 24

Benchmarks for LLMs: Capabilities, Methods, and Limitations

Large language models (LLMs) are evaluated using standardized benchmarks to gauge their capabilities.

Feb 23 •

Big Tech’s Dominance in AI: Why Huge Moats Make It a High-Capital Game

When 100,000 GPUs set the entry fee, only the cash-rich dare dream of AI supremacy.

Feb 23 •

🥉 Unsloth releases new GRPO algorithms that enable 10x longer context lengths & 90% less VRAM

Unsloth's GRPO slashes VRAM use, SigLIP 2 enhances vision-language, OpenAI’s Operator expands, Test-time-scaling boosts small models, while DeepSeek and…

Feb 21

🥉 Google introduces AI Co-Scientist: Scaling test-time compute for advanced scientific reasoning

Google's AI Co-Scientist scales test-time compute, OpenAI finds LLMs don't find bugs, GitHub's GPT-4o Copilot boosts VS Code, Figure AI debuts Helix…

Feb 20

Elon Musk unveils Grok 3 and 'Deep Search' tool

Total Read time: 5 minutes 20 seconds

Feb 18

"Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions"

Below podcast on this paper is generated with Google's Illuminate.

Feb 16 •

"SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs"

Below podcast on this paper is generated with Google's Illuminate.

Feb 16 •

"Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation"

Below podcast on this paper is generated with Google's Illuminate.

Feb 16 •

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts