| | Understanding Multimodal LLMs: The Main Techniques and Latest Models (sebastianraschka.com) |
| 4 points by sbbq on Nov 3, 2024 | past |
|
| | Building a GPT-Style LLM Classifier from Scratch (sebastianraschka.com) |
| 2 points by mdp2021 on Sept 21, 2024 | past |
|
| | Building LLMs from the Ground Up: A 3-Hour Coding Workshop (sebastianraschka.com) |
| 970 points by mdp2021 on Aug 31, 2024 | past | 136 comments |
|
| | Show HN: New LLM Pre-Training and Post-Training Paradigms (sebastianraschka.com) |
| 2 points by rasbt on Aug 21, 2024 | past |
|
| | New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained (sebastianraschka.com) |
| 5 points by sbbq on Aug 17, 2024 | past |
|
| | Developing an LLM: Building, Training, Finetuning (sebastianraschka.com) |
| 1 point by Anon84 on June 13, 2024 | past |
|
| | Understanding the LLM Development Cycle: Building, Training, Finetuning (sebastianraschka.com) |
| 3 points by rasbt on June 8, 2024 | past |
|
| | The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (sebastianraschka.com) |
| 5 points by rasbt on May 12, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 2 points by sbbq on April 2, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 2 points by tosh on April 1, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 1 point by Anon84 on March 31, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 2 points by rasbt on March 31, 2024 | past |
|
| | AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (sebastianraschka.com) |
| 3 points by rasbt on March 3, 2024 | past |
|
| | Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (sebastianraschka.com) |
| 96 points by rasbt on Feb 18, 2024 | past | 10 comments |
|
| | AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (sebastianraschka.com) |
| 20 points by rasbt on Feb 3, 2024 | past |
|
| | Naive Bayes and Text Classification I – Introduction and Theory (2014) (sebastianraschka.com) |
| 2 points by vikrum on Jan 22, 2024 | past |
|
| | Coding Self-Attention, Multi-Head Attention, Cross-Attention, Causal-Attention (sebastianraschka.com) |
| 142 points by rasbt on Jan 14, 2024 | past | 11 comments |
|
| | Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com) |
| 128 points by danboarder on Jan 6, 2024 | past | 19 comments |
|
| | Noteworthy AI Research Papers of 2023 (sebastianraschka.com) |
| 3 points by rasbt on Jan 1, 2024 | past |
|
| | Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com) |
| 9 points by lucasus on Dec 30, 2023 | past |
|
| | Research Papers in November 2023 (sebastianraschka.com) |
| 1 point by Anon84 on Dec 10, 2023 | past |
|
| | AI Research Papers in November 2023: hallucinations and reasoning capabilities (sebastianraschka.com) |
| 5 points by rasbt on Dec 9, 2023 | past |
|
| | Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) (sebastianraschka.com) |
| 342 points by rasbt on Nov 19, 2023 | past | 27 comments |
|
| | Why would a famous former university ML professor make his posts paywalled? (sebastianraschka.com) |
| 7 points by behnamoh on Nov 6, 2023 | past | 1 comment |
|
| | AI and Open Source in 2023 (sebastianraschka.com) |
| 123 points by belter on Nov 4, 2023 | past | 67 comments |
|
| | AI Research Papers (October 2023) (sebastianraschka.com) |
| 5 points by rasbt on Nov 4, 2023 | past |
|
| | AI and Open Source in 2023: A Review of the Year's Highs and Lows (sebastianraschka.com) |
| 2 points by rasbt on Oct 23, 2023 | past |
|
| | AI chips, acquisitions, new "small" open-source LLMs, and new LoRA techniques (sebastianraschka.com) |
| 5 points by rasbt on Oct 9, 2023 | past |
|
| | AI news editorial from custom AI chips to new "small" LLMs like phi and Mistral (sebastianraschka.com) |
| 1 point by rasbt on Oct 8, 2023 | past |
|
| | AI research papers summaries and highlights (Aug to Sep) (sebastianraschka.com) |
| 3 points by rasbt on Sept 24, 2023 | past |
|