List: Llm | Curated by Daniel Mannino

Feb 24, 2025
30 stories
Llm
In Data Science Collective by Lak Lakshmanan
Optimizing to the Eval
GenAI Design Pattern #5
4d ago
In TDS Archive by Ida Silfverskiöld
Advanced Prompt Engineering: Chain of Thought (CoT)
Comparing different techniques for reasoning
Dec 23, 2024
13
In Data Science Collective by Ida Silfverskiöld
Economics of LLMs: Evaluations vs Pricing
Looking at which model to use for which task
Feb 20
12
In Towards AI by Alden Do Rosario
Dear IT Departments, Please Stop Trying To Build Your Own RAG
IT departments convince themselves that building their own RAG-based chat is easy. It’s not. It’s a nightmare.
Nov 12, 2024
171
In Data Science at Microsoft by Shimin Zhang
Evaluating LLM-based chatbots: A comprehensive guide to performance metrics
By Shimin Zhang, Yan Chen, Rui Hu, and Gorkem Ozer Yilmaz
Oct 31, 2024
In TDS Archive by Murilo Gustineli
The Art of Tokenization: Breaking Down Text for AI
Demystifying NLP: From Text to Embeddings
Sep 26, 2024
3
In The Modern Scientist by Yule Wang, PhD
A Complete Guide to LLMs-based Autonomous Agents (Part I):
Chain of Thought, Plan and Solve/Execute, Self-Ask, ReAct, Reflexion, Self-Consistency, Tree of Thoughts and Graph of Thoughts
Oct 9, 2023
8
In Towards AI by Louis-François Bouchard
The Best RAG Stack to Date
(exploring every component)
Sep 14, 2024
17
In Cubed by Michael Wood
Eliminating Hallucinations Lesson 1a: Source Code for Named Entity Filtering (NEF)
Here is the code needed to implement production-ready Named Entity Filtering (NEF) discussed in Hallucination Elimination Lesson One.
Sep 17, 2024
1
In Cubed by Michael Wood
Eliminating Hallucinations Lesson 1: Named Entity Filtering (NEF)
Named Entity Filtering
Sep 2, 2024
17
In TDS Archive by Benjamin Marie
Quantize Llama 3 8B with Bitsandbytes to Preserve Its Accuracy
Llama 2 vs. Llama 3 vs. Mistral 7B, quantized with GPTQ and Bitsandbytes
May 27, 2024
In Snowflake Builders Blog: Data Engineers, App Developers, AI/ML, & Data Science by Chase Ginther
Responsible AI on Snowflake: Snowflake Cortex LLM’s + Snowpark Container Services + Snowflake…
Enterprises are eager to adopt GenAI at scale but struggle with how to govern it in the enterprise. NVIDIA & Snowflake have solutions..
May 12, 2024
In Snowflake Builders Blog: Data Engineers, App Developers, AI/ML, & Data Science by Murali Gandhirajan
Interact with LLM to search and uncover insights from medical transcript documents
Retrieval Augmented Generation (RAG) Using Snowflake Cortex LLM
May 6, 2024
3
In TDS Archive by Jarek Grygolec, Ph.D.
Evaluate RAGs Rigorously or Perish
Use RAGAs framework with hyperparameter optimisation to boost the quality of your RAG system.
Apr 26, 2024
Michael Gorkow
Custom Embedding Models from Hugging Face in Snowflake
Unlocking Multilingual RAG Capabilities in Snowflake with Custom Hugging Face Models
Apr 14, 2024
1
In TDS Archive by Benjamin Marie
Marlin: Nearly Ideal Inference Speed for 4-bit Large Language Models
Up to 4x faster than inference with fp16 parameters
Mar 30, 2024
2
In Data Science at Microsoft by Jane Huang
Evaluating LLM systems: Metrics, challenges, and best practices
A detailed consideration of approaches to evaluation and selection
Mar 5, 2024
21
In Snowflake Builders Blog: Data Engineers, App Developers, AI/ML, & Data Science by Carlos Carrero
Asking Questions to Your Own Documents with Snowflake Cortex
Understanding and analyzing unstructured data with the help of Large Language Models (LLMs) has become a very hot topic. Almost every…
Jan 11, 2024
3
Tom Christian
RAG Made Simple with Snowflake Cortex
End to end RAG within a single data platform? Cortex makes things simple.
Jan 4, 2024
3
In TDS Archive by Ahmed Besbes
How To Build an LLM-Powered App To Chat with PapersWithCode
Keep up with the latest ML research
Feb 1, 2024
9