Sort: Relevance · Newest · 10 hits · 27 of 30 left today
#1

Retrieval-Augmented Generation | Definition and Overview | Product Talk

https://www.producttalk.org/glossary-ai-retrieval-augmented-generation/

What is retrieval-augmented generation? Retrieval-augmented generation (RAG) is a technique for enhancing LLM responses by retrieving relevant information from external data sources and adding it to the user's prompt before the LLM generates its response. Instead of relying solel…

Read at source

#2

Retrieval-Augmented Generation (RAG) Security Cheat Sheet¶ (part 1/3)

https://cheatsheetseries.owasp.org/cheatsheets/RAG_Security_Cheat_Sheet.html

Retrieval-Augmented Generation (RAG) Security Cheat Sheet¶ Introduction¶ Retrieval Augmented Generation (RAG) is now standard architecture for enterprise AI applications. By grounding language model responses in retrieved documents, RAG reduces hallucination and enables domain-sp…

Read at source

#3

README

anthropics/anthropic-cookbook · capabilities/contextual-embeddings/README.md

# Retrieval Augmented Generation with Contextual Embeddings Learn how to improve RAG performance using contextual embeddings to add relevant context to each chunk before embedding. ## Contents - `guide.ipynb`: Main tutorial notebook - `data/`: Data files for examples and testi…

Read at source

#4

Situate Context

567-labs/instructor · docs/blog/posts/situate-context.md

--- authors: - jxnl categories: - Anthropic - LLM Techniques - Python comments: true date: 2024-09-26 description: Learn to implement Anthropic's Contextual Retrieval with async processing to enhance RAG systems and preserve crucial context efficiently. draft: false t…

Read at source

#5

Retrieval Augmented Generation

vllm-project/vllm · docs/deployment/frameworks/retrieval_augmented_generation.md

# Retrieval-Augmented Generation [Retrieval-augmented generation (RAG)](https://en.wikipedia.org/wiki/Retrieval-augmented_generation) is a technique that enables generative artificial intelligence (Gen AI) models to retrieve and incorporate new information. It modifies interacti…

Read at source

#6

Rag

huggingface/transformers · docs/source/en/model_doc/rag.md

<!--Copyright 2020 The HuggingFace Team. All rights reserved. Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unl…

Read at source

#7

Patterns for Building LLM-based Systems & Products (part 2/7)

https://eugeneyan.com/writing/llm-patterns/

If our task has no correct answer but we have references (e.g., machine translation, extractive summarization), we can rely on reference metrics based on matching (BLEU, ROUGE) or semantic similarity (BERTScore, MoverScore). However, these metrics may not work for more open-ended…

Read at source

#8

Configuration file reference (part 1/2)

https://docs.docker.com/ai/docker-agent/reference/config/

Configuration file reference This reference documents the YAML configuration file format for agents using Docker Agent. It covers file structure, agent parameters, model configuration, toolset setup, and RAG sources. For detailed documentation of each toolset's capabilities and s…

Read at source

#9

Configuration file reference (part 2/2)

https://docs.docker.com/ai/docker-agent/reference/config/

With semantic-embeddings, you can include AST metadata in the LLM prompts: - type: semantic-embeddings embedding_model: openai/text-embedding-3-small vector_dimensions: 1536 chat_model: openai/gpt-5-mini database: ./code.db ast_context: true # Include AST metadata in semantic pro…

Read at source

#10

Tuning (part 1/3)

apache/flink · docs/content/docs/dev/table/tuning.md

--- title: "Performance Tuning" weight: 112 type: docs aliases: - /dev/table/tuning/streaming_aggregation_optimization.html --- <!-- Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this wor…

Read at source