> ## Documentation Index
> Fetch the complete documentation index at: https://langwatch.ai/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# Evaluation by Thread

> Evaluate LLM applications by thread in LangWatch to analyze conversation-level performance in agent testing setups.

With LangWatch, you can evaluate your LLM applications by thread. This approach is useful for analyzing the performance of your LLM applications across entire conversation threads, helping you identify which threads are performing well or poorly.

To set up evaluation by thread, toggle the thread-based mapping option when creating an evaluation.

<Frame>
  <img className="block" src="https://mintcdn.com/langwatch/-v0XgW5kRdRH4vBO/images/dataset-thread-evaluation.png?fit=max&auto=format&n=-v0XgW5kRdRH4vBO&q=85&s=45fd4c575331a3c6264e2fa8bd796ac2" alt="LangWatch Evaluation by Thread" width="864" height="1420" data-path="images/dataset-thread-evaluation.png" />
</Frame>

This enables thread-based evaluation where each time a trace is evaluated, the full thread context is retrieved and passed to the evaluation function. This approach builds upon the complete conversation thread rather than individual traces.

By default, we include the trace INPUT and OUTPUT fields in the evaluation. You can add additional fields to the evaluation by including them in your dataset.
