Introduction to Kura

Transform chat data into actionable insights

Your AI assistant handles thousands of conversations daily. Kura helps you understand what users actually need by automatically clustering conversations into meaningful patterns.

Kura is inspired by Anthropic’s CLIO research and designed to work at scale—from 100 conversations to millions.

Why Kura?

Manually reviewing conversations doesn’t scale. Traditional analytics miss semantic meaning. Kura bridges this gap by using machine learning to group similar conversations, revealing:

Pain points affecting your users
Feature requests hidden in unstructured data
Failure patterns before users complain
Success signals to amplify

Real-world impact

E-commerce support bot

Analyzed 50,000 weekly conversations. Discovered 35% of shipping queries clustered into 3 fixable issues. Reduced support volume by 40%.

Developer docs assistant

Found 2,000+ conversations about 5 consistently confusing APIs. Targeted improvements reduced those queries by 60%.

SaaS onboarding bot

Revealed 3 missing integration requests from clustering. Built them, increased trial conversion by 18%.

Product analytics

Identified feature requests repeated by hundreds of users in different ways. Informed roadmap prioritization.

How it works

Kura processes your conversation data through a multi-stage pipeline:

Summarize conversations

Each conversation is condensed into a concise task description using LLMs, with optional disk caching for efficiency.

Generate embeddings

Summaries are converted into vector representations that capture semantic meaning.

Cluster by similarity

Similar conversations are grouped together using K-means or other clustering algorithms.

Build hierarchy

Clusters are organized into a hierarchical structure for easy navigation and analysis.

Key features

Automatic intent discovery

Find what users actually want, not just what they say

Semantic clustering

Group by meaning, not keywords

Privacy-first design

Analyze patterns without exposing individual conversations

Multiple data sources

Load from HuggingFace datasets, Claude exports, or custom formats

Flexible checkpoints

Save progress in JSONL, Parquet, or HuggingFace dataset formats

Rich visualization

Explore clusters in terminal or web UI

When to use Kura

Kura excels when you have 100+ conversations and need to understand patterns rather than individual interactions.

Perfect for:

Product teams discovering feature requests
Customer success teams identifying support deflection opportunities
AI/ML teams evaluating model performance beyond metrics
Analytics teams understanding user segments by behavior

Not ideal for:

Real-time analysis (Kura is designed for batch processing)
Fewer than 100 conversations (manual review may be faster)
Simple keyword search (use traditional search tools)
Individual conversation sentiment analysis (Kura focuses on patterns)

Get started

Installation

Install Kura with pip, uv, or conda

Quickstart tutorial

Process your first conversations in 5 minutes

Core concepts

Learn about the analysis pipeline

API reference

Explore the complete API documentation

From zero to insights in 5 minutes

Here’s a complete example that loads conversations, processes them, and visualizes the results:

import asyncio
from kura.types import Conversation
from kura.summarisation import SummaryModel, summarise_conversations
from kura.cluster import ClusterDescriptionModel, generate_base_clusters_from_conversation_summaries
from kura.meta_cluster import MetaClusterModel, reduce_clusters_from_base_clusters
from kura.dimensionality import HDBUMAP, reduce_dimensionality_from_clusters
from kura.visualization import visualise_pipeline_results
from kura.checkpoints import JSONLCheckpointManager
from rich.console import Console

async def main():
    # Load conversations from HuggingFace
    conversations = Conversation.from_hf_dataset(
        "ivanleomk/synthetic-gemini-conversations",
        split="train"
    )
    
    # Configure pipeline
    console = Console()
    checkpoint_manager = JSONLCheckpointManager("./checkpoints", enabled=True)
    
    # Process conversations
    summaries = await summarise_conversations(
        conversations,
        model=SummaryModel(console=console),
        checkpoint_manager=checkpoint_manager
    )
    
    clusters = await generate_base_clusters_from_conversation_summaries(
        summaries,
        model=ClusterDescriptionModel(console=console),
        checkpoint_manager=checkpoint_manager
    )
    
    reduced_clusters = await reduce_clusters_from_base_clusters(
        clusters,
        model=MetaClusterModel(console=console),
        checkpoint_manager=checkpoint_manager
    )
    
    projected_clusters = await reduce_dimensionality_from_clusters(
        reduced_clusters,
        model=HDBUMAP(),
        checkpoint_manager=checkpoint_manager
    )
    
    # Visualize results
    visualise_pipeline_results(projected_clusters, style="rich")

if __name__ == "__main__":
    asyncio.run(main())

The example above includes checkpoint management, so if the process is interrupted, you can resume from where you left off.

Community and support

Kura is under active development by 567 Labs.

GitHub: 567-labs/kura
Issues: Report bugs or request features
License: MIT

Kura is in active development. APIs may change between versions. Check the release notes for breaking changes.

Get Started

Core Concepts

Guides

Examples

Introduction to Kura

Transform chat data into actionable insights

Why Kura?

Real-world impact

E-commerce support bot

Developer docs assistant

SaaS onboarding bot

Product analytics

How it works

Key features

Automatic intent discovery

Semantic clustering

Privacy-first design

Multiple data sources

Flexible checkpoints

Rich visualization

When to use Kura

Get started

Installation

Quickstart tutorial

Core concepts

API reference

From zero to insights in 5 minutes

Community and support

Get Started

Core Concepts

Guides

Examples

Documentation Index

​Transform chat data into actionable insights

​Why Kura?

​Real-world impact

E-commerce support bot

Developer docs assistant

SaaS onboarding bot

Product analytics

​How it works

​Key features

Automatic intent discovery

Semantic clustering

Privacy-first design

Multiple data sources

Flexible checkpoints

Rich visualization

​When to use Kura

​Get started

Installation

Quickstart tutorial

Core concepts

API reference

​From zero to insights in 5 minutes

​Community and support

Transform chat data into actionable insights

Why Kura?

Real-world impact

How it works

Key features

When to use Kura

Get started

From zero to insights in 5 minutes

Community and support