Back to Glossary
Data Infrastructure

Latency

The time delay between a request and response in AI systems.

Definition

Latency refers to the time delay between submitting a request to an AI system and receiving a response. In document intelligence, latency affects user experience for interactive queries and throughput for batch processing. Factors affecting latency include model size, hardware, network conditions, and retrieval complexity. Production systems often trade off between latency, accuracy, and cost.

See Latency in action

Understanding the terminology is the first step. See how Conductor applies these concepts to solve real document intelligence challenges.

Request a demo