Call Transfer Patterns for AI Agents: Warm Transfer, Cold Transfer, and Conferencing
Master the three call transfer patterns for AI voice agents: cold transfer, warm transfer, and conferencing. Covers context passing, hold music, agent whisper, and seamless handoff implementation.
The Three Transfer Patterns
When an AI agent cannot fully resolve a caller's issue, it must transfer the call to a human. How that transfer happens dramatically affects customer experience. There are three patterns, each with distinct tradeoffs:
Cold Transfer — The AI connects the caller directly to the destination. The caller may hear ringing and must re-explain their issue. Fast but frustrating.
Warm Transfer — The AI first speaks to the human agent, passes context, then bridges the caller in. The caller does not repeat themselves. Slower but much better experience.
Conference Transfer — The AI, caller, and human agent are briefly all on the same call. The AI introduces the situation, then drops off. Best for complex handoffs.
Cold Transfer Implementation
Cold transfer is the simplest pattern. The AI terminates its leg of the call and connects the caller directly to the destination:
See AI Voice Agents Handle Real Calls
Book a free demo or calculate how much you can save with AI voice automation.
flowchart TD
START["Call Transfer Patterns for AI Agents: Warm Transf…"] --> A
A["The Three Transfer Patterns"]
A --> B
B["Cold Transfer Implementation"]
B --> C
C["Warm Transfer Implementation"]
C --> D
D["Conference Transfer Three-Way Introduct…"]
D --> E
E["Context Passing Best Practices"]
E --> F
F["FAQ"]
F --> DONE["Key Takeaways"]
style START fill:#4f46e5,stroke:#4338ca,color:#fff
style DONE fill:#059669,stroke:#047857,color:#fff
from twilio.twiml.voice_response import VoiceResponse, Dial
from fastapi import FastAPI, Request
from fastapi.responses import Response
app = FastAPI()
@app.post("/cold-transfer")
async def cold_transfer(request: Request):
"""Transfer the caller directly to a human agent."""
form = await request.form()
call_sid = form.get("CallSid")
# Log the transfer context before disconnecting
await save_transfer_context(call_sid, {
"reason": "billing_dispute",
"caller_sentiment": "frustrated",
"summary": "Caller disputing charge of $49.99 from March 3",
})
response = VoiceResponse()
response.say(
"I am connecting you to a billing specialist now. "
"Please hold."
)
dial = Dial(
caller_id=form.get("From"), # Preserve caller ID
timeout=30,
action="/transfer-complete", # Called when dial ends
)
dial.number(
"+15559876543",
status_callback="/agent-answered",
status_callback_event="initiated ringing answered completed",
)
response.append(dial)
# Fallback if agent does not answer
response.say(
"I am sorry, no agent is available right now. "
"Let me take a message."
)
response.redirect("/take-message")
return Response(content=str(response), media_type="application/xml")
Warm Transfer Implementation
Warm transfer requires managing three call legs: the original call (on hold), a whisper call to the agent, and the final bridged call:
from twilio.rest import Client
import os
twilio_client = Client()
class WarmTransferManager:
"""Manages warm transfers with context passing."""
def __init__(self, twilio_client, webhook_base):
self.client = twilio_client
self.webhook_base = webhook_base
async def initiate_warm_transfer(
self, call_sid: str, agent_number: str, context: dict
):
"""Start the warm transfer process."""
# Step 1: Put the caller on hold with music
self.client.calls(call_sid).update(
twiml='<Response><Play loop="0">'
'https://api.twilio.com/cowbell.mp3'
'</Play></Response>',
)
# Step 2: Store context for the whisper
await self.store_context(call_sid, context)
# Step 3: Call the human agent with a whisper
whisper_call = self.client.calls.create(
to=agent_number,
from_=os.environ["TWILIO_NUMBER"],
url=(
f"{self.webhook_base}/agent-whisper"
f"?original_call={call_sid}"
),
status_callback=f"{self.webhook_base}/whisper-status",
)
return whisper_call.sid
async def store_context(self, call_sid: str, context: dict):
"""Store transfer context for the receiving agent."""
import json
# Use Redis for fast retrieval during the whisper
await self.redis.set(
f"transfer:{call_sid}",
json.dumps(context),
ex=300, # 5 minute TTL
)
@app.post("/agent-whisper")
async def agent_whisper(request: Request):
"""Play context to the human agent before bridging."""
form = await request.form()
original_call = form.get("original_call")
# Retrieve the transfer context
context = await get_transfer_context(original_call)
response = VoiceResponse()
# Whisper: only the agent hears this
whisper_text = (
f"Incoming transfer. Caller: {context['caller_name']}. "
f"Issue: {context['summary']}. "
f"Sentiment: {context['sentiment']}. "
f"Press 1 to accept, 2 to decline."
)
response.say(whisper_text, voice="Polly.Joanna")
gather = response.gather(
num_digits=1,
action=f"/agent-accept?original_call={original_call}",
timeout=10,
)
# Timeout fallback — decline
response.say("No response received. Transfer cancelled.")
response.hangup()
return Response(content=str(response), media_type="application/xml")
@app.post("/agent-accept")
async def agent_accept_transfer(request: Request):
"""Bridge the caller and agent after acceptance."""
form = await request.form()
digit = form.get("Digits")
original_call = form.get("original_call")
response = VoiceResponse()
if digit == "1":
# Agent accepted — bridge the calls via conference
conference_name = f"transfer-{original_call}"
# Connect the agent to the conference
dial = Dial()
dial.conference(conference_name, end_conference_on_exit=True)
response.append(dial)
# Move the original caller into the same conference
twilio_client.calls(original_call).update(
twiml=(
f'<Response><Dial><Conference>'
f'{conference_name}'
f'</Conference></Dial></Response>'
),
)
else:
response.say("Transfer declined.")
response.hangup()
# Return caller to AI agent
twilio_client.calls(original_call).update(
url=f"{os.environ['WEBHOOK_BASE']}/return-to-ai",
method="POST",
)
return Response(content=str(response), media_type="application/xml")
Conference Transfer (Three-Way Introduction)
The conference pattern keeps all three parties briefly on the same call:
class ConferenceTransferManager:
"""Three-way conference transfer with AI introduction."""
async def initiate_conference_transfer(
self, call_sid: str, agent_number: str, context: dict
):
"""Set up a three-way call for handoff."""
conference_name = f"handoff-{call_sid}"
# Move caller into a conference (from hold)
twilio_client.calls(call_sid).update(
twiml=(
f'<Response>'
f'<Say>I am bringing in a specialist now.</Say>'
f'<Dial><Conference>{conference_name}'
f'</Conference></Dial></Response>'
),
)
# Add the AI agent to the conference (for introduction)
ai_participant = twilio_client.conferences(
conference_name
).participants.create(
from_=os.environ["TWILIO_NUMBER"],
to="sip:ai-intro@yourdomain.com",
early_media=True,
)
# Add the human agent
human_participant = twilio_client.conferences(
conference_name
).participants.create(
from_=os.environ["TWILIO_NUMBER"],
to=agent_number,
early_media=True,
)
return conference_name
async def ai_introduction(self, conference_name, context):
"""AI speaks the introduction then leaves."""
intro_text = (
f"Hello everyone. I have {context['caller_name']} on "
f"the line who needs help with {context['summary']}. "
f"I will leave you to it."
)
# Speak the introduction via TTS
await self.speak_in_conference(conference_name, intro_text)
# Remove the AI from the conference
await asyncio.sleep(2) # Brief pause after speaking
await self.remove_ai_from_conference(conference_name)
Context Passing Best Practices
The value of a warm transfer is the context. Structure it well:
from dataclasses import dataclass
from typing import Optional
@dataclass
class TransferContext:
"""Structured context passed during call transfer."""
caller_name: str
caller_number: str
call_duration_seconds: int
issue_summary: str
sentiment: str # positive, neutral, frustrated, angry
intent: str
actions_taken: list[str]
information_collected: dict
previous_transfers: int
preferred_language: str = "en"
priority: str = "normal"
notes: Optional[str] = None
def to_whisper_script(self) -> str:
"""Generate a concise whisper message for the agent."""
actions = ", ".join(self.actions_taken) if self.actions_taken else "none yet"
return (
f"Caller: {self.caller_name}. "
f"Issue: {self.issue_summary}. "
f"Mood: {self.sentiment}. "
f"Already tried: {actions}. "
f"Priority: {self.priority}."
)
def to_screen_pop(self) -> dict:
"""Generate data for the agent's screen pop display."""
return {
"caller": self.caller_name,
"phone": self.caller_number,
"summary": self.issue_summary,
"sentiment_emoji": {
"positive": "green",
"neutral": "yellow",
"frustrated": "orange",
"angry": "red",
}.get(self.sentiment, "yellow"),
"history": self.actions_taken,
"collected_data": self.information_collected,
"transfer_count": self.previous_transfers,
}
FAQ
When should I use warm transfer versus cold transfer?
Use cold transfer for simple routing where context is not critical — e.g., transferring to a general queue. Use warm transfer when the caller has already explained their issue to the AI and repeating it would cause frustration — especially for complaints, complex issues, or VIP callers. The extra 10-15 seconds for a warm transfer pays for itself in customer satisfaction.
How do I handle the case where the human agent does not answer?
Implement a timeout with fallback logic. After 20-30 seconds of ringing, cancel the transfer and either return the caller to the AI agent, offer to take a message, or try an alternative agent. Always inform the caller what is happening: "Our specialist is not available right now. Would you like me to take a message, or would you prefer to try again later?"
How do I pass context to the agent's screen in addition to the whisper?
Use a parallel HTTP notification. When you initiate the warm transfer, simultaneously POST the TransferContext data to your contact center's API or the agent's desktop application. Most modern contact center platforms (Five9, Genesys, Talkdesk) have APIs for screen pops. The whisper provides audio context, and the screen pop provides visual context — both arrive before the caller is bridged in.
#CallTransfer #WarmTransfer #VoiceAI #Telephony #AgentHandoff #ContactCenter #AgenticAI #LearnAI #AIEngineering
Written by
CallSphere Team
Expert insights on AI voice agents and customer communication automation.
Try CallSphere AI Voice Agents
See how AI voice agents work for your industry. Live demo available -- no signup required.