The Human Team¶

At the core of 4Geeks AI Agents is a fundamental principle: AI works best when guided by real humans. Every AI Agent deployed on the platform is backed by a dedicated team of real human geeks at 4Geeks who handle setup, monitoring, quality assurance, and continuous improvement. This document explains who these people are, how they work, and why the human-in-the-loop approach is our key differentiator.

Why Human-in-the-Loop?¶

Fully autonomous AI agents are unreliable for real-world business operations. They hallucinate, misinterpret context, fail on edge cases, and cannot adapt to novel situations without guidance. The human-in-the-loop (HITL) model solves these problems by keeping real humans in the driver’s seat at every stage of the agent lifecycle.

The Problem with Fully Autonomous AI¶

Challenge	Description	Impact
Hallucinations	AI generates plausible but incorrect information	Customers receive wrong answers, damaging trust
Edge Cases	Unusual situations the AI was not trained on	Agent freezes, gives irrelevant responses, or makes errors
Context Drift	Conversation context degrades over long interactions	Agent loses track of the conversation and repeats itself
Tone Mismatch	AI response tone does not match brand expectations	Customer experience feels off-brand or inappropriate
Integration Failures	External tools change APIs or return unexpected data	Agent workflows break silently
Regulatory Changes	New compliance requirements emerge	Agent may violate updated regulations

The 4Geeks Solution¶

Our human team addresses every one of these challenges through proactive monitoring, real-time intervention, and continuous learning:

graph TB
    subgraph Deploy["Deployment Phase"]
        DM[Discovery Meeting]
        SC[Solution Configuration]
        INT[Manual Integration]
        TEST[Testing & Calibration]
        GO[Go-Live]
    end

    subgraph Monitor["Monitoring Phase"]
        REAL[Real-Time Monitoring]
        QA[Quality Assurance Reviews]
        PERF[Performance Analytics]
        ALERT[Alert Management]
    end

    subgraph Improve["Improvement Phase"]
        ESC[Escalation Handling]
        LEARN[Learning from Resolutions]
        OPT[Prompt Optimization]
        UPDATE[Knowledge Base Updates]
    end

    DM --> SC --> INT --> TEST --> GO
    GO --> REAL
    REAL --> QA
    REAL --> PERF
    REAL --> ALERT
    QA --> ESC
    PERF --> OPT
    ALERT --> ESC
    ESC --> LEARN
    LEARN --> OPT
    OPT --> UPDATE
    UPDATE --> REAL

Team Structure¶

The 4Geeks AI Agents team is organized into specialized roles, each responsible for a specific aspect of the agent lifecycle.

Account Managers¶

Account Managers are your primary point of contact throughout the entire engagement.

Responsibilities:

Conduct discovery meetings to understand your business needs, goals, and success metrics
Translate business requirements into technical specifications for agent configuration
Coordinate between your team and the technical implementation team
Provide regular performance reports and strategic recommendations
Manage billing, plan adjustments, and contract renewals
Serve as escalation point for any issues or concerns

When you interact with them:

Initial discovery and onboarding
Regular check-ins (weekly or bi-weekly depending on plan)
Performance reviews and strategy sessions
Any time you need to make changes to your agents or plan

Typical background:

Experience in SaaS account management or customer success
Understanding of AI and automation concepts
Industry knowledge across multiple verticals (healthcare, real estate, e-commerce, finance, etc.)
Fluent in English and Spanish; additional languages supported

Integration Engineers¶

Integration Engineers are the technical specialists who connect AI Agents to your existing software stack.

Responsibilities:

Design and implement custom integrations with your CRM, calendar, communication tools, and proprietary systems
Write custom code to ensure secure and reliable data flow between systems
Configure API connections, webhooks, and data synchronization pipelines
Test integrations end-to-end to verify data accuracy and reliability
Troubleshoot integration issues and implement fixes
Document all integration configurations for future reference

Technologies they work with:

Category	Tools & Platforms
CRMs	HubSpot, Salesforce, Pipedrive, Close CRM
Communication	Slack, Microsoft Teams, Outlook, Gmail
Calendars	Google Calendar, Microsoft Outlook Calendar
Productivity	Trello, Asana, Monday.com, Google Sheets, Airtable
Healthcare	Huli Health and other healthcare CRMs
Custom Systems	REST APIs, GraphQL endpoints, SOAP services, Webhooks
Telephony	VAPI, Twilio, custom SIP configurations
Messaging	WhatsApp Business API (Meta Cloud API)

Typical background:

Senior software engineering experience (5+ years)
Expertise in API integration and middleware development
Proficiency in Python, JavaScript/TypeScript, and SQL
Experience with Supabase, PostgreSQL, and cloud platforms
Background in building production-grade integrations

Prompt Engineers¶

Prompt Engineers craft and optimize the system prompts that define how each AI Agent behaves.

Responsibilities:

Design initial system prompts based on business requirements and brand guidelines
Test prompts extensively across hundreds of conversation scenarios
Optimize prompts for accuracy, tone, consistency, and cost-efficiency
Implement prompt versioning and A/B testing to measure improvements
Analyze conversation logs to identify prompt weaknesses
Update prompts based on escalation patterns and customer feedback
Create prompt templates for new agent types and use cases

Prompt engineering process:

graph LR
    A[Gather Requirements] --> B[Draft Initial Prompt]
    B --> C[Internal Testing<br/>100+ scenarios]
    C --> D{Quality Check}
    D -->|Pass| E[Client Review]
    D -->|Fail| B
    E --> F{Client Approval}
    F -->|Approved| G[Deploy to Production]
    F -->|Changes| B
    G --> H[Monitor Performance]
    H --> I[Identify Improvements]
    I --> B

What goes into a system prompt:

Section	Description	Example
Identity	Who the agent is and what company it represents	“You are an AI assistant for Acme Corp, a leading provider of…”
Objective	The agent’s primary goal	“Your goal is to qualify leads and schedule demos…”
Tone & Style	How the agent should communicate	“Professional but friendly. Use short sentences. Avoid jargon.”
Knowledge	Key facts the agent should always know	“We have 3 pricing tiers: Basic (\(10), Pro (\)50), Enterprise (custom)”
Guardrails	What the agent must never do	“Never offer discounts above 10%. Never share competitor pricing.”
Escalation Rules	When to transfer to a human	“If the customer asks about legal matters, transfer to a human.”
Tool Instructions	How to use available tools	“Use the check_availability tool before suggesting appointment times.”
Conversation Flow	Expected conversation structure	“Start with a greeting, ask qualifying questions, then suggest next steps.”

Typical background:

Experience in NLP, computational linguistics, or technical writing
Deep understanding of LLM behavior and prompt engineering techniques
Ability to analyze conversation data and identify patterns
Experience with multiple LLM providers (OpenAI, Anthropic, Google)
Strong attention to detail and commitment to quality

Quality Assurance Specialists¶

QA Specialists continuously monitor agent performance and ensure quality standards are met.

Responsibilities:

Review a random sample of agent conversations daily for quality, accuracy, and brand consistency
Score conversations against a quality rubric (accuracy, helpfulness, tone, completeness)
Identify patterns in customer complaints or negative feedback
Flag conversations that require human follow-up
Create quality reports with trends and recommendations
Validate that agents comply with regulatory requirements (GDPR, EU AI Act)
Test agent responses after prompt or knowledge base updates

Quality scoring rubric:

Criterion	Weight	Description
Accuracy	30%	Is the information provided correct and up-to-date?
Helpfulness	25%	Did the agent actually solve the customer’s problem?
Tone & Brand	20%	Does the response match the expected brand voice and style?
Completeness	15%	Did the agent address all parts of the customer’s question?
Efficiency	10%	Was the resolution achieved in a reasonable number of turns?

Quality targets:

Metric	Target
Overall Quality Score	≥ 90%
Accuracy Rate	≥ 95%
First-Contact Resolution	≥ 80%
Escalation Rate	≤ 10%
Customer Satisfaction	≥ 4.5/5.0

Typical background:

Experience in QA, customer service management, or operations
Strong analytical skills and attention to detail
Understanding of AI capabilities and limitations
Experience with customer service metrics and KPIs
Ability to provide constructive feedback for improvement

Operations & Monitoring Team¶

The Operations team ensures the platform infrastructure runs smoothly 24/7.

Responsibilities:

Monitor all platform services for availability, latency, and error rates
Manage LLM provider relationships and handle provider outages
Configure and maintain the Private AI Gateway routing and fallback logic
Manage credit consumption monitoring and alerting
Handle incident response and post-incident reviews
Maintain backup and disaster recovery procedures
Optimize platform performance and cost-efficiency

Monitoring coverage:

Component	Monitored Metrics	Alert Threshold
API Gateway	Response time, error rate, throughput	> 2s latency or > 1% error rate
LLM Providers	Availability, latency per provider	> 5s latency or provider outage
Edge Functions	Invocation count, error rate, duration	> 5% error rate
Database	Connection count, query latency, storage	> 80% connection pool usage
Voice (VAPI)	Call success rate, audio quality	< 95% call success rate
WhatsApp	Webhook delivery, message status	> 5% delivery failure rate
Credit System	Consumption rate, balance thresholds	Balance below 10% of plan

Typical background:

DevOps / SRE experience
Expertise in cloud infrastructure (AWS, GCP, Supabase)
Experience with monitoring tools and incident management
Understanding of LLM provider APIs and rate limiting
On-call rotation experience

Escalation Workflow¶

When an AI Agent encounters a situation it cannot handle confidently, a structured escalation process ensures the customer receives proper assistance while the agent learns from the experience.

Escalation Triggers¶

An escalation is triggered when any of the following conditions are met:

Trigger	Description	Example
Low Confidence	Agent’s confidence score falls below the configured threshold (default: 0.6)	Customer asks a question not covered in the knowledge base
Explicit Request	Customer explicitly asks to speak with a human	“I want to talk to a real person”
Sensitive Topic	Conversation involves topics the agent is configured to escalate	Legal questions, refund requests above a threshold, complaints
Repeated Failures	Agent fails to resolve the issue after multiple attempts	Customer asks the same question 3 times with different phrasing
Emotional Detection	Customer sentiment is detected as highly negative or frustrated	Customer uses angry language or expresses dissatisfaction
Compliance Requirement	Interaction requires human verification for regulatory compliance	Identity verification, consent for data processing

Escalation Process¶

sequenceDiagram
    participant AI as AI Agent
    participant HITL as HITL Orchestrator
    participant Q as Escalation Queue
    participant H as Human Specialist
    participant C as Customer
    participant KB as Knowledge Base

    AI->>AI: Detect escalation trigger
    AI->>C: "Let me connect you with a specialist who can help..."
    AI->>HITL: Submit escalation with full context
    HITL->>HITL: Assign priority (based on trigger type, customer tier, wait time)
    HITL->>Q: Add to escalation queue
    HITL->>C: Confirm transfer and set expectations
    Q->>H: Route to available specialist
    H->>H: Review conversation history + AI context + suggested action
    H->>C: Continue conversation with customer
    H->>H: Resolve the issue
    H->>KB: Log resolution as training example
    H->>HITL: Mark escalation as resolved
    HITL->>AI: Feed resolution back to agent for learning

Escalation Priority Levels¶

Priority	Response Time	Trigger Examples
Critical	< 5 minutes	Customer threatening legal action, system-wide outage affecting customers
High	< 15 minutes	Sensitive compliance issue, high-value customer escalation, emotional distress detected
Medium	< 1 hour	Complex question outside agent’s knowledge, repeated failures, refund requests
Low	< 4 hours	General inquiry the agent could not answer, non-urgent follow-up needed

Learning from Escalations¶

Every resolved escalation becomes a learning opportunity:

Resolution Capture — The human specialist’s response is recorded along with the full conversation context
Pattern Analysis — The QA team reviews escalations weekly to identify recurring patterns
Prompt Updates — If a pattern reveals a gap in the agent’s instructions, the prompt is updated
Knowledge Base Updates — If the escalation was caused by missing information, the knowledge base is updated with new documents
Guardrail Refinement — If the escalation was caused by the agent overstepping its boundaries, guardrails are tightened
Benchmark Addition — The escalation scenario is added to the testing benchmark to ensure the agent handles it correctly after improvement

Onboarding Process¶

When you become a 4Geeks AI Agents customer, here is the detailed onboarding journey:

Week 1: Discovery & Planning¶

Day	Activity	Participants	Deliverable
Day 1	Discovery meeting	Account Manager + Your Team	Requirements document
Day 2	Technical assessment	Integration Engineer	Integration architecture plan
Day 3	Agent design workshop	Prompt Engineer + Your Team	Agent specification (goals, tone, guardrails)
Day 4-5	Internal setup	4Geeks Team	Account configuration, environment setup

Week 2: Build & Integrate¶

Day	Activity	Participants	Deliverable
Day 1-2	Integration development	Integration Engineer	API connections, webhooks configured
Day 3	Prompt development	Prompt Engineer	Initial system prompt draft
Day 4	Knowledge base setup	Integration Engineer + Your Team	Documents uploaded and processed
Day 5	Internal QA	QA Specialist	Quality report with initial scores

Week 3: Testing & Calibration¶

Day	Activity	Participants	Deliverable
Day 1-2	Client testing	Your Team + Prompt Engineer	Feedback on agent behavior
Day 3	Prompt refinement	Prompt Engineer	Updated prompt incorporating feedback
Day 4	End-to-end testing	Full 4Geeks Team	Integration test report
Day 5	Go-live preparation	Account Manager + Operations	Deployment checklist completed

Week 4: Go-Live & Monitoring¶

Day	Activity	Participants	Deliverable
Day 1	Soft launch (limited traffic)	Operations Team	Initial performance data
Day 2-3	Monitoring & adjustments	Full Team	Real-time optimization
Day 4-5	Full launch	Full Team	Production deployment
Ongoing	Weekly check-ins	Account Manager	Performance reports

Support & Communication¶

Support Channels¶

Channel	Availability	Response Time	Best For
Email (support@4geeks.io)	Mon-Fri, 9 AM - 6 PM	< 24 hours	Non-urgent questions, feature requests, documentation
In-Console Chat	Mon-Fri, 9 AM - 6 PM	< 4 hours	Quick questions, configuration help, troubleshooting
Discord	Community-driven	Variable	Peer support, feature discussions, announcements
Account Manager	By appointment	Same day	Strategic discussions, plan changes, escalations
Emergency Hotline	24/7 (Enterprise)	< 30 minutes	Production outages, critical issues (Enterprise plans only)

Regular Reporting¶

Depending on your plan, you receive regular performance reports:

Plan	Report Frequency	Report Contents
Starter	Monthly	Credit usage, interaction volume, escalation rate, quality score
Growth	Bi-weekly	All Starter metrics + conversation samples, prompt recommendations, integration health
Scale	Weekly	All Growth metrics + trend analysis, optimization recommendations, strategic review
Enterprise	Real-time dashboard	All Scale metrics + custom KPIs, dedicated Slack channel, quarterly business review

What’s Included in a Performance Report¶

Section	Metrics
Volume	Total interactions, messages per channel, calls per campaign, peak usage times
Quality	Quality score, accuracy rate, first-contact resolution, customer satisfaction
Efficiency	Average response time, average turns to resolution, credit consumption per interaction
Escalations	Escalation rate, reasons for escalation, average resolution time, learning actions taken
Integration Health	API success rates, webhook delivery rates, data sync status
Recommendations	Suggested prompt changes, knowledge base additions, workflow optimizations

Continuous Improvement Cycle¶

The human-in-the-loop model creates a virtuous cycle of continuous improvement:

graph LR
    A[Deploy Agent] --> B[Monitor Interactions]
    B --> C[Identify Issues]
    C --> D{Issue Type?}
    D -->|Prompt Gap| E[Update Prompt]
    D -->|Knowledge Gap| F[Add to Knowledge Base]
    D -->|Integration Bug| G[Fix Integration]
    D -->|Workflow Gap| H[Add Workflow Step]
    E --> I[Test Changes]
    F --> I
    G --> I
    H --> I
    I --> J{Quality Improved?}
    J -->|Yes| B
    J -->|No| C

Improvement Metrics Over Time¶

As the continuous improvement cycle runs, you should see measurable improvements:

Metric	Week 1	Month 1	Month 3	Month 6
Quality Score	80-85%	88-92%	92-95%	95%+
Escalation Rate	15-20%	10-12%	6-8%	< 5%
First-Contact Resolution	65-70%	75-80%	82-88%	90%+
Average Response Time	2.5s	2.0s	1.8s	1.5s
Customer Satisfaction	3.8/5	4.⅖	4.5/5	4.7/5

Note

These metrics are representative benchmarks based on typical deployments. Actual results vary based on industry, use case complexity, and the quality of initial training data provided.

Team Availability¶

Standard Hours¶

Region	Hours (Local)	Timezone
Americas	9:00 AM - 6:00 PM	EST / CST
Europe	9:00 AM - 6:00 PM	GMT / CET
Latin America	9:00 AM - 6:00 PM	Local timezone

Extended Coverage¶

Growth Plan: Extended hours until 9:00 PM in your timezone
Scale Plan: Extended hours until 11:00 PM in your timezone
Enterprise Plan: 24/7 coverage with dedicated on-call team

Holidays & Time Off¶

The platform (AI Agents) runs 24/7/365 regardless of team availability
Human escalation response times may be longer during holidays
Enterprise customers have guaranteed holiday coverage as part of their SLA
Scheduled maintenance windows are communicated 48 hours in advance

What’s Next¶

Cloud & LLM Architecture — Technical infrastructure powering the platform
LLM Models — Detailed breakdown of the language models used
4Geeks AI Agents Overview — Return to the main documentation
FAQs — Common questions about the platform

Aún con dudas? Pregunta en Discord o explore tutoriales