Saltar a contenido

🤖 Explicar con IA

The Human Team

At the core of 4Geeks AI Agents is a fundamental principle: AI works best when guided by real humans. Every AI Agent deployed on the platform is backed by a dedicated team of real human geeks at 4Geeks who handle setup, monitoring, quality assurance, and continuous improvement. This document explains who these people are, how they work, and why the human-in-the-loop approach is our key differentiator.

Why Human-in-the-Loop?

Fully autonomous AI agents are unreliable for real-world business operations. They hallucinate, misinterpret context, fail on edge cases, and cannot adapt to novel situations without guidance. The human-in-the-loop (HITL) model solves these problems by keeping real humans in the driver’s seat at every stage of the agent lifecycle.

The Problem with Fully Autonomous AI

Challenge Description Impact
Hallucinations AI generates plausible but incorrect information Customers receive wrong answers, damaging trust
Edge Cases Unusual situations the AI was not trained on Agent freezes, gives irrelevant responses, or makes errors
Context Drift Conversation context degrades over long interactions Agent loses track of the conversation and repeats itself
Tone Mismatch AI response tone does not match brand expectations Customer experience feels off-brand or inappropriate
Integration Failures External tools change APIs or return unexpected data Agent workflows break silently
Regulatory Changes New compliance requirements emerge Agent may violate updated regulations

The 4Geeks Solution

Our human team addresses every one of these challenges through proactive monitoring, real-time intervention, and continuous learning:

graph TB
    subgraph Deploy["Deployment Phase"]
        DM[Discovery Meeting]
        SC[Solution Configuration]
        INT[Manual Integration]
        TEST[Testing & Calibration]
        GO[Go-Live]
    end

    subgraph Monitor["Monitoring Phase"]
        REAL[Real-Time Monitoring]
        QA[Quality Assurance Reviews]
        PERF[Performance Analytics]
        ALERT[Alert Management]
    end

    subgraph Improve["Improvement Phase"]
        ESC[Escalation Handling]
        LEARN[Learning from Resolutions]
        OPT[Prompt Optimization]
        UPDATE[Knowledge Base Updates]
    end

    DM --> SC --> INT --> TEST --> GO
    GO --> REAL
    REAL --> QA
    REAL --> PERF
    REAL --> ALERT
    QA --> ESC
    PERF --> OPT
    ALERT --> ESC
    ESC --> LEARN
    LEARN --> OPT
    OPT --> UPDATE
    UPDATE --> REAL

Team Structure

The 4Geeks AI Agents team is organized into specialized roles, each responsible for a specific aspect of the agent lifecycle.

Account Managers

Account Managers are your primary point of contact throughout the entire engagement.

Responsibilities:

  • Conduct discovery meetings to understand your business needs, goals, and success metrics
  • Translate business requirements into technical specifications for agent configuration
  • Coordinate between your team and the technical implementation team
  • Provide regular performance reports and strategic recommendations
  • Manage billing, plan adjustments, and contract renewals
  • Serve as escalation point for any issues or concerns

When you interact with them:

  • Initial discovery and onboarding
  • Regular check-ins (weekly or bi-weekly depending on plan)
  • Performance reviews and strategy sessions
  • Any time you need to make changes to your agents or plan

Typical background:

  • Experience in SaaS account management or customer success
  • Understanding of AI and automation concepts
  • Industry knowledge across multiple verticals (healthcare, real estate, e-commerce, finance, etc.)
  • Fluent in English and Spanish; additional languages supported

Integration Engineers

Integration Engineers are the technical specialists who connect AI Agents to your existing software stack.

Responsibilities:

  • Design and implement custom integrations with your CRM, calendar, communication tools, and proprietary systems
  • Write custom code to ensure secure and reliable data flow between systems
  • Configure API connections, webhooks, and data synchronization pipelines
  • Test integrations end-to-end to verify data accuracy and reliability
  • Troubleshoot integration issues and implement fixes
  • Document all integration configurations for future reference

Technologies they work with:

Category Tools & Platforms
CRMs HubSpot, Salesforce, Pipedrive, Close CRM
Communication Slack, Microsoft Teams, Outlook, Gmail
Calendars Google Calendar, Microsoft Outlook Calendar
Productivity Trello, Asana, Monday.com, Google Sheets, Airtable
Healthcare Huli Health and other healthcare CRMs
Custom Systems REST APIs, GraphQL endpoints, SOAP services, Webhooks
Telephony VAPI, Twilio, custom SIP configurations
Messaging WhatsApp Business API (Meta Cloud API)

Typical background:

  • Senior software engineering experience (5+ years)
  • Expertise in API integration and middleware development
  • Proficiency in Python, JavaScript/TypeScript, and SQL
  • Experience with Supabase, PostgreSQL, and cloud platforms
  • Background in building production-grade integrations

Prompt Engineers

Prompt Engineers craft and optimize the system prompts that define how each AI Agent behaves.

Responsibilities:

  • Design initial system prompts based on business requirements and brand guidelines
  • Test prompts extensively across hundreds of conversation scenarios
  • Optimize prompts for accuracy, tone, consistency, and cost-efficiency
  • Implement prompt versioning and A/B testing to measure improvements
  • Analyze conversation logs to identify prompt weaknesses
  • Update prompts based on escalation patterns and customer feedback
  • Create prompt templates for new agent types and use cases

Prompt engineering process:

graph LR
    A[Gather Requirements] --> B[Draft Initial Prompt]
    B --> C[Internal Testing<br/>100+ scenarios]
    C --> D{Quality Check}
    D -->|Pass| E[Client Review]
    D -->|Fail| B
    E --> F{Client Approval}
    F -->|Approved| G[Deploy to Production]
    F -->|Changes| B
    G --> H[Monitor Performance]
    H --> I[Identify Improvements]
    I --> B

What goes into a system prompt:

Section Description Example
Identity Who the agent is and what company it represents “You are an AI assistant for Acme Corp, a leading provider of…”
Objective The agent’s primary goal “Your goal is to qualify leads and schedule demos…”
Tone & Style How the agent should communicate “Professional but friendly. Use short sentences. Avoid jargon.”
Knowledge Key facts the agent should always know “We have 3 pricing tiers: Basic (\(10), Pro (\)50), Enterprise (custom)”
Guardrails What the agent must never do “Never offer discounts above 10%. Never share competitor pricing.”
Escalation Rules When to transfer to a human “If the customer asks about legal matters, transfer to a human.”
Tool Instructions How to use available tools “Use the check_availability tool before suggesting appointment times.”
Conversation Flow Expected conversation structure “Start with a greeting, ask qualifying questions, then suggest next steps.”

Typical background:

  • Experience in NLP, computational linguistics, or technical writing
  • Deep understanding of LLM behavior and prompt engineering techniques
  • Ability to analyze conversation data and identify patterns
  • Experience with multiple LLM providers (OpenAI, Anthropic, Google)
  • Strong attention to detail and commitment to quality

Quality Assurance Specialists

QA Specialists continuously monitor agent performance and ensure quality standards are met.

Responsibilities:

  • Review a random sample of agent conversations daily for quality, accuracy, and brand consistency
  • Score conversations against a quality rubric (accuracy, helpfulness, tone, completeness)
  • Identify patterns in customer complaints or negative feedback
  • Flag conversations that require human follow-up
  • Create quality reports with trends and recommendations
  • Validate that agents comply with regulatory requirements (GDPR, EU AI Act)
  • Test agent responses after prompt or knowledge base updates

Quality scoring rubric:

Criterion Weight Description
Accuracy 30% Is the information provided correct and up-to-date?
Helpfulness 25% Did the agent actually solve the customer’s problem?
Tone & Brand 20% Does the response match the expected brand voice and style?
Completeness 15% Did the agent address all parts of the customer’s question?
Efficiency 10% Was the resolution achieved in a reasonable number of turns?

Quality targets:

Metric Target
Overall Quality Score ≥ 90%
Accuracy Rate ≥ 95%
First-Contact Resolution ≥ 80%
Escalation Rate ≤ 10%
Customer Satisfaction ≥ 4.5/5.0

Typical background:

  • Experience in QA, customer service management, or operations
  • Strong analytical skills and attention to detail
  • Understanding of AI capabilities and limitations
  • Experience with customer service metrics and KPIs
  • Ability to provide constructive feedback for improvement

Operations & Monitoring Team

The Operations team ensures the platform infrastructure runs smoothly 24/7.

Responsibilities:

  • Monitor all platform services for availability, latency, and error rates
  • Manage LLM provider relationships and handle provider outages
  • Configure and maintain the Private AI Gateway routing and fallback logic
  • Manage credit consumption monitoring and alerting
  • Handle incident response and post-incident reviews
  • Maintain backup and disaster recovery procedures
  • Optimize platform performance and cost-efficiency

Monitoring coverage:

Component Monitored Metrics Alert Threshold
API Gateway Response time, error rate, throughput > 2s latency or > 1% error rate
LLM Providers Availability, latency per provider > 5s latency or provider outage
Edge Functions Invocation count, error rate, duration > 5% error rate
Database Connection count, query latency, storage > 80% connection pool usage
Voice (VAPI) Call success rate, audio quality < 95% call success rate
WhatsApp Webhook delivery, message status > 5% delivery failure rate
Credit System Consumption rate, balance thresholds Balance below 10% of plan

Typical background:

  • DevOps / SRE experience
  • Expertise in cloud infrastructure (AWS, GCP, Supabase)
  • Experience with monitoring tools and incident management
  • Understanding of LLM provider APIs and rate limiting
  • On-call rotation experience

Escalation Workflow

When an AI Agent encounters a situation it cannot handle confidently, a structured escalation process ensures the customer receives proper assistance while the agent learns from the experience.

Escalation Triggers

An escalation is triggered when any of the following conditions are met:

Trigger Description Example
Low Confidence Agent’s confidence score falls below the configured threshold (default: 0.6) Customer asks a question not covered in the knowledge base
Explicit Request Customer explicitly asks to speak with a human “I want to talk to a real person”
Sensitive Topic Conversation involves topics the agent is configured to escalate Legal questions, refund requests above a threshold, complaints
Repeated Failures Agent fails to resolve the issue after multiple attempts Customer asks the same question 3 times with different phrasing
Emotional Detection Customer sentiment is detected as highly negative or frustrated Customer uses angry language or expresses dissatisfaction
Compliance Requirement Interaction requires human verification for regulatory compliance Identity verification, consent for data processing

Escalation Process

sequenceDiagram
    participant AI as AI Agent
    participant HITL as HITL Orchestrator
    participant Q as Escalation Queue
    participant H as Human Specialist
    participant C as Customer
    participant KB as Knowledge Base

    AI->>AI: Detect escalation trigger
    AI->>C: "Let me connect you with a specialist who can help..."
    AI->>HITL: Submit escalation with full context
    HITL->>HITL: Assign priority (based on trigger type, customer tier, wait time)
    HITL->>Q: Add to escalation queue
    HITL->>C: Confirm transfer and set expectations
    Q->>H: Route to available specialist
    H->>H: Review conversation history + AI context + suggested action
    H->>C: Continue conversation with customer
    H->>H: Resolve the issue
    H->>KB: Log resolution as training example
    H->>HITL: Mark escalation as resolved
    HITL->>AI: Feed resolution back to agent for learning

Escalation Priority Levels

Priority Response Time Trigger Examples
Critical < 5 minutes Customer threatening legal action, system-wide outage affecting customers
High < 15 minutes Sensitive compliance issue, high-value customer escalation, emotional distress detected
Medium < 1 hour Complex question outside agent’s knowledge, repeated failures, refund requests
Low < 4 hours General inquiry the agent could not answer, non-urgent follow-up needed

Learning from Escalations

Every resolved escalation becomes a learning opportunity:

  1. Resolution Capture — The human specialist’s response is recorded along with the full conversation context
  2. Pattern Analysis — The QA team reviews escalations weekly to identify recurring patterns
  3. Prompt Updates — If a pattern reveals a gap in the agent’s instructions, the prompt is updated
  4. Knowledge Base Updates — If the escalation was caused by missing information, the knowledge base is updated with new documents
  5. Guardrail Refinement — If the escalation was caused by the agent overstepping its boundaries, guardrails are tightened
  6. Benchmark Addition — The escalation scenario is added to the testing benchmark to ensure the agent handles it correctly after improvement

Onboarding Process

When you become a 4Geeks AI Agents customer, here is the detailed onboarding journey:

Week 1: Discovery & Planning

Day Activity Participants Deliverable
Day 1 Discovery meeting Account Manager + Your Team Requirements document
Day 2 Technical assessment Integration Engineer Integration architecture plan
Day 3 Agent design workshop Prompt Engineer + Your Team Agent specification (goals, tone, guardrails)
Day 4-5 Internal setup 4Geeks Team Account configuration, environment setup

Week 2: Build & Integrate

Day Activity Participants Deliverable
Day 1-2 Integration development Integration Engineer API connections, webhooks configured
Day 3 Prompt development Prompt Engineer Initial system prompt draft
Day 4 Knowledge base setup Integration Engineer + Your Team Documents uploaded and processed
Day 5 Internal QA QA Specialist Quality report with initial scores

Week 3: Testing & Calibration

Day Activity Participants Deliverable
Day 1-2 Client testing Your Team + Prompt Engineer Feedback on agent behavior
Day 3 Prompt refinement Prompt Engineer Updated prompt incorporating feedback
Day 4 End-to-end testing Full 4Geeks Team Integration test report
Day 5 Go-live preparation Account Manager + Operations Deployment checklist completed

Week 4: Go-Live & Monitoring

Day Activity Participants Deliverable
Day 1 Soft launch (limited traffic) Operations Team Initial performance data
Day 2-3 Monitoring & adjustments Full Team Real-time optimization
Day 4-5 Full launch Full Team Production deployment
Ongoing Weekly check-ins Account Manager Performance reports

Support & Communication

Support Channels

Channel Availability Response Time Best For
Email (support@4geeks.io) Mon-Fri, 9 AM - 6 PM < 24 hours Non-urgent questions, feature requests, documentation
In-Console Chat Mon-Fri, 9 AM - 6 PM < 4 hours Quick questions, configuration help, troubleshooting
Discord Community-driven Variable Peer support, feature discussions, announcements
Account Manager By appointment Same day Strategic discussions, plan changes, escalations
Emergency Hotline 24/7 (Enterprise) < 30 minutes Production outages, critical issues (Enterprise plans only)

Regular Reporting

Depending on your plan, you receive regular performance reports:

Plan Report Frequency Report Contents
Starter Monthly Credit usage, interaction volume, escalation rate, quality score
Growth Bi-weekly All Starter metrics + conversation samples, prompt recommendations, integration health
Scale Weekly All Growth metrics + trend analysis, optimization recommendations, strategic review
Enterprise Real-time dashboard All Scale metrics + custom KPIs, dedicated Slack channel, quarterly business review

What’s Included in a Performance Report

Section Metrics
Volume Total interactions, messages per channel, calls per campaign, peak usage times
Quality Quality score, accuracy rate, first-contact resolution, customer satisfaction
Efficiency Average response time, average turns to resolution, credit consumption per interaction
Escalations Escalation rate, reasons for escalation, average resolution time, learning actions taken
Integration Health API success rates, webhook delivery rates, data sync status
Recommendations Suggested prompt changes, knowledge base additions, workflow optimizations

Continuous Improvement Cycle

The human-in-the-loop model creates a virtuous cycle of continuous improvement:

graph LR
    A[Deploy Agent] --> B[Monitor Interactions]
    B --> C[Identify Issues]
    C --> D{Issue Type?}
    D -->|Prompt Gap| E[Update Prompt]
    D -->|Knowledge Gap| F[Add to Knowledge Base]
    D -->|Integration Bug| G[Fix Integration]
    D -->|Workflow Gap| H[Add Workflow Step]
    E --> I[Test Changes]
    F --> I
    G --> I
    H --> I
    I --> J{Quality Improved?}
    J -->|Yes| B
    J -->|No| C

Improvement Metrics Over Time

As the continuous improvement cycle runs, you should see measurable improvements:

Metric Week 1 Month 1 Month 3 Month 6
Quality Score 80-85% 88-92% 92-95% 95%+
Escalation Rate 15-20% 10-12% 6-8% < 5%
First-Contact Resolution 65-70% 75-80% 82-88% 90%+
Average Response Time 2.5s 2.0s 1.8s 1.5s
Customer Satisfaction 3.8/5 4.⅖ 4.5/5 4.7/5

Note

These metrics are representative benchmarks based on typical deployments. Actual results vary based on industry, use case complexity, and the quality of initial training data provided.

Team Availability

Standard Hours

Region Hours (Local) Timezone
Americas 9:00 AM - 6:00 PM EST / CST
Europe 9:00 AM - 6:00 PM GMT / CET
Latin America 9:00 AM - 6:00 PM Local timezone

Extended Coverage

  • Growth Plan: Extended hours until 9:00 PM in your timezone
  • Scale Plan: Extended hours until 11:00 PM in your timezone
  • Enterprise Plan: 24/7 coverage with dedicated on-call team

Holidays & Time Off

  • The platform (AI Agents) runs 24/7/365 regardless of team availability
  • Human escalation response times may be longer during holidays
  • Enterprise customers have guaranteed holiday coverage as part of their SLA
  • Scheduled maintenance windows are communicated 48 hours in advance

What’s Next


Aún con dudas? Pregunta en Discord o explore tutoriales