The Human Team¶
At the core of 4Geeks AI Agents is a fundamental principle: AI works best when guided by real humans. Every AI Agent deployed on the platform is backed by a dedicated team of real human geeks at 4Geeks who handle setup, monitoring, quality assurance, and continuous improvement. This document explains who these people are, how they work, and why the human-in-the-loop approach is our key differentiator.
Why Human-in-the-Loop?¶
Fully autonomous AI agents are unreliable for real-world business operations. They hallucinate, misinterpret context, fail on edge cases, and cannot adapt to novel situations without guidance. The human-in-the-loop (HITL) model solves these problems by keeping real humans in the driver’s seat at every stage of the agent lifecycle.
The Problem with Fully Autonomous AI¶
| Challenge | Description | Impact |
|---|---|---|
| Hallucinations | AI generates plausible but incorrect information | Customers receive wrong answers, damaging trust |
| Edge Cases | Unusual situations the AI was not trained on | Agent freezes, gives irrelevant responses, or makes errors |
| Context Drift | Conversation context degrades over long interactions | Agent loses track of the conversation and repeats itself |
| Tone Mismatch | AI response tone does not match brand expectations | Customer experience feels off-brand or inappropriate |
| Integration Failures | External tools change APIs or return unexpected data | Agent workflows break silently |
| Regulatory Changes | New compliance requirements emerge | Agent may violate updated regulations |
The 4Geeks Solution¶
Our human team addresses every one of these challenges through proactive monitoring, real-time intervention, and continuous learning:
graph TB
subgraph Deploy["Deployment Phase"]
DM[Discovery Meeting]
SC[Solution Configuration]
INT[Manual Integration]
TEST[Testing & Calibration]
GO[Go-Live]
end
subgraph Monitor["Monitoring Phase"]
REAL[Real-Time Monitoring]
QA[Quality Assurance Reviews]
PERF[Performance Analytics]
ALERT[Alert Management]
end
subgraph Improve["Improvement Phase"]
ESC[Escalation Handling]
LEARN[Learning from Resolutions]
OPT[Prompt Optimization]
UPDATE[Knowledge Base Updates]
end
DM --> SC --> INT --> TEST --> GO
GO --> REAL
REAL --> QA
REAL --> PERF
REAL --> ALERT
QA --> ESC
PERF --> OPT
ALERT --> ESC
ESC --> LEARN
LEARN --> OPT
OPT --> UPDATE
UPDATE --> REAL Team Structure¶
The 4Geeks AI Agents team is organized into specialized roles, each responsible for a specific aspect of the agent lifecycle.
Account Managers¶
Account Managers are your primary point of contact throughout the entire engagement.
Responsibilities:
- Conduct discovery meetings to understand your business needs, goals, and success metrics
- Translate business requirements into technical specifications for agent configuration
- Coordinate between your team and the technical implementation team
- Provide regular performance reports and strategic recommendations
- Manage billing, plan adjustments, and contract renewals
- Serve as escalation point for any issues or concerns
When you interact with them:
- Initial discovery and onboarding
- Regular check-ins (weekly or bi-weekly depending on plan)
- Performance reviews and strategy sessions
- Any time you need to make changes to your agents or plan
Typical background:
- Experience in SaaS account management or customer success
- Understanding of AI and automation concepts
- Industry knowledge across multiple verticals (healthcare, real estate, e-commerce, finance, etc.)
- Fluent in English and Spanish; additional languages supported
Integration Engineers¶
Integration Engineers are the technical specialists who connect AI Agents to your existing software stack.
Responsibilities:
- Design and implement custom integrations with your CRM, calendar, communication tools, and proprietary systems
- Write custom code to ensure secure and reliable data flow between systems
- Configure API connections, webhooks, and data synchronization pipelines
- Test integrations end-to-end to verify data accuracy and reliability
- Troubleshoot integration issues and implement fixes
- Document all integration configurations for future reference
Technologies they work with:
| Category | Tools & Platforms |
|---|---|
| CRMs | HubSpot, Salesforce, Pipedrive, Close CRM |
| Communication | Slack, Microsoft Teams, Outlook, Gmail |
| Calendars | Google Calendar, Microsoft Outlook Calendar |
| Productivity | Trello, Asana, Monday.com, Google Sheets, Airtable |
| Healthcare | Huli Health and other healthcare CRMs |
| Custom Systems | REST APIs, GraphQL endpoints, SOAP services, Webhooks |
| Telephony | VAPI, Twilio, custom SIP configurations |
| Messaging | WhatsApp Business API (Meta Cloud API) |
Typical background:
- Senior software engineering experience (5+ years)
- Expertise in API integration and middleware development
- Proficiency in Python, JavaScript/TypeScript, and SQL
- Experience with Supabase, PostgreSQL, and cloud platforms
- Background in building production-grade integrations
Prompt Engineers¶
Prompt Engineers craft and optimize the system prompts that define how each AI Agent behaves.
Responsibilities:
- Design initial system prompts based on business requirements and brand guidelines
- Test prompts extensively across hundreds of conversation scenarios
- Optimize prompts for accuracy, tone, consistency, and cost-efficiency
- Implement prompt versioning and A/B testing to measure improvements
- Analyze conversation logs to identify prompt weaknesses
- Update prompts based on escalation patterns and customer feedback
- Create prompt templates for new agent types and use cases
Prompt engineering process:
graph LR
A[Gather Requirements] --> B[Draft Initial Prompt]
B --> C[Internal Testing<br/>100+ scenarios]
C --> D{Quality Check}
D -->|Pass| E[Client Review]
D -->|Fail| B
E --> F{Client Approval}
F -->|Approved| G[Deploy to Production]
F -->|Changes| B
G --> H[Monitor Performance]
H --> I[Identify Improvements]
I --> B What goes into a system prompt:
| Section | Description | Example |
|---|---|---|
| Identity | Who the agent is and what company it represents | “You are an AI assistant for Acme Corp, a leading provider of…” |
| Objective | The agent’s primary goal | “Your goal is to qualify leads and schedule demos…” |
| Tone & Style | How the agent should communicate | “Professional but friendly. Use short sentences. Avoid jargon.” |
| Knowledge | Key facts the agent should always know | “We have 3 pricing tiers: Basic (\(10), Pro (\)50), Enterprise (custom)” |
| Guardrails | What the agent must never do | “Never offer discounts above 10%. Never share competitor pricing.” |
| Escalation Rules | When to transfer to a human | “If the customer asks about legal matters, transfer to a human.” |
| Tool Instructions | How to use available tools | “Use the check_availability tool before suggesting appointment times.” |
| Conversation Flow | Expected conversation structure | “Start with a greeting, ask qualifying questions, then suggest next steps.” |
Typical background:
- Experience in NLP, computational linguistics, or technical writing
- Deep understanding of LLM behavior and prompt engineering techniques
- Ability to analyze conversation data and identify patterns
- Experience with multiple LLM providers (OpenAI, Anthropic, Google)
- Strong attention to detail and commitment to quality
Quality Assurance Specialists¶
QA Specialists continuously monitor agent performance and ensure quality standards are met.
Responsibilities:
- Review a random sample of agent conversations daily for quality, accuracy, and brand consistency
- Score conversations against a quality rubric (accuracy, helpfulness, tone, completeness)
- Identify patterns in customer complaints or negative feedback
- Flag conversations that require human follow-up
- Create quality reports with trends and recommendations
- Validate that agents comply with regulatory requirements (GDPR, EU AI Act)
- Test agent responses after prompt or knowledge base updates
Quality scoring rubric:
| Criterion | Weight | Description |
|---|---|---|
| Accuracy | 30% | Is the information provided correct and up-to-date? |
| Helpfulness | 25% | Did the agent actually solve the customer’s problem? |
| Tone & Brand | 20% | Does the response match the expected brand voice and style? |
| Completeness | 15% | Did the agent address all parts of the customer’s question? |
| Efficiency | 10% | Was the resolution achieved in a reasonable number of turns? |
Quality targets:
| Metric | Target |
|---|---|
| Overall Quality Score | ≥ 90% |
| Accuracy Rate | ≥ 95% |
| First-Contact Resolution | ≥ 80% |
| Escalation Rate | ≤ 10% |
| Customer Satisfaction | ≥ 4.5/5.0 |
Typical background:
- Experience in QA, customer service management, or operations
- Strong analytical skills and attention to detail
- Understanding of AI capabilities and limitations
- Experience with customer service metrics and KPIs
- Ability to provide constructive feedback for improvement
Operations & Monitoring Team¶
The Operations team ensures the platform infrastructure runs smoothly 24/7.
Responsibilities:
- Monitor all platform services for availability, latency, and error rates
- Manage LLM provider relationships and handle provider outages
- Configure and maintain the Private AI Gateway routing and fallback logic
- Manage credit consumption monitoring and alerting
- Handle incident response and post-incident reviews
- Maintain backup and disaster recovery procedures
- Optimize platform performance and cost-efficiency
Monitoring coverage:
| Component | Monitored Metrics | Alert Threshold |
|---|---|---|
| API Gateway | Response time, error rate, throughput | > 2s latency or > 1% error rate |
| LLM Providers | Availability, latency per provider | > 5s latency or provider outage |
| Edge Functions | Invocation count, error rate, duration | > 5% error rate |
| Database | Connection count, query latency, storage | > 80% connection pool usage |
| Voice (VAPI) | Call success rate, audio quality | < 95% call success rate |
| Webhook delivery, message status | > 5% delivery failure rate | |
| Credit System | Consumption rate, balance thresholds | Balance below 10% of plan |
Typical background:
- DevOps / SRE experience
- Expertise in cloud infrastructure (AWS, GCP, Supabase)
- Experience with monitoring tools and incident management
- Understanding of LLM provider APIs and rate limiting
- On-call rotation experience
Escalation Workflow¶
When an AI Agent encounters a situation it cannot handle confidently, a structured escalation process ensures the customer receives proper assistance while the agent learns from the experience.
Escalation Triggers¶
An escalation is triggered when any of the following conditions are met:
| Trigger | Description | Example |
|---|---|---|
| Low Confidence | Agent’s confidence score falls below the configured threshold (default: 0.6) | Customer asks a question not covered in the knowledge base |
| Explicit Request | Customer explicitly asks to speak with a human | “I want to talk to a real person” |
| Sensitive Topic | Conversation involves topics the agent is configured to escalate | Legal questions, refund requests above a threshold, complaints |
| Repeated Failures | Agent fails to resolve the issue after multiple attempts | Customer asks the same question 3 times with different phrasing |
| Emotional Detection | Customer sentiment is detected as highly negative or frustrated | Customer uses angry language or expresses dissatisfaction |
| Compliance Requirement | Interaction requires human verification for regulatory compliance | Identity verification, consent for data processing |
Escalation Process¶
sequenceDiagram
participant AI as AI Agent
participant HITL as HITL Orchestrator
participant Q as Escalation Queue
participant H as Human Specialist
participant C as Customer
participant KB as Knowledge Base
AI->>AI: Detect escalation trigger
AI->>C: "Let me connect you with a specialist who can help..."
AI->>HITL: Submit escalation with full context
HITL->>HITL: Assign priority (based on trigger type, customer tier, wait time)
HITL->>Q: Add to escalation queue
HITL->>C: Confirm transfer and set expectations
Q->>H: Route to available specialist
H->>H: Review conversation history + AI context + suggested action
H->>C: Continue conversation with customer
H->>H: Resolve the issue
H->>KB: Log resolution as training example
H->>HITL: Mark escalation as resolved
HITL->>AI: Feed resolution back to agent for learning Escalation Priority Levels¶
| Priority | Response Time | Trigger Examples |
|---|---|---|
| Critical | < 5 minutes | Customer threatening legal action, system-wide outage affecting customers |
| High | < 15 minutes | Sensitive compliance issue, high-value customer escalation, emotional distress detected |
| Medium | < 1 hour | Complex question outside agent’s knowledge, repeated failures, refund requests |
| Low | < 4 hours | General inquiry the agent could not answer, non-urgent follow-up needed |
Learning from Escalations¶
Every resolved escalation becomes a learning opportunity:
- Resolution Capture — The human specialist’s response is recorded along with the full conversation context
- Pattern Analysis — The QA team reviews escalations weekly to identify recurring patterns
- Prompt Updates — If a pattern reveals a gap in the agent’s instructions, the prompt is updated
- Knowledge Base Updates — If the escalation was caused by missing information, the knowledge base is updated with new documents
- Guardrail Refinement — If the escalation was caused by the agent overstepping its boundaries, guardrails are tightened
- Benchmark Addition — The escalation scenario is added to the testing benchmark to ensure the agent handles it correctly after improvement
Onboarding Process¶
When you become a 4Geeks AI Agents customer, here is the detailed onboarding journey:
Week 1: Discovery & Planning¶
| Day | Activity | Participants | Deliverable |
|---|---|---|---|
| Day 1 | Discovery meeting | Account Manager + Your Team | Requirements document |
| Day 2 | Technical assessment | Integration Engineer | Integration architecture plan |
| Day 3 | Agent design workshop | Prompt Engineer + Your Team | Agent specification (goals, tone, guardrails) |
| Day 4-5 | Internal setup | 4Geeks Team | Account configuration, environment setup |
Week 2: Build & Integrate¶
| Day | Activity | Participants | Deliverable |
|---|---|---|---|
| Day 1-2 | Integration development | Integration Engineer | API connections, webhooks configured |
| Day 3 | Prompt development | Prompt Engineer | Initial system prompt draft |
| Day 4 | Knowledge base setup | Integration Engineer + Your Team | Documents uploaded and processed |
| Day 5 | Internal QA | QA Specialist | Quality report with initial scores |
Week 3: Testing & Calibration¶
| Day | Activity | Participants | Deliverable |
|---|---|---|---|
| Day 1-2 | Client testing | Your Team + Prompt Engineer | Feedback on agent behavior |
| Day 3 | Prompt refinement | Prompt Engineer | Updated prompt incorporating feedback |
| Day 4 | End-to-end testing | Full 4Geeks Team | Integration test report |
| Day 5 | Go-live preparation | Account Manager + Operations | Deployment checklist completed |
Week 4: Go-Live & Monitoring¶
| Day | Activity | Participants | Deliverable |
|---|---|---|---|
| Day 1 | Soft launch (limited traffic) | Operations Team | Initial performance data |
| Day 2-3 | Monitoring & adjustments | Full Team | Real-time optimization |
| Day 4-5 | Full launch | Full Team | Production deployment |
| Ongoing | Weekly check-ins | Account Manager | Performance reports |
Support & Communication¶
Support Channels¶
| Channel | Availability | Response Time | Best For |
|---|---|---|---|
| Email (support@4geeks.io) | Mon-Fri, 9 AM - 6 PM | < 24 hours | Non-urgent questions, feature requests, documentation |
| In-Console Chat | Mon-Fri, 9 AM - 6 PM | < 4 hours | Quick questions, configuration help, troubleshooting |
| Discord | Community-driven | Variable | Peer support, feature discussions, announcements |
| Account Manager | By appointment | Same day | Strategic discussions, plan changes, escalations |
| Emergency Hotline | 24/7 (Enterprise) | < 30 minutes | Production outages, critical issues (Enterprise plans only) |
Regular Reporting¶
Depending on your plan, you receive regular performance reports:
| Plan | Report Frequency | Report Contents |
|---|---|---|
| Starter | Monthly | Credit usage, interaction volume, escalation rate, quality score |
| Growth | Bi-weekly | All Starter metrics + conversation samples, prompt recommendations, integration health |
| Scale | Weekly | All Growth metrics + trend analysis, optimization recommendations, strategic review |
| Enterprise | Real-time dashboard | All Scale metrics + custom KPIs, dedicated Slack channel, quarterly business review |
What’s Included in a Performance Report¶
| Section | Metrics |
|---|---|
| Volume | Total interactions, messages per channel, calls per campaign, peak usage times |
| Quality | Quality score, accuracy rate, first-contact resolution, customer satisfaction |
| Efficiency | Average response time, average turns to resolution, credit consumption per interaction |
| Escalations | Escalation rate, reasons for escalation, average resolution time, learning actions taken |
| Integration Health | API success rates, webhook delivery rates, data sync status |
| Recommendations | Suggested prompt changes, knowledge base additions, workflow optimizations |
Continuous Improvement Cycle¶
The human-in-the-loop model creates a virtuous cycle of continuous improvement:
graph LR
A[Deploy Agent] --> B[Monitor Interactions]
B --> C[Identify Issues]
C --> D{Issue Type?}
D -->|Prompt Gap| E[Update Prompt]
D -->|Knowledge Gap| F[Add to Knowledge Base]
D -->|Integration Bug| G[Fix Integration]
D -->|Workflow Gap| H[Add Workflow Step]
E --> I[Test Changes]
F --> I
G --> I
H --> I
I --> J{Quality Improved?}
J -->|Yes| B
J -->|No| C Improvement Metrics Over Time¶
As the continuous improvement cycle runs, you should see measurable improvements:
| Metric | Week 1 | Month 1 | Month 3 | Month 6 |
|---|---|---|---|---|
| Quality Score | 80-85% | 88-92% | 92-95% | 95%+ |
| Escalation Rate | 15-20% | 10-12% | 6-8% | < 5% |
| First-Contact Resolution | 65-70% | 75-80% | 82-88% | 90%+ |
| Average Response Time | 2.5s | 2.0s | 1.8s | 1.5s |
| Customer Satisfaction | 3.8/5 | 4.⅖ | 4.5/5 | 4.7/5 |
Note
These metrics are representative benchmarks based on typical deployments. Actual results vary based on industry, use case complexity, and the quality of initial training data provided.
Team Availability¶
Standard Hours¶
| Region | Hours (Local) | Timezone |
|---|---|---|
| Americas | 9:00 AM - 6:00 PM | EST / CST |
| Europe | 9:00 AM - 6:00 PM | GMT / CET |
| Latin America | 9:00 AM - 6:00 PM | Local timezone |
Extended Coverage¶
- Growth Plan: Extended hours until 9:00 PM in your timezone
- Scale Plan: Extended hours until 11:00 PM in your timezone
- Enterprise Plan: 24/7 coverage with dedicated on-call team
Holidays & Time Off¶
- The platform (AI Agents) runs 24/7/365 regardless of team availability
- Human escalation response times may be longer during holidays
- Enterprise customers have guaranteed holiday coverage as part of their SLA
- Scheduled maintenance windows are communicated 48 hours in advance
What’s Next¶
- Cloud & LLM Architecture — Technical infrastructure powering the platform
- LLM Models — Detailed breakdown of the language models used
- 4Geeks AI Agents Overview — Return to the main documentation
- FAQs — Common questions about the platform
Aún con dudas? Pregunta en Discord o explore tutoriales