Overview · IT Digital OCC
IRSL Command Center
Executive Summary
Monthly health snapshot for IRSL operations, security, project, KPI, and team readiness — combining Cloudflare real-time telemetry with manual operational reporting.
71% Healthy · 3 Active incidents · 6 Products
Source status
Every module in the IRSL Command Center, labeled by its current data source

| Module | Data type | Scope | Source |
|---|---|---|---|
| Overall Team Health | Mixed Source | Ops + incidents + projects + KPI | Combined |
| Cloudflare API Status | Real API | Worker proxy reachable | Cloudflare Worker |
| Service Stability | Real API | 5xx error rate | Cloudflare HTTP Analytics |
| Security Posture | Real API | Latest security events & WAF actions | Cloudflare Security Events |
| Incident Status | Manual Update | Total / major / RCA | Manual Incident Update |
| Project Health | Manual Update | 6-project portfolio | Manual Project Update |
| KPI Performance | Mixed Source | Reliability · Security · Delivery · Cost | Cloudflare + Manual + Coming Soon |
| Team Readiness | Manual Structure | Infra Cloud · DevOps · Operation Support | IRSL Operating Model |
| AWS Billing | Coming Soon API | MTD, forecast, optimization | AWS Cost Explorer API |
| Monthly Review Readiness | Mixed Source | Ready-to-present sections | Dashboard + Manual |

Executive summary — this month vs. last month
High-level health, delivery and security signals for management review

| Metric | Value | Detail | Source |
|---|---|---|---|
| Cloudflare Requests | 412M | 30-day total across edges | Cloudflare HTTP Analytics |
| 5xx Responses | 0.42% | Service stability · target ≤ 1% | Cloudflare HTTP Analytics |
| Latest Security Events | 1,284 | From latest Cloudflare fetch | Cloudflare Security Events |
| WAF Events | 962 | Block / Challenge / Managed | Cloudflare Security Events |
| Service Stability KPI | Pass | 5xx rate ≤ 1% · calculated from Cloudflare | Calculated · Cloudflare API |
| Cybersecurity Posture KPI | Watch | Critical-path coverage from Security Events | Calculated · Cloudflare Security Events |
| WAF Visibility KPI | Pass | Data completeness on 8 required fields | Calculated · Cloudflare Security Events |
| Total Incidents | 7 | 1 SEV1 · 2 SEV2 · 4 SEV3/4 | Manual Incident Update |
| MTTR | 24 min | Target ≤ 30 min · Jira coming later | Manual Incident Update |
| Projects Health | 5 / 6 On-track | 1 at-risk · PrakanTidloh API EOL | Manual Project Update |
| Cloud Cost Optimization | 3.1% | Target ≥ 2% · AWS Cost Explorer coming later | Manual Billing Update |
| AI-assisted Coding Coverage | 48% | Target ≥ 50% | Manual AI Usage Report |
| Monthly Uptime | n/a | Awaiting Uptime Kuma API | Uptime Kuma API (coming soon) |
| Uptime Kuma Summary | n/a | Up / Down monitor counts pending | Uptime Kuma API (coming soon) |
| SEO / PageSpeed Score | n/a | Per-domain Core Web Vitals pending | Google PageSpeed Insights API (coming soon) |
| Bot Protection | n/a | Likely-automated traffic share pending | Cloudflare Bot Analytics (coming soon) |
| DDoS Monitoring | n/a | L3/L7 attack timeline pending | Cloudflare DDoS Analytics (coming soon) |

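
The Service Stability KPI is described as a 5xx rate checked against a 1% ceiling. The following is a minimal sketch of that check, assuming the rate is simply 5xx responses divided by total requests. The function name `service_stability_kpi` and the raw 5xx count are illustrative and do not come from the dashboard itself; the "Watch" state used by other KPIs is not modelled because its threshold is not stated.

```python
def service_stability_kpi(total_requests: int, responses_5xx: int,
                          max_rate_pct: float = 1.0) -> tuple[float, str]:
    """Return the 5xx rate in percent and a Pass/Fail label."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    rate_pct = 100.0 * responses_5xx / total_requests
    status = "Pass" if rate_pct <= max_rate_pct else "Fail"
    return rate_pct, status


# Illustrative count (an assumption): 412M requests at the reported 0.42%
# corresponds to roughly 1.73M 5xx responses.
rate, status = service_stability_kpi(412_000_000, 1_730_400)
print(f"{rate:.2f}% -> {status}")
```

With these inputs the rate stays under the 1% target, which is consistent with the "Pass" shown for the KPI.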
Data Integration Coverage
Honest view of which modules are powered by real APIs vs. manual or coming soon
Cloudflare API Coverage
100%
Domain selector, traffic analytics, cybersecurity overview, and WAF events are using real Cloudflare API data.
Overall Dashboard Real Data Coverage
40%
Cloudflare modules are real API data. Uptime Kuma, PageSpeed, Projects, KPI manual sources, and Incidents are still manual or coming soon.
Manual / Coming Soon Coverage
60%
Remaining modules will be integrated in later phases.
Source labels shown per module: Cloudflare API (2); Cloudflare Security Events (2); Partially real, calculated from Cloudflare API (1); Coming soon API (2); Manual Update (2); Manual Update / Partial API (1); Manual Structure (1).
Key performance indicators
Targets for the Infra Resilient Team — rolling 30 days.

| KPI | Value |
|---|---|
| Cycle Time (Dev to Deploy) | 0.7 d |
| MTTR | 24 min |
| Uptime | 99.94% |
| Cloud Cost Optimization | 3.4% |
| AI-assisted Coding Coverage | 46% |

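
The executive summary quotes targets for three of these KPIs (MTTR ≤ 30 min, Cloud Cost Optimization ≥ 2%, AI-assisted Coding Coverage ≥ 50%). The following is a small sketch of how the rolling 30-day values could be compared against them. The `Kpi` class and its `higher_is_better` flag are hypothetical names introduced for illustration. Cycle Time and Uptime are left out because no target is stated for them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Kpi:
    name: str
    value: float
    target: float
    higher_is_better: bool  # False for "lower is better" KPIs such as MTTR

    def meets_target(self) -> bool:
        if self.higher_is_better:
            return self.value >= self.target
        return self.value <= self.target


# Values from the rolling 30-day KPI list; targets from the summary cards.
kpis = [
    Kpi("MTTR (min)", 24, 30, higher_is_better=False),
    Kpi("Cloud Cost Optimization (%)", 3.4, 2, higher_is_better=True),
    Kpi("AI-assisted Coding Coverage (%)", 46, 50, higher_is_better=True),
]

results = {k.name: k.meets_target() for k in kpis}
for name, ok in results.items():
    print(f"{name}: {'on target' if ok else 'below target'}")
```

Only AI-assisted Coding Coverage misses its target here, which lines up with the "Copilot adoption below 50% target" item tracked elsewhere on the page.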

| Service | Status | Squad | Region | Latency | Uptime |
|---|---|---|---|---|---|
| Edge / CDN (CloudFront) | Operational | Infra Cloud | ap-southeast-1 | 22 ms | 99.99% |
| API Gateway | Operational | Infra Cloud | ap-southeast-1 | 78 ms | 99.98% |
| Auth / OIDC | Operational | Operation Support | global | 61 ms | 99.96% |
| Loan Origination DB | Degraded | Infra Cloud | ap-southeast-1 | 212 ms | 99.71% |
| CI/CD Pipeline (GitHub → ArgoCD) | Operational | DevOps | global | n/a | 99.94% |
| AI Gateway | Operational | DevOps | ap-southeast-1 | 340 ms | 99.92% |
| Partner Tenancy Router | Outage | Infra Cloud | ap-southeast-1 | n/a | 97.1% |

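
Uptime percentages are easier to compare when converted into minutes of downtime. The sketch below does that conversion, assuming a 30-day measurement window; the window for the per-service figures is not stated on the page, so treat it as an assumption. The function name `downtime_minutes` is illustrative.

```python
def downtime_minutes(uptime_pct: float, window_days: int = 30) -> float:
    """Convert an uptime percentage into minutes of downtime over the window."""
    if not 0.0 <= uptime_pct <= 100.0:
        raise ValueError("uptime_pct must be between 0 and 100")
    window_minutes = window_days * 24 * 60
    return window_minutes * (100.0 - uptime_pct) / 100.0


# The two services that are not fully operational in the list.
for service, uptime in [("Loan Origination DB", 99.71),
                        ("Partner Tenancy Router", 97.1)]:
    print(f"{service}: about {downtime_minutes(uptime):.0f} min of downtime")
```

Under the 30-day assumption, 99.71% works out to roughly two hours of downtime and 97.1% to roughly 21 hours.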

| Task | Owner | Squad | Due |
|---|---|---|---|
| Failover test for Partner Tenancy Router | R. Boonchai | Infra Cloud | Today 16:00 |
| Cut over PrakanTidloh to new premium API | S. Achara | Infra Cloud | Jul 08 |
| Roll out Copilot to remaining 18 engineers | N. Chayanan | DevOps | Jul 05 |
| Tune Tidlor read-replica lag alerts | K. Pongsak | DevOps | Jun 28 |
| Refresh on-call runbook for Tidjai | P. Wassana | Operation Support | Jun 30 |


| ID | Title | Product | Squad | Sev | Status | Owner |
|---|---|---|---|---|---|---|
| INC-20512 | Partner Tenancy Router returning 502s | Motor White Label | Infra Cloud | SEV1 | Investigating | R. Boonchai |
| INC-20509 | Loan calculator latency spike on Tidlor | Tidlor Website | DevOps | SEV2 | Identified | K. Pongsak |
| INC-20505 | PrakanTidloh premium API timeouts | PrakanTidloh Website | Infra Cloud | SEV2 | Monitoring | S. Achara |
| INC-20498 | AI Gateway rate-limit misconfig | AI Digital | DevOps | SEV3 | Resolved | N. Chayanan |
| INC-20494 | Tidjai non-login flow 500 on submit | Tidjai App / Non-login | Operation Support | SEV3 | Resolved | P. Wassana |

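
The incident rows above can be tallied by severity and filtered for open items. This is a minimal sketch that assumes any status other than "Resolved" counts as active; the dashboard does not state that rule. The tuples copy only the ID, severity, and status columns from the table.

```python
from collections import Counter

# (ID, severity, status) copied from the incident table.
incidents = [
    ("INC-20512", "SEV1", "Investigating"),
    ("INC-20509", "SEV2", "Identified"),
    ("INC-20505", "SEV2", "Monitoring"),
    ("INC-20498", "SEV3", "Resolved"),
    ("INC-20494", "SEV3", "Resolved"),
]

by_severity = Counter(sev for _, sev, _ in incidents)
# Assumption: every status except "Resolved" is treated as active.
active = [inc_id for inc_id, _, status in incidents if status != "Resolved"]

print(dict(by_severity))
print(active)
```

Under that assumption the listed rows yield three active incidents, the same number as the "3 Active incidents" figure at the top of the page. The table lists five incidents, while the monthly total reported elsewhere is seven.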

| Risk | Product | Owner | Due |
|---|---|---|---|
| Partner tenancy isolation untested at peak load | Motor White Label | R. Boonchai | Jul 03 |
| Legacy premium API EOL in 21 days | PrakanTidloh Website | S. Achara | Jul 17 |
| Copilot adoption below 50% target | AI Digital | N. Chayanan | Jul 10 |
| Loan calc DB read replica lag | Tidlor Website | K. Pongsak | Jul 05 |
| Captcha provider rate-limit changes | Tidjai App / Non-login | P. Wassana | Jul 14 |

Team workload
Capacity across the three squads

| Squad | Members | Open tickets |
|---|---|---|
| Infra Cloud | 4 | 20 |
| DevOps | 4 | 18 |
| Operation Support | 3 | 43 |

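
One rough way to compare squad load from the figures above is open tickets per member. This is only a sketch: it is not the dashboard's "average capacity utilisation" metric, whose values and formula are not shown, and the function name `tickets_per_member` is illustrative.

```python
def tickets_per_member(members: int, open_tickets: int) -> float:
    """Open tickets divided by head count."""
    if members <= 0:
        raise ValueError("members must be positive")
    return open_tickets / members


# (members, open tickets) per squad, as listed under Team workload.
squads = {
    "Infra Cloud": (4, 20),
    "DevOps": (4, 18),
    "Operation Support": (3, 43),
}

load = {name: round(tickets_per_member(m, t), 1) for name, (m, t) in squads.items()}
busiest = max(load, key=load.get)

print(load)
print(f"Highest load per member: {busiest}")
```

By this measure Operation Support carries about 14 open tickets per member, roughly three times the load of the other two squads.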