Executive Summary
Incognito Software Systems provides service orchestration software and services solutions that help fixed service providers manage their broadband devices and services. Prior to automation, they had to use AWS services and manually obtain insights about their servers and associated services. Their system had inefficiencies, including the absence of automation in its processes and the dashboard having limited visibility regarding real-time server health.
The customer approached ITTStar Global for a comprehensive solution. ITTStar developed the SLA Dashboard Monitoring System to provide real-time insights about the health and availability of Incognito’s 14 critical servers and their associated services. The solution leveraged AWS serverless technologies to automate data collection, SLA computation, and visualization. This solution also provided stakeholders with immediate access to operational statuses and performance metrics.
Key Challenges
- Manual Monitoring Processes: Prior to automation, the customer had to manually track server uptime and service availability. The manual process was labor-intensive and prone to delays.
- Lack of Real-Time Insights: Stakeholders could not view real-time details regarding server health. This led to significant delays during incident response.
- Inefficient SLA Tracking: Service Level Agreements (SLAs) could only be tracked manually. This increased the risk of inaccurate data and non-compliance.
The Solution: Automated Real-Time Monitoring and SLA Computation
To address these challenges, ITTStar implemented the following solutions:
- AWS Lambda Functions:
- directFetch: Triggered via API Gateway to read the status of servers and services, and provide real-time data on uptime, downtime, and active/inactive states.
- slacomputation: Scheduled via EventBridge to compute SLA metrics in 1-minute intervals and store it securely in S3.
- slaFetch: Accesses the computed SLA data from S3 and delivers it to clients through API Gateway for front-end visualization.
- Frontend Dashboard:
- Visual Indicators: Depending on the displayed color, it shows:
- Green: All services are operational.
- Orange: One or more services are down.
- Red: The server or instance is completely shut down.
- High Availability (HA) Considerations:
- If one service is active and the other is inactive, the dashboard displays green but shows the system to be on standby.
- If both services are inactive, the color changes to orange.
Why Did ITTSTAR Choose This Solution?
- Scalability: With AWS serverless architecture, the system can scale up/down with organizational needs without significant overhead.
- Real-Time Data Processing: Integrating Lambda functions and API Gateway allows for immediate, real-time data retrieval and processing. This ensures the availability of up-to-date information.
- Automated SLA Tracking: Automating SLA tracking reduces manual errors and ensures compliance with service agreements.
- Enhanced Visibility: Stakeholders can view clear, color-coded indicators of system health with the new front-end dashboard. This facilitates quicker, more quality decision-making.
Why AWS?
AWS was chosen for its highly scalable, secure, and cost-effective infrastructure that is ideal for real-time monitoring and data-driven automation.
The following AWS services were used for the solution and resulted in a resilient and efficient system:
- AWS Lambda
- Provided serverless computing power for processing continuous server status updates.
- Eliminated the need to manage underlying infrastructure.
- Amazon EventBridge
- Delivered scheduled event triggers for SLA computations every minute.
- Ensured precise and consistent tracking of service performance.
- Amazon S3
- Offered a durable and scalable storage solution for computed SLA data.
- Enabled reliable long-term data retention and access.
- Amazon API Gateway
- Facilitated secure access to real-time server insights.
- Powered front-end visualization with minimal latency.
These AWS services combined gives:
- A resilient and highly accessible monitoring system that scales dynamically with organizational growth.
- Strong security and compliance standards to protect sensitive operational data.
- Maximized system uptime and ensured performance reliability.
Why Incognito Chose ITTStar
Incognito chose ITTStar because of their:
- Expertise in building scalable, cloud-native monitoring solutions that are tailored to a business’s needs.
- Membership in the global Amazon Partner Network (APN) and deep knowledge of AWS services, automation, and real-time data processing.
- Strong focus on reliability, security, and user-centric design.
- Experience in delivering a customized, automated solution that satisfies Incognito’s requirements.
- Dedication in delivering cutting-edge cloud technology, continuous innovation, and dedicated support.
All this combined led to the creation of a powerful SLA Dashboard that offers unmatched operational visibility. The ITTStar promise ensures that their critical infrastructure will remain resilient, efficient, and future-ready.
Results & Impact
- Enhanced Monitoring Efficiency: Incognito achieved 100% real-time visibility across all critical servers and services. This led to faster identification of potential issues and downtime events.
- Improved Incident Response Times: The average time to detect and respond to server/service disruptions was reduced by 45%. This minimized service interruptions and improved system resilience.
- Operational Time Savings: Manual efforts for tracking SLA compliance and server health were reduced by over 50%. This allowed technical teams to focus more on strategic initiatives and innovation.
- Better SLA Compliance: Real-time SLA computation ensured that Incognito consistently met or exceeded SLA targets. This helped strengthen internal governance and improve customer satisfaction.
- Increased Service Reliability: Continuous monitoring and proactive alerts helped maintain high service uptime. This contributed to better performance and system reliability.
- Stronger Decision-Making with Data: With accurate, real-time operational data, Incognito’s leadership team was able to make more informed infrastructure- and scaling-related decisions. This led to a significant improvement in agility.
- Enhanced Stakeholder Confidence: Visual dashboards and SLA reports provided leadership and technical teams with clear, accessible insights. This helped boost confidence in infrastructure management practices.