Rivervo
July 2019 – June 2020
Incident History
June 2020 1 incident
Scheduled maintenance — server OS migration to CloudLinux
22 Jun 2020 | hosting servers, control panel, server load | resolved | 1h 45m
The production server's operating system was migrated from CentOS to CloudLinux. CloudLinux enables per-account LVE (Lightweight Virtual Environment) resource limits and CageFS filesystem isolation, improving stability and security on the shared server. The migration required a server reboot. Maintenance was carried out at night local time (GMT+4), starting at 00:00 GMT+4 (20:00 UTC).
20:00 UTC
Maintenance window opened. CloudLinux kernel installation started.
20:40 UTC
Server rebooting into CloudLinux kernel. Full downtime window.
21:10 UTC
CloudLinux confirmed active. LVE and CageFS configuration in progress.
21:45 UTC
All services restored. Per-account resource isolation now enforced server-wide.
April 2020 1 incident
Server overload — sustained high traffic surge
14 Apr 2020 | hosting servers, server load | resolved | 2h 30m
A sudden increase in inbound web traffic across the server pushed load averages past safe thresholds. PHP-FPM pools and Nginx worker processes were scaled up live to absorb the traffic.
10:00 UTC
Server load average climbing past threshold. Investigating.
10:30 UTC
Traffic volume 3× above baseline across shared server. Scaling worker processes.
11:15 UTC
PHP-FPM pools resized. Nginx workers increased. Load stabilising.
12:30 UTC
Server stable under new resource allocation. Monitoring active.
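For readers sizing their own pools, a common rule of thumb when resizing PHP-FPM is to divide the memory available to PHP by the average memory used per worker. The sketch below illustrates that arithmetic only; the function name and all figures are hypothetical and are not the values used during this incident.

```python
def max_children(total_ram_mb: int, reserved_mb: int, avg_worker_mb: int) -> int:
    """Estimate a pm.max_children value that fits in the available memory.

    total_ram_mb   -- total server memory (hypothetical figure)
    reserved_mb    -- memory set aside for the OS, MySQL and other services
    avg_worker_mb  -- average resident memory of one PHP-FPM worker
    """
    if avg_worker_mb <= 0:
        raise ValueError("avg_worker_mb must be positive")
    available = total_ram_mb - reserved_mb
    # Always allow at least one worker, even on a badly constrained server.
    return max(available // avg_worker_mb, 1)


# Hypothetical example: 16 GB server, 4 GB reserved, 60 MB per worker.
print(max_children(16384, 4096, 60))  # -> 204
```

The result is an upper bound based on memory alone; CPU capacity and database connection limits may call for a lower value.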
February 2020 1 incident
Scheduled maintenance — Node.js runtime installed on server
11 Feb 2020 | hosting servers, control panel | resolved | 30m
Node.js (v12 LTS) was installed server-wide via cPanel's Node.js Selector module. The EasyApache rebuild required a web server restart. Maintenance was carried out at night local time (GMT+4), starting at 01:00 GMT+4 (21:00 UTC).
21:00 UTC
Maintenance window opened. Node.js Selector packages installing.
21:20 UTC
EasyApache rebuilding with Node.js support. Brief service interruption.
21:30 UTC
Node.js 12 LTS active on server. All accounts can enable Node.js applications.
November 2019 1 incident
MySQL slow query causing cPanel database timeouts
29 Nov 2019 | control panel | resolved | 1h 10m
A missing index on an internal cPanel MySQL table caused slow queries under concurrent load, leading to session timeouts in WHM and cPanel. The index was added with no data loss.
15:00 UTC
cPanel session timeouts reported. Server MySQL slow query log reviewed.
15:40 UTC
Missing index identified on internal sessions table. Adding index live.
16:10 UTC
Index added. cPanel fully operational.
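The effect of a missing index can be illustrated with a small, self-contained sketch. It uses Python's built-in SQLite engine as a stand-in for MySQL, and the table, column, and index names are hypothetical; the actual cPanel table is not named in this report. Before the index exists the lookup scans the whole table, and afterwards the planner searches the index instead.

```python
import sqlite3


def query_plan(conn: sqlite3.Connection, sql: str, params=()) -> str:
    """Return the planner's description of how it would run the query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The human-readable detail is the last column of each plan row.
    return " | ".join(row[-1] for row in rows)


conn = sqlite3.connect(":memory:")
# Hypothetical sessions table, not the real cPanel schema.
conn.execute(
    "CREATE TABLE sessions (id INTEGER PRIMARY KEY, user_id INTEGER, token TEXT)"
)
conn.executemany(
    "INSERT INTO sessions (user_id, token) VALUES (?, ?)",
    [(i % 500, f"t{i}") for i in range(5000)],
)

lookup = "SELECT token FROM sessions WHERE user_id = ?"

before = query_plan(conn, lookup, (42,))  # full table scan
conn.execute("CREATE INDEX idx_sessions_user ON sessions (user_id)")
after = query_plan(conn, lookup, (42,))   # index search

print("before:", before)
print("after: ", after)
```

On MySQL the equivalent checks would be `EXPLAIN` on the slow statement followed by `CREATE INDEX`; the exact plan wording differs between engines and versions.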
September 2019 1 incident
Scheduled maintenance — OPcache and Redis server-level optimisation
10 Sep 2019 | hosting servers | resolved | 30m
Server-wide OPcache tuning was applied and a Redis object cache daemon was installed for shared hosting accounts. A service restart was required. Maintenance was carried out at night local time (GMT+4), starting at 02:00 GMT+4 (22:00 UTC).
22:00 UTC
Maintenance window opened. OPcache and Redis packages installing on server.
22:20 UTC
Services restarting. Brief interruption expected.
22:30 UTC
OPcache and Redis active server-wide. Server response times improved.
July 2019 1 incident
Disk I/O saturation — runaway MySQL queries
17 Jul 2019 | hosting servers, server load | resolved | 1h 25m
A runaway database process on one shared account generated excessive disk I/O, saturating the server's I/O queue and degrading all co-hosted accounts. The process was terminated and the account's resource limits were tightened.
09:30 UTC
Server disk I/O at 100%. Multiple accounts reporting 504 errors.
10:00 UTC
Offending MySQL process identified and killed. I/O pressure easing.
10:55 UTC
I/O normalised. All accounts responding. Per-account MySQL limits tightened.
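Finding a runaway query usually starts from the output of MySQL's `SHOW FULL PROCESSLIST`, which reports an `Id`, `Command`, and running `Time` in seconds for each connection. The sketch below shows one way such rows might be filtered; the sample rows, the threshold, and the function name are hypothetical, and it does not reproduce the procedure used in this incident.

```python
from typing import Dict, List, Union

Row = Dict[str, Union[int, str]]


def long_running(rows: List[Row], max_seconds: int) -> List[int]:
    """Return the Ids of queries that have run longer than max_seconds.

    Idle connections (Command == "Sleep") are ignored even if their
    Time value is large, since they are not executing anything.
    """
    return [
        int(r["Id"])
        for r in rows
        if r["Command"] == "Query" and int(r["Time"]) > max_seconds
    ]


# Hypothetical rows shaped like SHOW FULL PROCESSLIST output.
sample = [
    {"Id": 101, "User": "acct_a", "Command": "Query", "Time": 5},
    {"Id": 102, "User": "acct_b", "Command": "Query", "Time": 1840},
    {"Id": 103, "User": "acct_c", "Command": "Sleep", "Time": 3000},
]

print(long_running(sample, 600))  # -> [102]
```

Each returned Id could then be reviewed by an administrator and, if appropriate, stopped with MySQL's `KILL <id>` statement.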