Job Description
Job DescriptionWe are looking for a Systems Engineer to support the reliability and performance of critical cloud-based business applications in Jacksonville, Florida. This role focuses on day-to-day platform operations, incident resolution, and service improvement across production environments. The ideal candidate is comfortable working across infrastructure, application support, and automation while partnering with cross-functional teams to maintain compliance and deliver a strong customer experience.
Responsibilities:
• Monitor cloud applications and platform components to identify service degradation, abnormal behavior, and data issues before they affect users.
• Perform routine operational tasks such as system checks, maintenance activities, patching coordination, and software version updates to keep environments stable.
• Investigate production incidents independently, restore service as quickly as possible, and contribute to major incident response efforts when escalations occur.
• Document root causes, corrective actions, and follow-up recommendations after incidents to strengthen operational processes and reduce repeat issues.
• Partner with security and compliance stakeholders to gather audit support materials, address findings, and help maintain required control standards.
• Work with sales and customer support teams to troubleshoot client-facing technical problems through log review, environment analysis, and issue replication.
• Create and maintain automation scripts in Shell or Python to simplify recurring support tasks and improve operational efficiency.
• Support containerized and middleware-based environments by assisting with Kubernetes operations and resolving common issues involving web, database, cache, and messaging services.
• Participate in an on-call rotation and respond to production emergencies in a timely manner to help maintain service availability.
