Senior Site Reliability Engineer I, Data Protection Products
ConnectWise is a global, industry-leading software company with over 3,000 colleagues in North America, EMEA, and APAC. As a community-driven software company dedicated to the success of technology solution providers, our suite helps over 45,000 of our partners manage their businesses better, sell more efficiently, automate service delivery, and remotely control technology so they can consistently deliver amazing customer experiences.
Our company is powered by our connections, our colleagues, and our community. And, we accept all kinds.
Game-changers, innovators, culture-lovers—and humankind.
We invite discovery and debate. We recognize key moments as milestones.
We see you and value you for your unique contributions. Our inclusive, positive culture lays the foundation to ensure every colleague is valued for their perspectives and skills, giving you the choice of how YOU make a difference.
Curious? Read this opportunity to learn how YOU can make a difference at ConnectWise!
General Summary:
The Senior Site Reliability Engineer I is responsible for ensuring the availability, performance, and scalability of our
systems and applications. This role works in partnership with the Engineering teams to design, implement, and
maintain robust and resilient infrastructure and automation solutions to improve the reliability and performance
of our technology stack.
Essential Duties and Responsibilities:
• Provides support to the Engineering teams with high attention to detail
• Researches, analyzes, and documents findings
• May influence others within the Development Engineering team through the explanation of facts, policies,
and practices
• Designs, implements, and maintains highly available and scalable infrastructure solutions for our
applications and services
• Develops and optimizes automation scripts and tools to enhance system reliability, deployment
processes, and operational efficiency
• Monitors and analyzes system performance metrics, identifying areas for improvement and implementing
proactive measures to prevent issues and ensure high availability
• Collaborates with development teams to define and implement best practices for application deployment,
configuration management, and release management
• Troubleshoots and resolves issues related to system performance, network connectivity,
application stability, and other areas, and documents root cause analyses (RCAs)
• Performs capacity planning and scalability assessments to ensure our systems can handle increasing
demands and future growth
• Implements and maintains monitoring, alerting, and logging systems to provide real-time visibility into
system health and performance
• Participates in incident response, root cause analysis, and post-incident reviews to identify and address
system vulnerabilities and improve overall system resilience
• Stays up to date with industry trends and best practices related to site reliability engineering and
contributes to continuous improvement initiatives
• Designs, implements, and optimizes infrastructure and deployment processes to optimize cost
and decrease COGS (Cost of Goods Sold) while ensuring high availability and
performance
• Monitors and analyzes cost metrics, providing recommendations and implementing cost-saving initiatives
to achieve optimal resource allocation and budget utilization
Knowledge, Skills, and/or Abilities Required:
To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
• Ability to work independently on projects and processes with general supervision
• Practical knowledge of applicable work area
• Ability to adapt to and understand new technologies and processes as business requirements change
• Understanding of scripting and automation using languages such as Python, Bash, PowerShell, or similar
• Solid understanding of networking concepts, protocols, and security practices
• Strong knowledge of Linux/Unix systems administration and troubleshooting
• Familiarity with containerization technologies like AWS Fargate, Docker, or similar
• Understanding of container orchestration platforms like AWS ECS, Kubernetes, or similar
• Ability to work collaboratively in cross-functional teams and effectively communicate technical concepts
to both technical and non-technical stakeholders
• Strong problem-solving skills and the ability to analyze complex issues and provide practical solutions
• Strong attention to detail, with a focus on ensuring reliability, performance, and scalability in all aspects of
system design and implementation
• Ability to adapt to a fast-paced and changing environment, managing multiple priorities and deadlines
effectively
Educational/Vocational/Previous Experience Recommendations:
- Bachelor’s degree in related field or equivalent business experience
- 2+ years of relevant experience
- Experience in designing, building, and managing highly available and scalable infrastructure in a cloud
environment
- Experience with configuration management and infrastructure-as-code tools
Working Conditions:
• Onsite/Hybrid/Remote depending on location
• 10-20% travel may be required
ConnectWise is an Equal Opportunity Employer, dedicated to building a diverse and inclusive workforce and providing a workplace free from discrimination and harassment. ConnectWise provides equal employment opportunities to all employees and applicants without regard to race, ethnicity, color, religion, age, sex (including pregnancy), sexual orientation, gender, gender identity or expression, ancestry, national origin, citizenship status, physical or mental disability, genetic information, military/veteran status, marital status, familial or parental status, or any other characteristic or status protected by applicable federal, state and local laws.
The statements above are intended to describe the general nature and level of work being performed by individuals assigned to this job. Other duties may be assigned as needed. Reasonable accommodations may be made to enable qualified individuals with disabilities to perform the essential functions of the job and/or to receive other benefits and privileges of employment. If you need a reasonable accommodation for any part of the application and hiring process, please contact us at talentacquisition@connectwise.com or 1-800-671-6898.