Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement.
Install new / rebuild existing servers and configure hardware, services, settings, directories, storage, etc. in accordance with standards and project/operational requirements.
Perform daily system monitoring, verifying the integrity and availability of all hardware, server resources, systems and key processes, reviewing system and application logs, and verifying completion of scheduled jobs such as backups.
Perform regular security monitoring to identify any possible intrusions.
Regularly work on improving August 99’s security practices, including:
Recommending new technologies to improve threat assessment and mitigation.
Assisting in the migration to new technologies.
Assisting coworkers with infosec best practices to ensure cross-coverage within the team
Practice sustainable incident response and blameless postmortems.
Perform ongoing performance tuning and resource optimization as required.
Apply OS patches and upgrades on a regular basis, and upgrade administrative tools and utilities. Configure / add new services as necessary.
Develop and maintain installation and configuration procedures, especially related to automation.
BS / MS degree in Information & Technology or equivalent experience relevant to functional area.
With at least 5 years of software or systems engineering experience.
Working experience with multiple POSIX operating systems (e.g. CentOS, Ubuntu, macOS).
Advanced knowledge of at least one server-grade GNU/Linux distribution (e.g. CentOS, Ubuntu).
Basic to advanced knowledge of database optimization and SQL queries (specifically MySQL/MariaDB).
Good scripting skills using POSIX scripting toolkits (bash, sed, awk, Python, Perl, etc). Knowledge of general-purpose programming languages such as PHP, C, C++, and Java a plus.
Expertise/advance knowledge with WordPress setup and configuration.
Demonstrated experience working with monitoring and analytics tools (e.g. Sysdig, Papertrail, Nagios, Cacti, Splunk).
Knowledge of best practices in regards to security/encryption and service configuration (SSL/TLS, SFTP, password management, access restrictions, firewalls, ports, etc.).
Working knowledge of AWS, Rackspace, or Google Cloud services and tools.
RHCSA or RHCSE a major plus, but not required.
AWS Certified Developer, SysOps, or Architect a major plus, but not required.
Design, develop, troubleshoot, and debug software programs for databases, applications, tools, networks, etc.
As a member of the site reliability and IT team, you will assist in defining and developing software for tasks associated with the developing, debugging or designing of software applications or operating systems.
Provide technical leadership to other SREs and software developers.
Specify, design, and implement modest changes to existing software architecture to meet changing needs.
Analyze system and software security and change procedures or code when necessary.
Stay informed about new and relevant CVE’s, potential bugs, viruses/worms/etc, and how to take preventive or corrective measures for each.
Duties and tasks are varied and complex needing independent judgment. Candidates should be fully competent in their own areas of expertise. May have project lead roles and/or supervise lower level personnel.