What You’ll Do
SRE Track — SR Automation Consolidation
- Inventory all existing service request (SR) automation workflows across the SRE organization, including both formally documented workflows and informal/tribal scripts.
- Audit each automation for reliability, documenting trigger mechanisms, execution platforms, success/failure rates, dependencies, error handling, and ownership.
- Identify duplicated automation across teams and quantify the cost of fragmentation in manual toil hours and incident impact.
- Perform root cause analysis on the top 10 failing automations and map the current SR automation architecture, including integration points and single points of failure.
- Produce a consolidation blueprint mapping each automation to its target state — merge, retire, or rewrite — along with standard patterns and templates for future automations.
- Deliver a phased migration roadmap with effort estimates, risk assessment, and recommended sequencing.
Supportability Track — Monitoring Coverage & Tooling Rollout
- Map current monitoring and observability coverage across services and environments, identifying gaps in alerting, dashboarding, and telemetry.
- Assess existing tooling adoption and effectiveness across Supportability workflows.
- Develop a tooling rollout plan to expand monitoring coverage to underserved areas.
What You’ll Deliver
- SR Automation Master Inventory with full metadata per automation
- Automation Duplication Report (cross-team overlap analysis)
- Reliability Assessment per automation (success rates, failure modes, MTTR)
- Top 10 Failing Automations Root Cause Analysis Report
- SR Automation Architecture Diagrams (current state)
- Consolidation Blueprint with per-automation target-state mapping
- Standard Automation Patterns and Templates
- Phased Migration Roadmap with Effort Estimates
- Monitoring Coverage Map and Gap Analysis
- Tooling Rollout Plan
- Leadership Review Presentation and Complete Documentation Package
Qualifications
- Currently pursuing or recently completed a degree in Computer Science, Information Technology, Systems Engineering, or a related field.
- Familiarity with SRE concepts — reliability, incident management, automation, and toil reduction.
- Exposure to ticketing systems (ServiceNow or similar) and automation platforms (Ansible, Terraform, or scripting languages like Python/Bash).
- Strong analytical skills — you’ll be digging into logs, failure data, and automation workflows to identify patterns and root causes.
- Excellent written and verbal communication skills for stakeholder interviews, documentation, and leadership presentations.
- Ability to manage parallel workstreams and stay organized across two reporting lines.
- Self-directed with the ability to synthesize findings from multiple sources into clear, actionable deliverables.
Nice to Have
- Experience with cloud platforms (AWS, Azure, or GCP) and infrastructure-as-code tools.
- Familiarity with monitoring and observability tools (Datadog, Prometheus, Grafana, PagerDuty, or similar).
- Understanding of ITIL/ITSM processes and service request lifecycle.
- Exposure to containerization and orchestration (Docker, Kubernetes).
- Previous experience with process mapping, root cause analysis, or operational auditing.
Benefits and perks listed here may vary depending on the nature of employment with Deltek. Employees have access to healthcare benefits, a 401(k) plan and company match, paid vacation time and holidays, well-living programs, short-term and long-term disability coverage, basic life insurance and tuition reimbursement.
Why Join #TeamDeltek
Grow. Collaborate. Innovate.
We create innovative products and solutions that power our customers’ project success. Our market leadership is based on the work of our global and diverse team of innovators, creators and collaborators who have a passion for learning, growing and making a difference for Deltek Project Nation.



