System Reliability Engineer (Application Support + Automation)

Fulcrum Digital
Contract
On-site
Dublin 02, Dublin, Ireland

Who are we
Fulcrum Digital is an agile, next-generation digital acceleration company providing digital transformation and technology services from ideation to implementation. These services apply across a variety of industries, including banking and financial services, insurance, retail, higher education, food, healthcare, and manufacturing.

The Role

  • Plan, manage, and oversee all aspects of the production environment.
  • Define strategies for application performance monitoring and optimization in the production environment.
  • Respond to incidents, improve the platform based on feedback, and measure the reduction in incidents over time.
  • Support deployment of code into multiple lower environments, supporting current processes with an emphasis on automating everything as soon as possible.
  • Design, develop, and standardize monitoring and alerting mechanisms for the supported applications.
  • Take a holistic approach to problem solving by connecting the dots across the technology stack that makes up the platform during a production event, to optimize mean time to recover.
  • Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.
  • Analyze the platform's ITSM activities and provide a feedback loop to development teams on operational gaps or resiliency concerns.
  • Support services before they go live through activities such as system design consulting, capacity planning, and launch reviews.
  • Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead in DevOps automation and best practices.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Work with a global team spread across tech hubs in multiple geographies and time zones.
  • Share knowledge and explain processes and procedures to others.
  • Mentor junior team members.
  • Perform on-call duties on a rotational basis.
  • Occasional off-hours work is required.

     

Requirements
Skills

Must Have

  • Linux
  • Mainframe
  • Shell scripting
  • ITIL/ITSM, application troubleshooting
  • SQL
  • Any monitoring tool (Splunk or Dynatrace preferred)
  • Jenkins (CI/CD)
  • Groovy scripting / YAML (basic)
  • Git / Bitbucket (basic)
  • Ansible/Chef (good to have)

Good To Have

  • Even Framework architecture
