Be the next big change > Dassault Systèmes
Be the Next Game Changer

Staff Site Reliability Engineer

Japan, Tokyo
Regular
2/2/2023
531890

Medidata: Powering Smarter Treatments and Healthier People

Medidata, a Dassault Systèmes company, is leading the digital transformation of life sciences, creating hope for millions of people. Medidata helps generate the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 2,000+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Known for its ground-breaking technological innovations, Medidata has supported more than 30,000 clinical trials and 9 million study participants. And Medidata’s ongoing commitment to infusing the patient voice into trial designs and solutions is helping to create a better and more inclusive experience for all participants in clinical studies. Medidata is involved in nearly 40% of company-initiated trial starts globally, with studies conducted in more than 140 countries. More than 70% of novel drugs approved by the Food and Drug Administration (FDA) in 2022 were developed with Medidata software. Medidata is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more at www.medidata.com and follow us @medidata.

 

our Mission: 

 

As a Staff Site Reliability Engineer, you will be building the reliability into products used every day by our customers around the world. You'll also create and contribute to tools relied upon by all internal engineering teams and customer-facing functions at Medidata. Quality and standards matter to us - we strive to positively influence the Technology organization through close collaboration with other teams and attention to detail when contributing to shared projects. While our projects are diverse (observability tools and services, clinical trial data capture, regulated content management, clinical trial management, and much more), our mission remains constant - improve Medidata’s velocity of innovation so we can help our customers power smarter treatments and healthier people.

 

Role Description:

 

Site Reliability Engineers (SRE) at Medidata aim to help teams improve reliability of our Platform. Some SREs focus on writing application level code to improve observability and reliability, while others focus on improving deployment and infrastructure automation. We appreciate different areas of expertise and offer growth in the area of focus most suitable to the candidate and our team.  All SRE practitioners have common expectations - listed below - to help us lower MTTR and CFR and accelerate our teams.
The SRE team creates tooling, sets standards and best practices for the rest of the Medidata teams. As a member of the team you will work on solutions to improve the reliability of deployments, communications between services, observability and alerting, and much more. Your solutions will be used by multiple teams and you will have the chance to interact with a multitude of technical stacks and guide teams on their road of SRE.

 

  • Guide teams on their observability and alerting needs. Reviews and approves their changes. 
  • Design and create software solutions for complex SRE needs of teams.
  • Improve implementation of the telemetry pipelines, including collaboration with open-source projects.
  • Understand the runtime hardware used by services, and guides teams on how to add telemetry for each use case.
  • Lead hazards reviews with teams. Promotes usage and follows up with value added.
  • Create solutions to do complex analysis of telemetry data.
  • Propose and implement new auto-scaling mechanisms.
  • Propose changes to improve the performance of interactions among multiple services.
  • Have a wide understanding of the interaction among systems and create a set of consistent solutions for their observability needs.
  • Lead improvements of CD for the team. Teaches others best practices.
  • Lead analysis of past incidents to find potential improvements in observability.
  • Propose changes to prevent incidents from repeating.
  • Lead the RCA meeting.
  • Create means to ensure teams review their runtime objectives and alerts periodically.
  • Propose and lead new initiatives that are widely applicable within the organization.
  • Give feedback to write better stories and propose breakdown to reduce risk.
  • Proactively search for clarification, urgency and context if unclear.
  • Maintain awareness of industry trends and tools.
  • Learn and disseminate best practices.
  • Work with empathy for other teams. Proactively worries about other teams.
  • Identify and communicate issues as they arise.
  • Take ownership of stories.
  • Proactively contribute to internal technical documentation content and organization.
  • Provide input throughout planning, design, implementation, deployment, maintenance, and monitoring.

Education & Experience:

  • Bachelor’s degree in computer science (or related field) or equivalent experience.
  • Experience creating innovative monitoring tools
  • Experience with web backend, synchronous and asynchronous communications
  • Experience consuming, designing and building APIs

 

 

MEDIDATA Logo > Dassault Systèmes

MEDIDATA generates the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes.