Principal Software Engineer, Ozone / HDFS


This description is a summary of our understanding of the job description. Click on ‘Apply’ button to find out more.

Role Description

At Cloudera, we empower people to transform complex data into clear and actionable insights. Cloudera is looking for an exceptional and passionate software engineer with a strong distributed systems background to join the Storage Engineering team focused on building Apache Ozone.

  • Responsible for primary storage and storage access layers, which are core to the platform.
  • Apache Ozone provides a massively scalable distributed object store with a distributed file system interface.
  • Designed to scale to tens of billions of files and blocks, overcoming the limitations of Hadoop Distributed File System (HDFS).
  • Opportunity to join the team that created and wrote most of the HDFS code and make a huge impact on the big data and cloud computing industry.
  • Directly involved in the design and implementation of the core feature set of Apache Ozone and Apache Ratis (open-source RAFT implementation).
  • Regularly contribute code and design docs to the Apache open-source community.
  • Support enterprise customers running 100s of petabytes-scale big data analytics and ML/AI pipelines.
  • Partner with Engineering leaders, product managers, and cross-functional teams in understanding requirements and turning them into a solid design and implementation.
  • Responsible for leading a talented group of engineers working on a feature and mentoring junior engineers.

Qualifications

  • Bachelor’s +10, Master’s +8 years of relevant industry experience required (5+ for PhD candidate).
  • Strong backend engineering skill set with expertise in Java, or strong C++ skills, with intermediate Java expertise.
  • Passionate about programming with clean coding habits, attention to detail, and focus on quality.
  • Experience with large-scale, distributed systems design and development with a strong understanding of scaling, replication, consistency, and high availability.
  • Solid experience with system software design and development with a strong understanding of computer architecture, storage, network, and IO subsystems, and distributed systems.
  • Hands-on programmer with strong data structures and algorithms skillset.
  • Strong oral and written communication skills.

Requirements

  • Strong background in a distributed storage system, including file systems, database storage internals, NoSQL storage, or distributed hash tables.
  • Strong background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations.
  • Strong understanding of the Apache Big Data ecosystem and over 3+ years of experience in systems software, including file systems.
  • Recognized contributions to open source projects.
  • Experience using projects such as Hive, Pig, MapReduce, HBase, etc., is a big plus.
  • Good understanding of storage development, RAFT replication framework, or equivalent distributed consensus frameworks.

Benefits

  • Generous PTO Policy.
  • Support work-life balance with Unplugged Days.
  • Flexible WFH Policy.
  • Mental & Physical Wellness programs.
  • Phone and Internet Reimbursement program.
  • Access to Continued Career Development.
  • Comprehensive Benefits and Competitive Packages.
  • Paid Volunteer Time.
  • Employee Resource Groups.

Source link

Organization

Post Name

Number of Vacency

Educational Qualification

Mode

Salary

Age Limit

Starting Date

Ending Date

Scroll to Top