Sr. Site Reliability Engineer--Big Data

Employment Type: Full-Time


: Information Technology


PulsePoint, a global programmatic advertising platform with specialized healthcare expertise, fuses the science of programmatic targeting, distribution and optimization with the art of brand engagement. The PulsePoint platform is powered by terabytes of impression-level data, allowing brands to efficiently engage the right audiences at scale while helping publishers increase yield through actionable insights.

As a part of the SRE team you will be challenged and expected to grow your technical knowledge. You will challenge your fellow team members, and they will challenge you back. Our team is not competitive, but we are goal oriented and driven to succeed.

What you'll be doing:
* Deploying, configuring, monitoring and maintaining multiple big data stores across multiple datacenters
* Performing planning, configuration, deployment and maintenance work relevant to the environment
* Managing the large-scale Linux infrastructure to ensure maximum uptime
* Developing and documenting system configuration standards and procedures
* Performance and reliability testing. This may include reviewing configuration, software choices/versions, hardware specs, etc.
* Advancing our technology stack with innovative ideas and new creative solutions

Who are you:
* Collaboration is in your DNA. You enjoy contributing to a mutual cause, and you know that when the team succeeds, you succeed.
* You are always looking for ways to grow your skills. You are hungry to learn new technologies and to share your insights with your team.
* You like a big picture perspective and also digging into the fine details. You can think strategy but also dive into complex systems, break them down, and build them back better.
* You are a proactive problem solver. You are irked by an unreliable infrastructure and your first instinct is to find ways to fix it.

What you'll need:
* Thorough understanding of Linux (we use CentOS in production)
* Experience managing Kafka clusters on Linux, including knowledge of the JVM and memory management on the host
* Ability to work with Cassandra clusters from installation through troubleshooting and maintenance
* Multi-faceted Hadoop understanding, including the use of HDFS, Hive, and Impala for data storage and retrieval
* Experience administering SQL/NoSQL databases (MySQL, PostgreSQL, MongoDB)
* Any scripting language (Python/Ruby/Shell etc.)
* Understanding of basic networking concepts (TCP/IP stack, DNS, CDN, load balancing)

Bonus, but not required:
* Isilon experience, especially as related to HDFS storage for a Hadoop cluster
* Experience with columnar data stores such as Vertica or Clickhouse
* Experience with the Puppet configuration management tool
* Experience with scalable infrastructure monitoring solutions such as Icinga, Prometheus, ELK
* Experience with container technologies such as Docker and Kubernetes
* Experience training and mentoring junior-level staff
* Experience in AdTech or High-Frequency Trading a plus
* Experience with security-related best practices a plus

What you'll get:
* Sane work hours (with flexible scheduling)
* Competitive Salary & 401K Plan Match
* Generous paid vacation (we consider your birthday a holiday)
* Annual Company Retreat
* The opportunity to partake in our Office Fitness Shape-Up Program
* Spin Classes, Bootcamp, Yoga and other Fitness classes with the office
