Sr. Data Engineer
Job title: Sr. Data Engineer in Englewood Cliffs, NJ at NBCUniversal
Company: NBCUniversal
Job description: Company DescriptionNBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, MSNBC, CNBC, NBC Sports, Telemundo, NBC Local Stations, Bravo, USA Network, and Peacock, our premium ad-supported streaming service. We produce and distribute premier filmed entertainment and programming through Universal Filmed Entertainment Group and Universal Studio Group, and have world-renowned theme parks and attractions through Universal Destinations & Experiences. NBCUniversal is a subsidiary of Comcast Corporation.Our impact is rooted in improving the communities where our employees, customers, and audiences live and work. We have a rich tradition of giving back and ensuring our employees have the opportunity to serve their communities. We champion an inclusive culture and strive to attract and develop a talented workforce to create and deliver a wide range of content reflecting our world.Comcast NBCUniversal has announced its intent to create a new publicly traded company ('Versant') comprised of most of NBCUniversal's cable television networks, including USA Network, CNBC, MSNBC, Oxygen, E!, SYFY and Golf Channel along with complementary digital assets Fandango, Rotten Tomatoes, GolfNow, GolfPass, and SportsEngine. The well-capitalized company will have significant scale as a pure-play set of assets anchored by leading news, sports and entertainment content. The spin-off is expected to be completed during 2025.Job Description
- Reviews internal and external business and product requirements for data operations and activity and suggests changes and upgrades to systems and storage to accommodate ongoing needs.
- Design, develop, and maintain CI/CD pipelines using GitHub Actions to automate deployment, testing, and monitoring of applications.
- Implement and manage serverless solutions (e.g., AWS Lambda, EMR Serverless, Kafka, SNS, SQS, Athena etc.) as part of the application architecture.
- Implement infrastructure as code (IaC) practices using tools like Terraform, AWS CloudFormation, or similar to manage cloud infrastructure.
- Work with development teams to set up automated testing frameworks, ensuring high test coverage and code quality.
- Must understand the basics of relational data modeling and be able to clearly articulate the reasons to use non-relational systems in our architecture. Experience in desired but relevant experience in any of the following is acceptable: Singlestore, MySQL, Redshift, Athena, MSSQL Server, Oracle.
- Decent understanding for the digital media ad sales business and ad serving technologies with experience working with ad serving transactional data logs or Nielsen demographic data.
- Educate and inform business partners on architecture, capabilities, best practices and solutions to build out future enhancements
- Assist in analyzing business requirements, source systems, understand underlying data sources, transformation requirements, data mapping, data model and metadata for reporting solutions
- Writing easily understood documentation and architecture diagrams and keeping them up to date as code and frameworks change over time.
- 5+ Years Experience in Data Modeling, Data architecture, Data Quality, Metadata, ETL and Data Warehouse methodologies and technologies. 3+ years experience in distributed computing solutions such as Spark, MapReduce, Snowflake, Databricks, or Kubernetes
- 3+ years experience with AWS technologies, with preference for Managed Airflow, EMR, Lambda, ECS, EKS
- Experience in designing and managing CI/CD pipelines, preferably using GitHub Actions.
- Experience in any combination of the following: SQL, Linux, MicroStrategy, Tableau, Python, APIs, Spark, Scala, Pandas
- Strong problem-solving skills.
- Strong oral and written communication and influencing skills, with the ability to communicate new concepts and drive change in processes and behaviors and to communicate complex technical topics to management and non-technical audiences.
- Strong knowledge of data security practices and privacy regulations (e.g., GDPR, CCPA) with a proven ability to implement and maintain robust data protection measures
- Bachelor's degree in Engineering, Computer Science, Information Systems or related field with 5+ years of relevant experience.
- Proven ability to develop data applications using Spark Scala
- Troubleshoot complex data pipelines, including addressing scale-related issues such as partitioning, resolving data skews, and optimizing performance by reviewing Spark UI.
- Additionally, the candidate should be able to consider data model design in scale and performance decisions before implementing solutions
- Understanding of how to manage code in the Enterprise Git repository with appropriate branching and documentation skills
- Ability to read external API documentation and write pipelines to extract data from our partners' systems
- Strong analytical focus, results-oriented and execution driven.
- Ability and desire to work within a cross-functional team environment with people from multiple business units, vendors, countries and cultures.
- Flexibility to adjust to changing requirements, schedules and priorities.
- Ability to work independently under minimum supervision and proactive in solving issues
- Energetic, committed and solution focused with the ability to perform under pressure and meeting targets
- Strong desire to share knowledge, teach others, and improve the overall skills of the team
Expected salary: $115000 - 145000 per year
Location: Englewood Cliffs, NJ
Apply for the job now!
[ad_2]
Apply for this job