Hadoop/Spark Admins
Critical: Hadoop Admin / Spark Admin / Linux / Windows / SAS & Python
Nice to have: AWS / Tableau
Need a great communicator. This role interfaces directly with Data Scientists and others, so the candidate must be personable and able to work issues and deal with people. It's a lot of tech, but it's also a lot of internal coordinating and communicating.
There are 3 openings, each a contract through end of year. Candidates need to be familiar with SAS and able to convert SAS code to Spark and Python. SAS to Python conversion typically refers to migrating data analysis, data manipulation, and statistical modeling tasks from SAS (Statistical Analysis System) to Python, a popular open-source programming language.
Locations: Scottsdale, Chicago, and New York
Interview process: 30 min with hiring manager, then a 1-hour onsite
Current Need:
The EWS Data Science and Analytics teams, as well as the ML Ops team, are fully committed, and we need to augment our resources with external support to:
- Help convert legacy code-based assets to modern high-performance tools (SAS to Python)
- Existing data processing scripts including data movement, cleaning, and aggregation
- Value Testing Process. This scores a potential customer's data through our models to help determine the value of EWS solutions
- SQL/Hive query performance tuning and enhancement
- Develop shared toolkit to automate certain data science processes
- Data profiling
- Feature importance and effectiveness evaluation
- Automate documentation of model development processes
- Assist in upskilling existing team
Project Specifics
- Code Modernization for VT, MV&P, DS, DICA and CIR teams on existing programs/processes
- Migrate from SAS/Hive to Python/Scala/PySpark/SQL or another modern, highly efficient technology that fits Early Warning's current on-prem environment and sets up an easy conversion path for the future state in ADP/Model Factory
- Coordinate with the MLOps team to onboard new data sources that exist in the SAS environment but not in Newton
- For new VTs, work with the relevant parties to ensure project plans account for MLOps engagement to build the capability (other processes, and key capabilities in general, can potentially be requested for MLOps to build from scratch)
- Train the team to ensure proper adoption and transition
- Hive code efficiency evaluation and modernization
- Evaluate legacy repeated Hive queries commonly used by the analytics community
- Upgrade the legacy code to Scala/PySpark or another modern, highly efficient technology that fits Early Warning's current on-prem environment and sets up an easy conversion path for the future state in ADP/Model Factory
- Train the team to ensure proper adoption and transition
- Analytics ToolKit / Capability (shared among all teams)
- When existing open-source packages are not available or do not fit our modeling needs, create standard, reusable, highly efficient procedures for end-to-end model development, validation, and evaluation, for example:
- Data profiling tool (evaluate missing data, value ranges, outliers, categorical features, etc.)
- Feature effectiveness triaging toolset for XGBoost or other non-transparent models
- Provide standard generation of outputs of various model stages that aligns with model governance documentation requirements.
- Provide a template for an efficient Python-based project structure that enables an efficient run, test, debug, and deploy pipeline.
- Engage with MLOps for design, code review, and approval. This is within MLOps roles and responsibilities, but this SOW will help bridge the short-term resource gap
- Report Automation
- Replace the current SAS/VisualBasic process with automated, standardized report generation using the modeling outputs. Collaborate with the tech writer and analytics team to standardize the template and output. This includes both the validation report and the initial model development report (auto-inserted into the template). This may depend on when we have a DR replacement
- Engage with MLOps for design, code review, and approval. This is within MLOps roles and responsibilities, but this SOW will help bridge the short-term resource gap
- Training / Upskilling Analytics Teams
- Create training/onboarding materials and provide a hands-on practice training environment with a target adoption outcome
- Work with Corp Learning & Development to develop a programming training path using existing platforms and tools (LinkedIn Learning and Udemy)
- Provide office hours and troubleshooting support
- Conduct regular code guidance for the team in partnership with MLOps
- Day 1 Monitoring Script
- Create a Day 1 model monitoring script when MLOps resources are not available
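
For candidates gauging the scope of the data profiling tool mentioned under the Analytics ToolKit item, the following is a minimal sketch of what such a procedure might look like. It is illustrative only and is not the team's actual tooling: the function names `profile_column` and `profile_table`, the use of `None` to mark a missing value, and the 1.5 x IQR outlier rule are all assumptions made for this example. It uses only the Python standard library; a production version would more likely run on PySpark or pandas.

```python
import statistics
from collections import Counter


def profile_column(values):
    """Profile one column given as a list; None marks a missing value (assumption)."""
    present = [v for v in values if v is not None]
    report = {"count": len(values), "missing": len(values) - len(present)}

    # Treat the column as numeric only if every present value is an int or float.
    numeric = [
        v for v in present
        if isinstance(v, (int, float)) and not isinstance(v, bool)
    ]
    if present and len(numeric) == len(present):
        report["type"] = "numeric"
        report["min"] = min(numeric)
        report["max"] = max(numeric)
        if len(numeric) >= 4:
            # Flag values beyond 1.5 x IQR from the quartiles (illustrative rule).
            q1, _, q3 = statistics.quantiles(numeric, n=4)
            fence = 1.5 * (q3 - q1)
            report["outliers"] = [
                v for v in numeric if v < q1 - fence or v > q3 + fence
            ]
        else:
            report["outliers"] = []
    else:
        # Anything else is summarized as category level counts.
        report["type"] = "categorical"
        report["levels"] = dict(Counter(present))
    return report


def profile_table(rows):
    """Profile every column of a table given as a list of dicts."""
    columns = sorted({key for row in rows for key in row})
    return {c: profile_column([row.get(c) for row in rows]) for c in columns}
```

Calling `profile_table` on a list of row dictionaries returns one report per column, covering the missing-value count, value range, outlier candidates, and categorical level counts named in the bullet.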