
Data Engineer – SparX Team

Function: eCommerce
Location: Innovation Lab, San Mateo, 94402, California
Date posted: 06/23/2017
Type: Full-time
Permanent / Contract: Regular
Job number: 1015600
Position Summary:
The Data Engineer will work closely with the Data Science team to create reliable data pipeline automation for building data models.  This position will be based at our San Jose, CA office.
  • Work with product owners to understand the problems
  • Be familiar with data pipeline automation tools
  • Work on identifying, collecting and processing data required for the algorithms
  • Proactively push for reliable and continuous updates to models

Responsibilities:
  • Hands-on development, design and testing of scalable data pipelines
  • Work on a small team as a self-starter
  • Test at the unit, functional and integration level
  • Work on legacy code as well as greenfield development
  • Collaborate with Data Science, DBA, Integration and Support teams
  • Actively track new languages and technologies

Basic Qualifications:

  • Bachelor’s degree
  • At least 1 year working with tools used in the Data Science / Machine Learning space (Python, Spark, Hadoop, etc.)
  • Fluency coding in Python and SQL
  • Familiarity with data pipeline automation tools (Airflow, Luigi)
  • Strong knowledge of computer algorithms
  • Ability to write reliable code with error and bounds checking
  • Ability to independently translate a problem space into working code
  • Ability to work independently (NOT expected to create models independently)


Preferred Qualifications:

  • Experience with NumPy / SciPy
  • Knowledge of statistics and machine learning algorithms
  • Experience with Java, Clojure




Staples is an Equal Opportunity Employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, protected veteran status, disability, or any other basis protected by federal, state, or local law.