Sr. Data Scientist

  • Permanent
  • Anywhere

Website PennyMac


Company : PennyMac

Country: United States

Location : Westlake Village, California

Post: Tue, 21 Sep 2021 13:09:28 GMT

Expires: Tue, 19 Oct 2021 23:59:59 GMT

Apply Job : Apply Online

—————————— Job Description ——————————

A Sr. Data Scientist at PennyMac needs to be able to fulfill both the roles of the Data Scientist, analyzing data and building models, and the typical role of the Data Engineer – ETL. We have a massive amounts of data, and we have many projects that will improve the business’ profitability and operations. The challenge for you, should you accept it, is to marry the two together. It is not an easy task. Building data pipelines and scrubbing data is a start, building start-of-the-art models is the middle, and implementing them so that our results improve the bottom line is the end game. Our goal is to be best in class in Machine Learning in the mortgage space.

Job Description:
Help to understand the business problem and the objectives.

Support project leads to identify valuable data sources, and work with our IT team to collect such data

Build ETL data pipelines

Undertake preprocessing of structured and unstructured data

Analyze large amounts of information to discover trends and patterns

Utilize and implement machine learning models for data analysis

Present information using data visualization techniques

Propose data science solutions and strategies to business problems

Ensure solutions are feasible to implement and achieve the business’ objectives

Demonstrate behavior aligned with the organization’s desired culture and values

Ideal Candidate will have the following::
BS/MS in Computer Science, Engineering, Applied Math or related discipline; graduate degree in Data Science or other quantitative field is desirable

Prior experience as a Data Scientist

Relevant coursework or project experience utilizing modeling techniques such as logistic regression, Naïve Bayes, SVM, decision trees, or neural networks

Strong understanding of algorithms and data structures

Strong proficiency with R, SQL, Snowflake and Python including the Tensorflow, Keras, and XGBoost libraries

Strong knowledge of database systems, data modeling, ETL tools, data APIs, and data warehousing solutions

Experience implementing ways to improve data reliability, efficiency, and quality

EMR experience such as Spark, AWSGlue, Dask, and Hadoop

Experience with Docker, ECR, and Kubernetes a plus

Advanced experience with MS Office and Google Suite

Strong quantitative, math, and problem solving skills

Years of Experience: :

Bachelor’s Degree

To apply for this job please visit www.resume-library.com.