Duties: Responsible for the design, development, and implementation of a Data Lake solution with petabytes of data for JPMC’s Risk and Finance Organizations. Responsible for building ETL processes to copy data from different source systems onto the Hadoop platform to build machine learning models. Build Data Quality controls for data ingested from source to target. Design and set up a machine learning platform to develop and train machine learning models. Design and manage a Big Data Hadoop machine learning platform with petabytes of storage and 50k+ vcores for model development and training using Big Data tools, such as Hive, Impala and Sqoop. Design, implement and support advanced machine learning tools, such as Anaconda packages, XGBoost, SciPy, NumPy, pandas and TensorFlow, and support model development. Responsible for performance tuning of Spark jobs and implementing best practices in Spark job development. Support a machine learning platform with a large number of users and set up platform controls and governance processes. Set up new tools aligned with firmwide control processes. Automate platform monitoring tools using Python and Unix shell scripts, including setting up real-time alerts for users with high compute and storage consumption on the platform, archiving and cleaning up unused tables past their retention period, and detecting unauthorized package installation on the platform. Design and develop machine learning fraud models using graph databases, such as TigerGraph. Set up and support a TigerGraph cluster to support model development. Utilize AWS services (EC2, S3, EMR and SageMaker) to migrate machine learning models from Hadoop to AWS.
Minimum education and experience required: Bachelor’s degree or equivalent in Information Technology, or related field, plus 7 years of related experience in application development, or related experience; OR Master’s degree or equivalent in Information Technology, or related field, plus 5 years of related experience in application development, or related experience.
Skills Required: Experience in designing and developing applications using Hive, Impala and Spark. Experience in performance tuning of jobs running on a distributed database architecture. Experience in application development on the Cloudera Big Data platform and with Hadoop tools. Experience in design, coding and implementation of Data Lake and Data Load activities. Experience in platform monitoring and in building tools to automate platform monitoring. Experience in development and support of applications using NoSQL databases. Experience in automation of platform monitoring tools using Python and Unix shell scripts, including setting up real-time alerts for users with high compute and storage consumption on the platform, and archiving and cleanup of unused tables past their retention period. Experience managing small to medium teams to complete deliverables. Experience in managing data lake and data warehouse solutions that bring in data from different source systems using ETL technologies. Experience in designing applications with distributed databases to handle large datasets. Experience in building Data Quality controls for data ingested from source to target systems. Employer will accept any amount of professional experience with the required skills.
JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.
Equal Opportunity Employer/Disability/Veterans