Tech Tammina
Data Engineer with Machine Learning - W2 Position
Tech Tammina, New York, New York, us, 10261
Position Title: Data Engineer
Emp Type: Full Time & Direct Hire
Location: New York, NY (Hybrid; Need only locals or candidates located commutable distance to NY City)
Visa Required: USC, GCs only
Must be strong with Python for ML pipelines specifically with Pytorch and scikit-learn AWS is required, building pipelines within, Should have a background in LLM (langchain, agents, extensive prompt engineering)
Responsibilities:Ingesting, structuring and analyzing a wide range of unstructured datasourcesDesigning, maintaining and orchestrating data pipelines in an AWS environment for production processing and training flowsContinuously evaluate, analyze, test and improve the quality, privacy and performance of our data systemsContribute across the product, where - from front-end UX and product design, API/systems architecture and ML processing/trainingMinimum Qualifications:
3+ years of experience ingesting, analyzing and structuring a wide variety of datasourcesSignificant experience building and maintaining data pipelines in a production environmentStrong database/SQL, python, pandas (or equivalent) experiencePrior experience working in fast paced environments and tackling problems across the stack with quick iterations while maintaining a high quality bar.Strong Additional Qualifications:
Significant healthcare data experienceLLM experience (langchain, agents, extensive prompt engineering)MLE Experience - pytorch, scikit-learn, etc..Extensive production AWS, container and/or data orchestration experienceFullstack development experience (JS/TS/Node in particular)
Emp Type: Full Time & Direct Hire
Location: New York, NY (Hybrid; Need only locals or candidates located commutable distance to NY City)
Visa Required: USC, GCs only
Must be strong with Python for ML pipelines specifically with Pytorch and scikit-learn AWS is required, building pipelines within, Should have a background in LLM (langchain, agents, extensive prompt engineering)
Responsibilities:Ingesting, structuring and analyzing a wide range of unstructured datasourcesDesigning, maintaining and orchestrating data pipelines in an AWS environment for production processing and training flowsContinuously evaluate, analyze, test and improve the quality, privacy and performance of our data systemsContribute across the product, where - from front-end UX and product design, API/systems architecture and ML processing/trainingMinimum Qualifications:
3+ years of experience ingesting, analyzing and structuring a wide variety of datasourcesSignificant experience building and maintaining data pipelines in a production environmentStrong database/SQL, python, pandas (or equivalent) experiencePrior experience working in fast paced environments and tackling problems across the stack with quick iterations while maintaining a high quality bar.Strong Additional Qualifications:
Significant healthcare data experienceLLM experience (langchain, agents, extensive prompt engineering)MLE Experience - pytorch, scikit-learn, etc..Extensive production AWS, container and/or data orchestration experienceFullstack development experience (JS/TS/Node in particular)