Data Science Director jobs in Connecticut

Data Science Director establishes, plans, and administers the overall policies and goals of the data science function. Provides strategic guidance and overall direction for analytical efforts. Being a Data Science Director determines the appropriate tools, techniques, staffing and methodologies to extract data that produces meaningful results. Uses extensive knowledge and research into big data tools to guide the integration of new and existing tools into the organization's data science tech stack. Additionally, Data Science Director typically requires a master's degree in computer science, mathematics, engineering or equivalent. Typically reports to top management. The Data Science Director manages a departmental sub-function within a broader departmental function. Creates functional strategies and specific objectives for the sub-function and develops budgets/policies/procedures to support the functional infrastructure. To be a Data Science Director typically requires 5+ years of managerial experience. Deep knowledge of the managed sub-function and solid knowledge of the overall departmental function. (Copyright 2024 Salary.com)

C
Data Engineer
  • Catalytic Data Science
  • Westport, CT FULL_TIME
  • Data Engineer III (Large Language Models)

     

    About Catalytic Data Science (CDS):

    Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate volumes of scientific resources, data, and analytic tools while providing the ability to network with colleagues in one secure and scalable environment.   By enabling R&D teams to work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy.  Our customers are passionate about making the world a better place, and we are inspired by the opportunity to help them.


    The Role

    You are a Data Engineer with experience in processing terabytes of data and working with large language models (LLMs). You have experience in creating and automating scalable, fault-tolerant, and reproducible data pipelines for natural language processing (NLP) using Amazon AWS technologies. You will design and implement data ingestion, processing, and storage solutions that can handle massive amounts of text data from various sources. You are interested in helping to create a platform completely built on top of AWS. You are eager to join a team of Life Scientists and Software Engineers that believe the brightest minds in research should have the best tools to drive innovation. 

    What You’ll Do

    • Build, test, and operate automated Extract, Transform, and Load (ETL) pipelines that process terabytes of text data nightly
    • Develop service frontends around our various backend data stores (AWS Aurora, MySQL, Elasticsearch, S3)
    • Rapidly protype, test, and deploy data pipelines for LLMs using AWS.
    • Collaborate with data scientists and NLP engineers to understand the data requirements and specifications for LLMs and related tasks such as text summarization, translation, and question answering.
    • Optimize the performance, reliability, and scalability of the data pipelines and LLMs by applying best practices and techniques such as data partitioning, caching, compression, and monitoring.
    • Ensure the quality, integrity, and security of the data by implementing data validation, cleaning, and governance policies and procedures.
    • Research and evaluate new technologies and methods for data engineering and LLMs and stay updated with the latest trends and developments in the field.
    • Participate in data architecture and engineering decisions, bringing your strong experience and knowledge to bear.

    Qualifications

    • Bachelor's degree or higher in computer science, engineering, or a related field.
    • 3 years of experience in data engineering, preferably with large-scale text data and LLMs and 6 years of any software engineering experience (including data engineering).
    • Proficient in Python 3 or Java, preferably both.
    • Experience with data modeling, ETL, and data warehouse design and implementation.
    • Expertise with ETL schedulers such as Airflow, Prefect or similar frameworks.
    • Familiar with LLMs and NLP concepts and frameworks such as Transformers, BERT, GPT, PaLM, and LLaMA.
    • Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, & SNS
    • Experience extracting, processing, storing, and querying of petabyte-scale datasets
    • Familiarity with building and using containers
    • Familiarity with event-based microservices
    • Strong communication, collaboration, and problem-solving skills.

     

    Core Skills:

    1. ETL Processes
    2. Data Modeling and Database Design
    3. Proficiency in Large Language Models
    4. Data Pipeline Optimization
    5. Cross-functional Collaboration
    6. Problem-solving and Analytical Skills 

    Nice-to-Haves

    • Prior experience with Elasticsearch (custom development and/or administration) is a huge plus
    • Knowledge of Graph databases


    What Do We Love in Team Members? 

    Your specialization is less important than your ability to learn fast and adapt to shifting technologies. We’re especially fond of people who:

    • Focus on customer’s needs and our company’s goals, not just writing code
    • Iterate until customers love what you’ve built
    • Self-start and initiate
    • Self-organize
    • Strive to grow personally and professionally, beyond just expanding technical abilities
    • Love to experiment with new technology and share knowledge with the team



    In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

  • 2 Days Ago

C
Data and Machine Learning Scientist
  • Catalytic Data Science
  • Westport, CT FULL_TIME
  • Position Title: Data and Machine Learning Scientist About Catalytic Data Science (CDS): Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate the volumes of scientific re...
  • 2 Days Ago

P
Director of National Payer Account Management
  • PNT Data
  • Middletown, CT FULL_TIME
  • We are a team of engineers, business analysts, data scientists, support specialists, painters, musicians, philanthropists and much more. PNT is a privately-owned company in Middletown, CT with over 17...
  • 26 Days Ago

P
Director of Esports & Computer Center and Esports Head Coach
  • Putnam Science Academy
  • Putnam, CT FULL_TIME
  • Position Type: Full Time - 12 months Hours: Regular school hours General Job Description: A Director of Esports is a 12-month employee. The Director will report directly to the school Athletic Directo...
  • 11 Days Ago

Y
Data Science Software Engineer
  • Yale New Haven Health
  • Stratford, CT OTHER
  • OverviewTo be part of our organization, every employee should understand and share in the YNHHS Vision, support our Mission, and live our Values. These values - integrity, patient-centered, respect, a...
  • 3 Days Ago

E
Data Science Intern
  • EXL Service
  • Hartford, CT INTERN
  • Job Title: Data Science Intern Job Description: Works with the AI team in the area of Generative AI, especially large language models or adjacent areas, such as vector databases Applying Generative AI...
  • 25 Days Ago

H
Director of Data Science
  • Henderson Scott
  • New York, NY
  • **Must be local to NYC (on site 2 days a week in midtown) and MUST be coming from either the Consumer Goods Industry or ...
  • 4/26/2024 12:00:00 AM

G
Director of Data Science
  • Glocomms
  • Houston, TX
  • A private equity firm with an emphasis on utilizing data science and analytics is seeking a Director of Data Science to ...
  • 4/26/2024 12:00:00 AM

B
Director of Data Science
  • Burtch Works
  • Columbia, MD
  • We are working with a client in the education space that is looking for a director-level, Data Scientist to oversee thei...
  • 4/25/2024 12:00:00 AM

A
Director of Data Science
  • Aegistech
  • New York, NY
  • A Full-time, Director of Data Science - NLP, LLM and GenAI job is available with our client, a leader in risk management...
  • 4/24/2024 12:00:00 AM

G
Director of Data Science
  • Glocomms
  • Houston, TX
  • A private equity firm with an emphasis on utilizing data science and analytics is seeking a Director of Data Science to ...
  • 4/23/2024 12:00:00 AM

B
Director of Data Science
  • Burtch Works
  • Chicago, IL
  • We are working with a client in the education space that is looking for a Director level Data Scientist to oversee their...
  • 4/22/2024 12:00:00 AM

B
Director of Data Science
  • Burtch Works
  • Columbia, MD
  • We are working with a client in the education space that is looking for a director-level, Data Scientist to oversee thei...
  • 4/22/2024 12:00:00 AM

S
Director of Data Science
  • Solomon Page
  • New York, NY
  • Our client, a curated marketplace for Gourmet Food & Food Gifts is looking for a Director of Data Science. As a key memb...
  • 4/22/2024 12:00:00 AM

Connecticut is bordered on the south by Long Island Sound, on the west by New York, on the north by Massachusetts, and on the east by Rhode Island. The state capital and fourth largest city is Hartford, and other major cities and towns (by population) include Bridgeport, New Haven, Stamford, Waterbury, Norwalk, Danbury, New Britain, Greenwich, and Bristol. Connecticut is slightly larger than the country of Montenegro. There are 169 incorporated towns in Connecticut.The highest peak in Connecticut is Bear Mountain in Salisbury in the northwest corner of the state. The highest point is just east...
Source: Wikipedia (as of 04/11/2019). Read more from Wikipedia
Income Estimation for Data Science Director jobs
$226,362 to $283,643

Data Science Director in Medford, OR
New technologies are driving the pace of scientific discovery in an unprecedented manner, and in the process, generating reams of data.
January 24, 2020
Cal U’s statistics and data science degree is a comprehensive bachelor’s degree program that teaches you to think like a data scientist while you master the essential software skills and data analysis tools that are being used in top data science jobs across industries.
January 06, 2020
Data Science Director in Savannah, GA
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract value from data.
February 08, 2020
Data Science Director in Port Arthur, TX
Data science reveals trends and produces insights that businesses can use to make better decisions and create more innovative products and services.
February 16, 2020