Data Warehousing Director jobs in Bridgeport, CT

Data Warehousing Director directs the team responsible for the design, implementation, maintenance, and support of data warehouse systems and projects. Has overall responsibility for the development of data warehouse strategy, architecture and standards that meet business needs. Being a Data Warehousing Director ensures that new data platforms can be successfully integrated with existing warehouses, and that data warehouse systems fully support organization-wide data initiatives. Requires a bachelor's degree. Additionally, Data Warehousing Director typically reports to senior management. The Data Warehousing Director manages a departmental sub-function within a broader departmental function. Creates functional strategies and specific objectives for the sub-function and develops budgets/policies/procedures to support the functional infrastructure. To be a Data Warehousing Director typically requires 5+ years of managerial experience. Deep knowledge of the managed sub-function and solid knowledge of the overall departmental function. (Copyright 2024 Salary.com)

C
Data Engineer
  • Catalytic Data Science
  • Westport, CT FULL_TIME
  • Data Engineer III (Large Language Models)

     

    About Catalytic Data Science (CDS):

    Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate volumes of scientific resources, data, and analytic tools while providing the ability to network with colleagues in one secure and scalable environment.   By enabling R&D teams to work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy.  Our customers are passionate about making the world a better place, and we are inspired by the opportunity to help them.


    The Role

    You are a Data Engineer with experience in processing terabytes of data and working with large language models (LLMs). You have experience in creating and automating scalable, fault-tolerant, and reproducible data pipelines for natural language processing (NLP) using Amazon AWS technologies. You will design and implement data ingestion, processing, and storage solutions that can handle massive amounts of text data from various sources. You are interested in helping to create a platform completely built on top of AWS. You are eager to join a team of Life Scientists and Software Engineers that believe the brightest minds in research should have the best tools to drive innovation. 

    What You’ll Do

    • Build, test, and operate automated Extract, Transform, and Load (ETL) pipelines that process terabytes of text data nightly
    • Develop service frontends around our various backend data stores (AWS Aurora, MySQL, Elasticsearch, S3)
    • Rapidly protype, test, and deploy data pipelines for LLMs using AWS.
    • Collaborate with data scientists and NLP engineers to understand the data requirements and specifications for LLMs and related tasks such as text summarization, translation, and question answering.
    • Optimize the performance, reliability, and scalability of the data pipelines and LLMs by applying best practices and techniques such as data partitioning, caching, compression, and monitoring.
    • Ensure the quality, integrity, and security of the data by implementing data validation, cleaning, and governance policies and procedures.
    • Research and evaluate new technologies and methods for data engineering and LLMs and stay updated with the latest trends and developments in the field.
    • Participate in data architecture and engineering decisions, bringing your strong experience and knowledge to bear.

    Qualifications

    • Bachelor's degree or higher in computer science, engineering, or a related field.
    • 3 years of experience in data engineering, preferably with large-scale text data and LLMs and 6 years of any software engineering experience (including data engineering).
    • Proficient in Python 3 or Java, preferably both.
    • Experience with data modeling, ETL, and data warehouse design and implementation.
    • Expertise with ETL schedulers such as Airflow, Prefect or similar frameworks.
    • Familiar with LLMs and NLP concepts and frameworks such as Transformers, BERT, GPT, PaLM, and LLaMA.
    • Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, & SNS
    • Experience extracting, processing, storing, and querying of petabyte-scale datasets
    • Familiarity with building and using containers
    • Familiarity with event-based microservices
    • Strong communication, collaboration, and problem-solving skills.

     

    Core Skills:

    1. ETL Processes
    2. Data Modeling and Database Design
    3. Proficiency in Large Language Models
    4. Data Pipeline Optimization
    5. Cross-functional Collaboration
    6. Problem-solving and Analytical Skills 

    Nice-to-Haves

    • Prior experience with Elasticsearch (custom development and/or administration) is a huge plus
    • Knowledge of Graph databases


    What Do We Love in Team Members? 

    Your specialization is less important than your ability to learn fast and adapt to shifting technologies. We’re especially fond of people who:

    • Focus on customer’s needs and our company’s goals, not just writing code
    • Iterate until customers love what you’ve built
    • Self-start and initiate
    • Self-organize
    • Strive to grow personally and professionally, beyond just expanding technical abilities
    • Love to experiment with new technology and share knowledge with the team



    In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

  • Just Posted

C
Data and Machine Learning Scientist
  • Catalytic Data Science
  • Westport, CT FULL_TIME
  • Position Title: Data and Machine Learning Scientist About Catalytic Data Science (CDS): Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate the volumes of scientific re...
  • Just Posted

S
Executive Director of Prospect Management, Data Analytics & Pipeline Optimization
  • Sacred Heart University
  • Fairfield, CT FULL_TIME
  • About Sacred Heart University: As the second-largest independent Catholic university in New England, and one of the fastest-growing private doctoral institutions in the U.S., Sacred Heart University i...
  • 7 Days Ago

A
Data Analyst
  • AaraTechnologies Inc
  • Shelton, CT FULL_TIME
  • Job DetailsJob Description:We are seeking a Data Analyst to join our team. The ideal candidate will have 2-4 years of experience in data analysis, strong problem-solving skills, and a passion for work...
  • Just Posted

B
Data Associate
  • Bridgewater Associates LP
  • Westport, CT FULL_TIME
  • About Bridgewater Bridgewater Associates is a premier asset management firm, focused on delivering unique insight and partnership for the most sophisticated global institutional investors. Our investm...
  • 14 Days Ago

H
Data Analyst
  • Hatch IT
  • Westport, CT FULL_TIME
  • Hatch I.T. is partnering with a Pryon, to find a Data Analyst. See details below:About the role:Pryon is searching for a talented Data Analyst to be a pivotal player in driving data-backed decision-ma...
  • 1 Month Ago

Filters

Clear All

  • Filter Jobs by companies
  • More

0 Data Warehousing Director jobs found in Bridgeport, CT area

B
Area Director
  • Brinks
  • New Britain, CT
  • The Brink's name is a promise to respect the trust we've earned in over 150 years in business. Every employee honors tha...
  • 4/25/2024 12:00:00 AM

G
IT Director
  • Grayscale Investments
  • Stamford, CT
  • Grayscale Investments is the world's largest digital currency asset manager. Through its family of investment products, ...
  • 4/25/2024 12:00:00 AM

T
Director of Advancement
  • The Maritime Aquarium at Norwalk
  • Norwalk, CT
  • DIRECTOR OF ADVANCEMENT The Maritime Aquarium at Norwalk seeks a full time Director of Advancement. POSITION OVERVIEW Th...
  • 4/22/2024 12:00:00 AM

S
Director of Philanthropy
  • Stamford Center For The Arts' Palace Theatre
  • Stamford, CT
  • Stamford Center for the Arts (SCA) operates the Palace Theatre, a 1600-seat performing arts center in the heart of Stamf...
  • 4/22/2024 12:00:00 AM

W
Director of Development
  • Westchester Parks Foundation
  • Mount Kisco, NY
  • Westchester Parks Foundation invests in, advocates for, and enhances the over 50 parks of the Westchester County Parks s...
  • 4/22/2024 12:00:00 AM

C
Program Director
  • Connecticut Renaissance
  • Bridgeport, CT
  • Job Description Job Description Summary: The Program Director of Community Release Program directly manages the Work Rel...
  • 4/21/2024 12:00:00 AM

V
Executive Director
  • Verigent
  • Bethel, CT
  • Job Title: Executive Director Location: Bethel, CT (relocation assistance provided if needed) Duration: Direct Hire, Per...
  • 4/21/2024 12:00:00 AM

S
Director of Sales
  • Sunrise Senior Living
  • Stamford, CT
  • Overview "Sunrise is the best place that I've ever worked, simply because of the people. We provide quality care in an e...
  • 4/19/2024 12:00:00 AM

Bridgeport is a historic seaport city in the U.S. state of Connecticut. It is in Fairfield County, at the mouth of the Pequonnock River on Long Island Sound, 60 miles from Manhattan and 40 miles from The Bronx. It is bordered by the towns of Trumbull to the north, Fairfield to the west, and Stratford to the east. As of 2017, Bridgeport had an estimated population of 146,579, which made it the largest city in Connecticut and the fifth-most populous in New England. The Greater Bridgeport area is the 48th-largest urban area in the United States. The showman P. T. Barnum was a resident of the cit...
Source: Wikipedia (as of 04/11/2019). Read more from Wikipedia
Income Estimation for Data Warehousing Director jobs
$211,555 to $278,092
Bridgeport, Connecticut area prices
were up 1.7% from a year ago

Data Warehousing Director in Wilmington, NC
Data warehousing specialists (DWSs) design, develop, and maintain data warehouses.
November 28, 2019
Data Warehousing Director in Provo, UT
"But the reality is also that today, a lot of data is unstructured.
February 15, 2020
Data Warehousing Director in Fort Smith, AR
There have been rapid advances in cloud data warehousing technology.
January 31, 2020