Yesterday
Secret
Senior Level Career (10+ yrs experience)
$150,000 and above
No Traveling
IT - Data Science
Lexington, MA (On-Site/Office)
Our client is in need of a Software Developer -- Specialization in Multilingual Research Data Development to:
1. Collect and analyze text, audio and physiological data;
2. Design and create multi-lingual databases;
3. Author study protocols and successfully apply to Human Subject Review (HSR) boards for approval;
4. Design and implement audio QA/QC tools, procedures, and best practices;
5. Maintain and manage an audio facility, including computer systems maintenance, hardware support, and interaction with vendors;
6. Create and maintain Data Security Plans (DSP) and Loan Agreements;
7. Implement Natural Language Processing (NLP) and Machine Learning (ML) tools and techniques for HLT evaluation and applications.
❖ Position Scope and Job Functions
1. HLT Data Collection: Implement Natural Language Processing (NLP) and Machine Learning (ML) tools and techniques to create and enhance data for Human Language Technology (HLT) evaluation and applications. Use DOMINO Lincoln workflow system for creating interactive foreign language training and testing materials.
2. Human Language Technology Evaluation: Ability to identify and apply benchmarks to evaluate AI model performance. Ability to create custom multi-lingual datasets for the evaluation of machine translation (MT) and automatic speech recognition (ASR) systems.
3. Generative AI: Expertise in applying Large Language Models (LLMs) and generative AI to operational systems. Skills include programming with llama-cpp, Mistral, GPT4All, ChatGPT, Orca, and transformer-based models generally.
4. Audio/Speech QA/QC: Define, create and implement audio applications for measuring and enhancing audio and speech recording quality; assess speech corpora integrity; coordinate with and provide guidance to subcontractors supplying speech corpora.
5. Advanced audio data analysis: Ability to design, implement and confirm the performance of an audio data collection method for the speech intelligibility evaluation of wearable acoustic sensors.
6. Human Subjects Protocols: Design, author and implement study protocols for the collection of multilingual speech and multi-modal databases. Submit to and maintain protocols with HSR boards and US DoD Human Research Protection agencies.
7. Manage Data Collection Equipment and Facility: Maintain and manage the Group 24 sound room facility, including specifying equipment needs, coordinating efforts across multiple Groups, creating calibrated acoustic noise simulations, and implementing study protocols for collecting multi-modal data from human subjects. Author and implement the procedures necessary to provide and preserve the capability to perform in-field speech and acoustic noise data collections and speech communication.
8. Laboratory Facilities: Ability to work closely with the Facilities division of MIT LL to design and specify new laboratory spaces. Ability to interface with the technical team, understand how the laboratory spaces will need to be designed to address technical needs, and communicate the design specifications to the Facilities division.
❖ Required Skills
1. HLT Research Experience: Experience with Java, Python, MATLAB, git, and Digital Audio Workstation (DAW) tools such as Adobe Audition, Audacity, SoX (Sound eXchange), etc.; must include experience using machine learning techniques and natural language processing tools to create HLT data sets. Familiarity with foreign language corpus development is required for this work. Requires experience designing crowdsourcing jobs for text annotation and experience with JSON and SQL databases. Experience directing subject matter experts to create interactive foreign language training and testing materials.
2. Human Subjects Experience: Authoring Study Protocols and successfully submitting them for approval to MIT and DoD Human Subject Review Boards for the purpose of multi-sensor data collections and language-learning systems performance. Demonstrated ability to train new personnel in implementing human subjects data collection protocols is required.
3. Sound Room Management: Specify and Maintain equipment. Data Collection Hardware: MacOS and MS Windows platforms, professional audio interfaces, loudspeaker playback systems, audio microphone and multi-modal sensors (heart rate, skin conductance, etc.) data collection systems. National Instruments data collection systems; Portable audio recording systems and Sound Pressure Level (SPL) meters. Demonstrated ability to author and maintain Data Security Plans and Loan Agreements for off-site equipment. Solid understanding of audio equipment usage is required.
4. Independence and Reliability: Demonstrated ability to work independently to complete complex projects on a tight schedule. Requires strong communication skills for interacting with various MIT-LL groups, human subjects, and subcontractors. Demonstrated ability to lead and coordinate teams to produce deliverables on tight deadlines.
5. HLT/Machine Learning: Demonstrated experience implementing Machine Learning and Human Language Technology / Natural Language Processing tools and services.
6. Software dev-ops: Demonstrated ability to work in an agile development cycle, including issues, projects, pull request review, UI and unit testing, Jenkins builds, and Artifactory storage and deployment.
❖ Preferred Skills
1. Experience with digital signal processing
2. Experience in digital speech communication test and evaluation
3. Experience with JSON and SQL databases
4. Experience in extracting and analyzing data from social media platforms
group id: 10107773