Job Title: Data Scientist (NLP) (Hybrid/ 3 days onsite)
Location: Addison, TX, or Charlotte, NC
Job Overview:
Location: Addison, TX, or Charlotte, NC
Job Overview:
- The Enterprise Complaints Operations and Enablement - Reporting and Analytics AI team designs and develops AI automation machine learning solutions that drive daily business decisions.
- Qualified individuals should have a keen interest in combining data science skills to engineer automation solutions, help drive decisions, and are passionate about breaking down complex business problems and providing well-design data science solutions that makes a significant impact.
- Must have a strong background in working with text data using natural language processing (NLP) including classification modeling, unsupervised modeling techniques such as clustering and topic modeling.
- Background in working with transformer models.
- Strong Python coding skills are a must.
- Strong SQL skills, data mining, analytics, wrangling and data cleaning skills and strong feature engineering and entity/parts of speech tagging.
- Strong understanding of word embeddings and finding like patterns in text with cosine similarity.
- Background in ElasticSearch or building semantic search solutions a plus
- 2 - 4 years of professional experience in NLP programming
- Experience with finetuning Large Language Models (LLMs) or writing language models from scratch.
- Demonstrated background in NLP and machine learning. Preference for degrees in computer science, information retrieval, statistics, applied math, or other quantitative field
- Demonstrated track record of publication in peer reviewed journals and conferences
- Track-record of having developed Client algorithms
- Experience in deep learning - phonetics (valued)
- Experience in one or more of the following areas: entity/relation extraction, information extraction, summarization, semantics, document classification, ontology, question answering, and knowledge graph
- Experience on modern deep learning approaches to NLP: word/paragraph embedding's, structured prediction, sentiment analysis, disambiguation
- Ability to consistently deliver results across shifting priorities and deadlines in fast paced environment
- Ability to work with a 'sense of urgency' in order to meet critical deadlines
- Detail oriented with strong investigative and problem-solving skills
- Strong communication skills, both verbal and written
- Ability to proactively engage in a teamwork culture
- Advanced degree in Data Science with a focus in NLP valued
- Experience with BERT family of models, open source foundational LLMs such as LLAMA, Falcon or Mistral.
- Ability to indirectly manage peer level associates who are part of problem-solving teams
- Background in teaching NLP algorithms to beginners and advanced Data Scientists
- Experience with large scale data analysis tools such as Spark, Hadoop
- Background in machine learning techniques
- NLP programming in R and or Python / Anaconda
- Background with moving large data sets across Hadoop, SQL, ORACLE
- Solid background in statistical learning techniques for NLP (HMMs, CRFs, SVMs, LDA, LSI, MRFs)
- NLP algorithm implementation experience as well as the ability to create / modify standard algorithms (e.g. change objectives, work-out the math and implement)