Job Summary: Design, develop, evaluate, integrate Client/NLP/LLM/GAI models, algorithms, and solutions
Responsible for the design and development of custom Client, Gen AI, NLP, LLM Models for batch and stream processing-based AI Client pipelines including data ingestion, preprocessing modules, search and retrieval, Retrieval Augmented Generation (RAG), NLP/LLM model development and ensure the end-to-end solution meets all technical and business requirements, and SLA specifications.
Adopt and customize LLM agent-based orchestration tools (LlamaIndex, Langchain, Semantic Kernel) for enterprise applications and use cases
Ph.D (preferred), Bachelor's or Master's degree in Computer Science, Mathematics or Statistics , Computational linguistics, Engineering, or a related field.
7+ years of professional hands-on experience leveraging large sets of structured and unstructured data to develop data-driven tactical and strategic analytics and insights using Client, NLP, computer vision solutions
Customize LLM and Client models leveraging Hugging Face, TensorFlow, Keras, PyTorch, Spark or similar statistical tools. (Expert in python pogramming.)
Develop, code, and customize Client, LLM models and pipelines in Python, (+Java/Scala is a bonus)
Analyze, design, develop, support and maintain Big Data environment and code base, support traditional data warehouses and processes. Assist and participate in the teams effort to ensure that the implemented results do meet the partners requirements and perform the necessary IT testing and validation activities.
Primary Responsibilities:
Develop natural language processing (NLP) solutions using GenAI, LLMs and custom transformer architectures
Adopt and customize LLM agent-based orchestration tools (LlamaIndex, Langchain, Semantic Kernel) for enterprise applications and use cases
Implement information search and retrieval at scale, using a range of solutions ranging from keyword search to semantic search using embeddings
Developing and/or tune Large Language Models (LLM) and Generative AI (GAI)
Analyze, design, develop, support and maintain Big Data environment and code base, support traditional data pipelines and processes.
Design application code, implement technical solutions, and configure applications in various environments in response to business problems in close collaboration with Architects, Business Analysts and Change Partners. Manage Hadoop, NoSQL, and/or MPP infrastructure supporting data. Write applications to solve analytical problems. Code, test, release, and support Big Data. Help in the design and build of the data platform over Big Data technologies. Solve big data engineering problems. Analyze, recommend, and implement data technologies for the platform. Involved in case studies about Big Data. Responsible for efficient deliveries. Work with others to propose the best technical solutions. Help design the best backend data warehouse platform to support the capacity and performance. Participate in the proof of concept application.
Ph.D (preferred), Bachelor's or Master's degree in Computer Science, Mathematics or Statistics , Computational linguistics, Engineering, or a related field.
7+ years of professional hands-on experience leveraging large sets of structured and unstructured data to develop data-driven tactical and strategic analytics and insights using Client, NLP, computer vision solutions
Demonstrated 4+ years hands-on experience with Python, Hugging Face, TensorFlow, Keras, PyTorch, Spark or similar statistical tools. Expert in python pogramming.
Expert programmer and software developer (5+ years hands-on) in Python, (+Java/Scala is a bonus)
Big Data design and development experience. Experience as data modeling Client/NLP scientist. including, but not limited to, Performance tuning, fine tuning, RLHF, performance optimization. Validated background with Client toolkits, such as PyTorch, Tensorflow, Keras, Langchain, Llamadindex, SparkML, or Databricks. Proficient with integration of data from multiple data sources Experience with NoSQL databases, such as HBase, ElasticSearch, MongoDB API Design. API/Data mapping to schema.