IMMIGRATION Sr. Data Scientist II - Job ID 556
Houston, TX
Operations – Data Science & Analytics /
Full-Time /
Hybrid
Intelligent Medical Objects (IMO) (Houston, TX) seeks a Sr. Data Scientist II to directly influence both our products and clients by being intricately involved in running analytical experiments in a methodical manner and will regularly evaluate alternate models via theoretical approaches.
#LI-DNI
Specific Duties Include:
- Specific duties include: analyze and process textual data for bioinformatics applications using Melax CLAMP software kit and clinical NLP techniques; design, customize, and extend existing Melax software suite and web-service applications according to Melax product needs and customer requirements; develop, maintain, and improve NLP applications that process unstructured biomedical texts into structured and searchable information; modify and improve current Melax products by developing and incorporating the cutting-edge machine learning and deep learning algorithms and techniques for enhanced performance and usability; communicate with customers, analyze their NLP needs and requirements, deliver products and projects, and provide assistance; work within the NLP development team to develop NLP modules in different programming or scripting languages such as Java, JavaScript. J2EE, HTML; conduct pre-processing and quality analyses for textual data inputs and performance validation for NLP output; create systematic testing, error-checking procedures, and user manuals; conduct customer consultation and technical support on NLP training, installation, development, and deployment; share knowledge with team members and across the organization on topics including new and emerging NLP methods and technologies; build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies; and build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics. Must take and pass Python/Java coding test to solve one NLP algorithm problem. Option to work remotely 60% of the time.
Position Requires:
- Bachelor’s degree, or foreign equivalent, in Computer Information Systems, Informatics, or a closely related field of study, plus 5 years of experience in the job offered, or as an NLP Developer, NLP Data Engineer, NLP Data Scientist, Research Assistant, or a closely related NLP position. Must have 5 years of experience in the following: developing NLP applications and building machine learning models; developing ETL pipelines and processes in big data environments; deploying, maintaining, versioning, and A/B testing machine learning models; working in at least one of these databases: AWS Redshift, Oracle, SQL Server, or MySQL; using SQL to write complex queries across large volumes of data; developing and deploying full-stack solutions in Python; using and following standardized development practices and tools, including TFS/GIT, code standards, and process standards; writing unit tests using standard unit test frameworks; and working with statistical techniques, concepts, methods, and approaches, and working with their application; using multivariate calculus and linear algebra. Must have 3 years of experience with the following: working with AWS tools including Lambda and Sagemaker; creating and using process documentation and workflows; working with TensorFlow and TensorFlow Serving; working with statistical modeling using R or Matlabl; using big data frameworks such as Spark/pySpark; and working with Tableau, Looker, Qlikview, R Shiny or similar data visualization tools. Must also have 2 years of experience using infrastructure-as-code tools, like terraform. Must take and pass Python/Java coding test to solve one NLP algorithm problem. Option to work remotely 60% of the time