Data Scientist – Rice University Ken Kennedy Institute for Information Technology

The Ken Kennedy Institute at Rice University, established in 1987, is dedicated to solving critical global challenges with collaborative approaches that focus on innovations in computing and harnessing the transformative power of data. The Institute is the virtual home of over two hundred faculty members and senior researchers at Rice University spanning computer science, mathematics, statistics, engineering, natural sciences, humanities, social sciences, business, architecture and music. Primarily, the Institute is a catalyst for research collaborations across the conventional boundaries of university, department, center and laboratory. Our increasingly complex world demands solutions that span disciplines. Today, most of our activities focus on machine learning, AI, data science, and high-performance computing, and on how these can provide insight and enable convergent research in medicine, engineering, natural sciences, social sciences, humanities and the arts.
The data scientist will manage clinical research projects involving the application of novel machine learning algorithms; identify core requirements and key stakeholders for successful project completion; and identify project bottlenecks, find solutions, and push project implementation forward. The data scientist will also manage all aspects of data access and data cleaning/standardization in order to prepare raw data for feature creation.
Successful candidates will work within multiple teams and need to be self-driven and comfortable working independently.
This is a benefits-eligible, two-year, term-limited position with the possibility of extension based upon need and available funding.

Experience Preferred

  • Experience gained as an undergraduate or graduate student, or through training after graduation, is acceptable
  • Experience in Python
  • Experience applying statistics to clinical data
  • Experience applying machine learning algorithms to clinical data
  • Experience in cloud environments
  • Experience with Linux and Spark (or related)
  • Experience with imaging data
  • Experience in designing and implementing efficient algorithms
  • Experience designing machine learning algorithms and applying them to large data sets
  • Experience developing data pipelines that include model application

Skills Required

  • Excellent listening, verbal and written communication, analytical, and research skills
  • Superior scientific and numerical skills with meticulous attention to detail and accuracy
  • Proficient and comprehensive knowledge of information management, knowledge management and publication management
  • Ability to work in a team environment, to participate actively, to collaborate and to motivate others in the lab
  • Excellent critical thinking, technical, data collection and interviewing skills
  • Excellent statistical and graphical analysis skills
  • Ability to maintain quality, safety and/or infection control standards
  • Ability to plan and schedule effectively

Essential Functions

  • Conducts complex independent specialized research and experiments
  • Collects, tracks and analyzes data using spreadsheets and databases
  • Communicates the results of research through verbal presentations, written reports, and written articles submitted for publication
  • Monitors research budget
  • Assists in the administration of research projects
  • May lead a team of research personnel
  • Performs all other duties as assigned

Additional Functions

  • Manages clinical research projects involving the application of novel machine learning algorithms
  • Identifies core requirements and key stakeholders for successful project completion
  • Identifies project bottlenecks, finds solutions, and pushes project implementation forward
  • Manages all aspects of data access and data cleaning/standardization in order to prepare raw data for feature creation
  • Collaborates with Rice faculty to apply and develop data science and machine learning algorithms for clinical data
  • Collaborates with data team to provide information and decision support
  • Creates presentations that communicate data insights and findings to external technical and non-technical audiences
  • Functions as part of multiple teams across institutions
  • Coordinates with multiple scientists to move grant process forward
  • Develops and writes high-quality grant proposal narratives and supporting documents
  • Supervises and shepherds papers derived from projects
  • Manages and trains graduate students when applicable
  • Communicates with research partners to improve the research infrastructure
  • Leads in the development of modules and tools to perform various tasks as required

Interested applicants can find out more information at the RICEWorks! website at