Overview

The candidate is expected to take part in and coordinate the strategic scientific development of software tools and frameworks for the generation and provision of installation a of large, data integration platform which enables interoperability of plant phenotyping data collected from Purdue’s phenotyping facilities (ICSC and AAPF), and potentially other facilities/sites. The candidate will closely interact with a wide variety of PIs who generate data of different types and/or use these data for their research activities. The candidate will also need to work with Purdue’s AgIT team in the implementation of the data ecosystem – a high performance data management and data processing infrastructure to enable high-throughput computation for plant research across multiple facilities/sites. Additional duties will include but are not limited to:

  • Structure and standardize data to establish the foundation that facilitate interoperability of data; collect and prepare metadata following the Findability, Accessibility, Interoperability, and Reusability (FAIR) principle
  • Implement data processing and data management platform(s), and develop software packages that are capable of handling large volume of scientific data
  • Enable useful access to the data management platform and software using a standard application programming interface (API) and a graphical user interface (GUI)
  • Take the lead in deliver data (raw and processed data)
  • Provide technical supports, guidance, coaching and training in image analysis to customers and partners
  • Communicate results to key stakeholders
  • Develop strategies and data policies that support technical aspects of data interoperability

Required:

  • Bachelor’s degree in computer science, information technology, computational biology, agricultural and biosystem engineer or similar field
  • 4 years of data management and/or bioinformatics
  • Strong knowledge and experience in data management, specifically in database applications
  • Knowledge and experience in distributed computing and/or the parallelization of applications
  • Knowledge and experiences in multiple programming languages and platforms, such as Python, R, C/C++, Matlab
  • Capability to work independently or as a team member
  • Strong skills in spoken and written English

Tagged as: EARLY CAREER