Data Engineer for text Mining
General Information:
* Start date: ASAP
* latest Start Date: ideally June
* Planned duration: 31.12.2024
* Extension (in case of limitation): possible
* Workplace: Basel
* Workload: 100%
* Remote/Home Office: hybrid model, multiple days a month onsite required. High level of flexibility possible for candidate
* Travel: nein
* Team: 2-3 colleagues in core team
Tasks & Responsibilities:
* Text Data Repository (RoMine) data curation, and FAIRification (cleaning, parsing, disambiguation, deduplication, data harmonization etc.)
* Enhancing the user experience for RoMine (eg. building data warehouse, API)
* Contributing to the development of a Vector Store
* Establishing common terminologies to link literature and omics data and harmonization of study and biosample level metadata.
* User support for utilizing RoMine datasets
* Data integration DDA Value Stream
Must Haves:
* At least 5 years of experience in related fields
* University degree (Bachelor, Master, PhD)
* Data engineering, data warehousing and database experience
* Knowledge of the FAIR principles
* Good understanding in the area of Text Analytics, TDM and LLMs
* Experience with biological pathway and network content
* Exposure to high performance computing environments
* Python scripting
* Biomedical background/education
* Basic knowledge in scientific literature /published content/literature or/and other text based data
* Basic knowledge on OMICS data
* English fluent
