Develop and coordinate LLM frameworks for interpreting, annotating, and integrating heterogeneous scientific datasets, Implement pipelines for metadata extraction, provenance capture, and ontology alignment using foundation models, Design and benchmark methods for transfer learning, active learning, and domain adaptation in data-sparse regimes, Collaborate closely with BlueMat partners across data, modeling, and imaging to connect multimodal data streams, in particular data stewards and co-data stewards, Support the related PhD project in the same research area and guide methodological alignment between the projects, Lead publications, contribute to open-source software, and represent BlueMat’s LLM research in national and international collaborations