About the Team

MGnify is EMBL-EBI’s microbiome-derived sequence data analysis resource, which performs the archiving, assembly and analysis of amplicon, metagenomic and metatranscriptomic data. One of its major data products is the MGnify Protein Database. This ever-growing resource currently contains >2.5 billion non-redundant protein sequences, a number that will more than double in the next release, due in 2026. There are also >700 million associated predicted protein structures, a number that will continue to grow with scalable AlphaFold predictions. The storage, indexing, presentation and accessibility of a dataset of this size present many exciting opportunities for developing scalable solutions. We are looking for somebody who is keen to take on these challenges and to help design and implement the technical solutions needed to ensure the MGnify Protein Database serves the current and next generation of data scientists.

Duties & Responsibilities 

In this role you will:

  • Design and manage scalable PostgreSQL or MySQL database architectures for multi-million-row datasets, including schema optimization and data ingestion workflows

  • Oversee the ingestion and release of datasets in Google Cloud BigQuery

  • Design and implement web APIs and user interfaces to enable non-technical users to query large datasets

  • Create and maintain documentation and code examples for users across different proficiency levels

  • Diagnose and resolve technical issues promptly

  • Handle day-to-day responsibilities including responding to user queries and participating in team stand-ups

  • Collaborate with team members through code reviews and peer feedback

  • Present technical developments at team meetings and to internal and external collaborators at EMBL

  • Attend meetings, conferences, and collaborative events that may require overseas travel and occasional work outside standard hours

You have (Requirements) 

  • A master's-level education in a computational or related discipline, or a demonstrable equivalent level of experience

  • Demonstrated experience writing robust, production-quality code

  • Proficiency in software development best practices including revision control and agile methodologies

  • Strong collaboration and teamwork skills

  • Proficient in Python development

  • Extensive hands-on experience with relational databases, including schema design and optimization

  • Experience with data science tools for large-scale data processing, such as Dask, Polars, or Pandas

  • Good written and verbal communication skills

  • Ability to work as part of a team as well as independently on problems

You might also have (Desirable)

  • Demonstrable experience handling large, complex datasets that can be considered 'Big Data'

  • Experience developing REST APIs and front-end applications using modern frameworks such as Vue.js or React

  • Proficiency with Git and collaborative coding platforms such as GitHub or GitLab

Behaviors we value in our team:

You bring curiosity, drive, and a proactive mindset to everything you do.

You possess strong communication skills, with the ability to explain complex technical concepts clearly.

You are a self-starter able to manage multiple priorities and deadlines, and you are collaborative and effective in multidisciplinary, international teams.

Other helpful information 

Hybrid Working: At EMBL-EBI we are pleased to offer hybrid working options for all our employees. A dedicated desk will be available every day; our team works two days on site and three from home.

Contract length: 1 year (Project based)

Salary: Grade 5 or 6, with a monthly salary from £3,303 or £3,695 after tax, plus generous benefits (excluding pension and insurance contributions)

Why join us

Do something meaningful
At EMBL-EBI you can apply your talent and passion to accelerate science and tackle some of humankind's greatest challenges. EMBL-EBI, part of the European Molecular Biology Laboratory, is a worldwide leader in the storage, analysis and dissemination of large biological datasets. We provide the global research community with access to publicly available databases and tools which are crucial for the advancement of healthcare, food security, and biodiversity.
 

Join a culture of innovation
We are located on the Wellcome Genome Campus, alongside other prominent research and biotech organisations, and surrounded by beautiful Cambridgeshire countryside. This is a highly collaborative and inclusive community where our employees enjoy a relaxed atmosphere. We are committed to ensuring our employees feel valued, supported and empowered to reach their professional potential.  Watch this video to see how EMBL-EBI makes an impact.

Enjoy lots of benefits:

  • Financial incentives: Monthly family, child and non-resident allowances, annual salary review, pension scheme, death benefit, long-term care, accident-at-work and unemployment insurances

  • Flexible working arrangements - including hybrid working patterns 

  • Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover)

  • Generous time off: 30 days annual leave per year, in addition to public holidays

  • Relocation package including installation grant (if required)

  • Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely)

  • Family benefits: On-site nursery, 10 days of child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances

  • Benefits for non-UK residents: Visa exemption, education grant for private schooling, financial support to travel back to your home country every second year and a monthly non-resident allowance.

For detailed information please visit our employee benefits page here

What else you need to know

  • International applicants: We recruit internationally and successful candidates are offered visa exemptions. Please take a look at our International Applicants page for further information.  

  • EMBL is a signatory of DORA. Find out how we apply DORA principles to our recruitment and performance assessment processes here.

  • Diversity and inclusion: At EMBL, we believe that diverse teams drive innovation and scientific excellence. We encourage applications from candidates of all genders, identities, nationalities and/or any other diverse backgrounds.

  • How to apply: To apply please submit a cover letter and a CV through our online system. Applications will close at 23:59 CET on the date shown below. We aim to provide a response within two weeks after the closing date.

Closing Date

08/03/2026


 

