This job has Expired

northeastern_university.jpg

Associate Bioinformatician

Job Description


About the Opportunity

The Research Computing (RC) team at Northeastern University (NU) seeks a motivated, self-starting individual to be a member of our dynamic team as an Associate Bioinformatician. The successful candidate will be a key link between the RC team and the research community at NU, including faculty and students across a broad range of departments, as well as outside users and partners, in order to help them understand and utilize RC resources for their research and teaching.

As an Associate Bioinformatician at NU, your primary objective will be to support and enhance the research enterprise through direct interaction with faculty and students.  You will assist research groups in taking full advantage of Discovery, NU’s 1,700+ node HPC cluster installed at the Massachusetts Green High Performance Computing Center.  You will help with optimization of code and leveraging of scheduler features to maximize throughput of jobs.  You will work with research groups to help identify, implement and improve their computational and data workflow, as well as assist in facilitating computing and data infrastructure solutions specific to Northeastern University researchers.  As a PhD level scientist, you will also participate in grant funding opportunities, and author research papers and presentations, with faculty members at Northeastern University. Additionally, you will have the opportunity to design and lead projects, working directly with RC Graduate Research Assistant (GRA) and Co-op student workers.

At Team ITS, your success matters as much as the mission. Learn more about our flexible, highly dynamic, and values-first culture at careers.its.northeastern.edu

This position is eligible for remote work.

QUALIFICATIONS

  • Ph.D. in biology, molecular biology, biostatistics, bioinformatics, computational biology, or other related discipline

  • Strong scientific background in biology (molecular biology, genetics, developmental biology, biochemistry, etc)

  • High proficiency in Python

  • Knowledge in biostatistics packages (R or Matlab) and SQL queries

  • Proficiency working in Unix/Linux environments

  • Proficiency in commonly used command-line tools (Bedtools, Blast+, SAMTools, Bowtie2, Picard, GATK, FastQC, etc)

  • Experience in a batch HPC environment with a parallel file system using Slurm

  • Experience in an academic or research community environment

  • Experience with scientific workflows

  • Excellent interpersonal communication skills

  • Ability to understand and communicate statistical measures for interrogating the quality of data manipulation

  • Ability and willingness to learn new technologies and remain current in developing trends in the scientific computing community

  • Ability to communicate and document clearly

  • Demonstrated strong individual writing skills

  • Desire for continuous self-improvement and maintenance of skills through training

  • Curious by nature and unafraid of asking questions until you understand well-enough to teach someone else

  • Self-starter, with a proven track record for creativity and initiative

  • Able to prioritize multiple projects and work streams in a fast-paced environment

Preferred Qualifications

  • Postdoctoral work at a computing facility or biologically focused industry research environment

  • Strong publication record consistent with a history of collaborative research in computational science

  • High proficiency in R

  • Proficiency with pipelines for handling reads from different platforms (Illumina, IonTorrent, PacBio, etc)

  • Experience with parallel programming paradigms, including but not limited to; MPI, OpenMP, and CUDA

  • Experience with database management including SQL, as well as non-relational databases such as MongoDB

  • Knowledge of different mapping algorithms for next gen sequencing reads to published genomes

  • Knowledge of de novo genome assembly tools and algorithms

  • Knowledge of sources of error in library preparation and bioinformatics analysis of next gen sequencing data

  • Knowledge of machine learning and data mining techniques

KEY RESPONSIBILITIES

Software and Infrastructure Support

  • Partner with scientist and research staff to efficiently and effectively use NU HPC clusters at MGHPCC, to be expanded to over 10,000 cores with high bandwidth, low latency interconnects, and 3+ petabytes of data storage

  • Work with research groups to help identify, implement, and improve their computational and data workflows; assist in facilitating computing and data infrastructure solutions specific to NU researchers

  • Participate in the research and deployment of new technologies such as GPUs, Big Data and ML technologies (Hadoop, Tensorflow, etc), and cloud computing resources (AWS, Azure, GCP)

  • Develop effective pipelines for high throughput analysis, maximizing the utilization of computational resources

  • Troubleshoot, isolate, and resolve application errors, and other technical issues

  • Benchmark application codes, develop regression tests, and deploy application performance tools

  • Assist biologically focused faculty in the development of appropriate software stacks for analysis of sequencing and biological samples

  • Assist in developing and writing proposals to enhance the research enterprise at NU

  • Work with scientists and researchers to effectively use complementary national resources, including at national laboratories and other user facilities (TACC, NCSA, SDSC, MSI, ALCF, OLCF, etc)

Data Management and Workflow Development

  • Help define and deploy a comprehensive computational and data vision for NU researchers

  • Participate in the development and implementation of backup policies

  • Participate in the integration of HPC and storage resources with research equipment such as sequencers, microscopes, etc.

Internal and External Documentation

  • Attend scientific, RC, or HPC centric conferences to explore technologies relevant to the mission of RC at NU

  • Interface with senior management to develop a documentation strategy (e.g. user manuals, interface documentation, etc.)

  • Communicate progress and participate in reviews with technical staff and senior management

Community Engagement

  • Develop new and creative ways to engage the user community through both documentation, as well as through workshops and community events

  • Develop, present, and maintain training sessions and workshops relevant to RC services and resources

  • Mentor RC GRA and Co-op student workers

Position Type

Information Technology

Additional Information

Northeastern University considers factors such as candidate work experience, education and skills when extending an offer.  


Northeastern has a comprehensive benefits package for benefit eligible employees. This includes medical, vision, dental, paid time off, tuition assistance, wellness & life, retirement- as well as commuting & transportation. Visit https://hr.northeastern.edu/benefits/ for more information.  


Northeastern University is an equal opportunity employer, seeking to recruit and support a broadly diverse community of faculty and staff.  Northeastern values and celebrates diversity in all its forms and strives to foster an inclusive culture built on respect that affirms inter-group relations and builds cohesion. 

All qualified applicants are encouraged to apply and will receive consideration for employment without regard to race, religion, color, national origin, age, sex, sexual orientation, disability status, or any other  characteristic protected by applicable law.

To learn more about Northeastern University’s commitment and support of diversity and inclusion, please see www.northeastern.edu/diversity .


*Please mention you saw this ad on AcademicJobs.*

Apply Now

Be Seen By Recruiters at the
Best Institutions

Create Your FREE Profile Now!