HPC and Research Computing Engineer

The position

The Scientific Computing and Data Analysis (SCDA) section, under the Research Support Division (RSD), promotes the effective use of High-Performance Computing (HPC) in the OIST research environment. The SCDA manages OIST's scientific computing resources and services to support computationally intensive research studies, ranging from bioinformatics to computational physics.
The HPC and research computing member will support and enhance the usage of OIST’s substantial HPC and scientific computing services. Under the direction of the SCDA Leader, the member will support usage of OIST computing resources, which involves day-to-day management of OIST HPC clusters and computing services, as well as general systems administration and programming tasks.

About OIST

The Okinawa Institute of Science and Technology Graduate University was established in 2011 to contribute to the development of science and technology worldwide and to serve as a hub of innovation in Okinawa. OIST is a dynamic new graduate university of science and technology in Okinawa Prefecture, Japan, which offers a 5-year PhD program and brings together outstanding researchers from across the country and across disciplines to conduct cutting-edge scientific research.

The university is located on 85 hectares of protected forestland overlooking a beautiful shoreline and coral reefs. The campus is striking architecturally, and the facilities are outstanding. To facilitate multidisciplinary research, there are no academic departments. Outstanding resources and equipment are provided and managed to encourage easy access and collaboration.

English is the official language of the University, and the university research community is fully international, with more than 50 countries represented. OIST is rapidly gaining recognition in the worldwide academic community as a model for excellence in education and research, and our unwavering commitment to scientific and technological innovation is dedicated to generating progress that will fuel Okinawa's economic growth.

Responsibilities
  1. Supports day-to-day operations for the HPC team by monitoring computing resource performance, managing configurations, and addressing security administration
  2. Installs, configures, and performs document management for cluster infrastructure components (OS, scheduler, storage, network, etc.)
  3. Investigates, debugs, and maintains hardware, and applies revisions to system firmware and software
  4. Deploys and operates management and monitoring tools to ensure proper HPC system operation
  5. Engages and collaborates with vendors to assist with support and maintenance activities as required
  6. Explores emerging technologies and technical developments to address expanding analytical requirements
  7. Stays current with best practices in the HPC field
  8. Contributes to a team culture of trust and transparency by sharing information openly and deliberately
  9. Performs other related duties as assigned or requested by the Section Leader
Qualifications

(Required)

  1. Bachelor’s degree in a relevant field such as computer science, computer information systems, etc., or equivalent combination of education, training, and experience
  2. 3+ years of operation and administration experience in an HPC environment for research computing using Linux/Unix variants
  3. Good organization and communication skills, verbal and written, either in Japanese or in English
  4. Ability to develop positive working relationships and a strong rapport with team members
  5. Ability to identify and resolve problems
  6. Ability to learn and apply new concepts, methods and practices
  7. Scripting experience in Bash, Perl, Ruby, or Python, or any combination
  8. Daily usage of version control tools such as Git (preferred), SVN, CVS, etc.

(Preferred)

  1. Expertise in administering, monitoring, and maintaining secure Linux/Unix-based HPC environments
  2. Automation/configuration management experience (Puppet, Ansible, Chef, Salt, Cobbler, Kickstart, etc.)
  3. Experience with HPC system software and cluster management tools (SLURM, Docker/Singularity, Enroot, etc.)
  4. Familiarity with shared and distributed memory parallelism (OpenMP, MPI) and accelerators (GPUs)
  5. Experience with HPC parallel storage, file systems (Lustre, GPFS, NFS, ZFS, TSM, Isilon, etc.), and compute node storage (SSD, NVMe, etc.)
  6. Experience with OOB management technology (BMC, IPMI, iDRAC, iLO, etc.)
  7. Hands-on experience physically deploying clusters (racking, cabling, part swapping, etc.)
  8. Good understanding of networking concepts
Compensation and Benefits

In accordance with the OIST Employee Compensation Regulations

Benefits:

Submission Documents
  • Cover letter in English required. Japanese version required only for Applicants whose native language is Japanese.
  • Curriculum vitae in English required. Japanese version required only for Applicants whose native language is Japanese.
  • 2 reference letters will be required if the Applicant is selected as a shortlisted candidate.

* Please be sure to indicate where you first saw the job advertisement.
* Prior to the start of employment, all new hires are required to successfully complete a background check. Personal information, including employment history and academic background, should be submitted to OIST after a conditional offer of employment.

Declaration
  • OIST Graduate University is an equal opportunity, affirmative action educator and employer and is committed to increasing the diversity of its faculty, students and staff. The University strongly encourages applications from underrepresented groups.
  • Information provided by applicants or references will be kept confidential; documents will not be returned. All applicants will be notified regarding the status of their applications.
  • Please see the OIST policy for rules on external professional activities.
  • Further details about the University can be viewed on the OIST website www.oist.jp.
Job Type
Research support
Starting Date
As early as possible
Employment Term

Full-time, fixed-term appointment for 2 years. The contract begins with a 3-month probationary period (inclusive). This contract may be renewed.

Working Hours

9:00-17:30 (Discretionary)

Report To
Research Support Leader, Scientific Computing & Data Analysis Section
Job Location
Main Campus: 1919-1 Tancha, Onna-son, Kunigami-gun, Okinawa
Application Due Date
Applications will be accepted until all positions are filled.
*Applications will close once the position is filled.
If you have any questions, please contact us.