Advanced Search
Search Results
42 total results found
OAsis User Guide
Welcome to the OAsis high-performance computing cluster, which OneAsia Network Limited actively maintains. This user guide will quickly walk you through the cluster design and provide examples of consuming our computing resources. Content is grouped into...
To all users
To researchers
To project leads
To technical/billing owners
Case studies
Troubleshooting
Brief introduction to the cluster
The cluster consists of many components to provide a good experience for various tasks for various users. Below are some highlights: Variety in compute nodes Parallel file system storage Fast Infiniband and Ethernet network SSH servers cluster Web porta...
Access the cluster
There are three ways to access the cluster: web portal, SSH, and CLI client. This article will cover how they authenticate users. Web portal You should be able to log in to the web portal https://oasishpc.hk using the provided username and password. If this...
Builtin software
The OAsis HPC cluster has some standard software built-in. We provide them via Lmod and Containers. Users may choose the way they are comfortable with. Lmod All software and module files are put in the parallel file system, and accessible in any compute node...
Submit jobs
To cater to users from various knowledge backgrounds with different preferences, we provide multiple ways to use the cluster's resources. SLURM is our job scheduler. When you need computing resources, you submit a job. The system will then verify your request...
Fine tune your workload
The web portal provides data for you to understand the actual utilization of your jobs, running or completed. When you have a job running, click jobs, then the running jobs from the top menu bar to see the below screen. This screen tells the duration and cons...
Access your files
The cluster has a parallel file system which provides fast and reliable access to your data. You should have at least 3 file sets: User home directory For storing your persistent data Located at /pfss/home/$USER Environment variable: $HOME The default...
Finding help
We recommend you reach us by creating a support ticket through the web portal if you need help. You may contact us by email or phone if you can't access the portal. Create a ticket in the portal Please follow the below steps to file a ticket: Log in to t...
Manage your team
If you are an account coordinator, you may invite new users or revoke their access. In addition, you may view all your linked accounts by clicking Accounts in the top menu bar. The default account is used when you submit a job without specifying a particula...
Custom software
When the built-in software doesn't fit your needs, feel free to bring your software to the cluster. This article covers how you can do this in Lmod and containers and how to share it with your teammates. Lmod First, please study the official Lmod guide about...
Jobs, quota, and setup alerts
You may want to check the jobs teammate submitted to ensure they are reasonably leveraging your resources. This article covers how you check jobs, how the quota system works, and how we can set up alerts to let the system monitor for you. Check running, queui...
Manage accounts and quotas
Account hierarchy If you are on behalf of an institute or an enterprise, most likely, you will have a billing account. Then you may have consumer accounts to consume resources with jobs or sub-billing accounts to grant departments autonomy power. Each sub-ac...
Billing, cost allocation and reports
You may download your monthly bill anytime under the "Billing" tab on your account page. This article covers how OAsis HPC charges. There are three components, and we will discuss them one by one. Monthly storage charge Resource usage charge Fixed additi...
Quick jobs
Quick job is one of our web portal's features. It is an excellent way to unify and speed up your team's workflow. For example, you may define what computing resources are required, what software to use, and where the output goes. You may also expose options to...
Run docker-based workload on HPC with GPU
In this case study, we will walk thru how to convert a docker image into singularity format and import it into the cluster, how to look up appropriate hardware, and finally enqueue a job. Due to security concerns, OAsis HPC supports Singularity rather than Do...
Render 3D graphics with Blender
Welcome to OAsis! If you're looking to render 3D graphics using the cluster GPU, we've got you covered. Here's a quick guide on how to get started: Request a VNC interactive session: To begin, you'll need to submit the VNC quick job from the porta...
AI painting with stable diffusion
The OAsis cluster is equipped with 80GB A100 GPUs that can be leveraged to create artwork using a generative AI model called Stable Diffusion. This model supports text-to-image generation, image-to-image generation, and image inpainting. If you're inter...
Run and train chatbots with OpenChatKit
OpenChatKit provides an open-source framework to train general-purpose chatbots. It includes a pre-trained 20B parameter language model as a good starting point. At least 40GB of VRAM is required to load the 20B model. So a full 80GB A100 is required. Firs...
PyTorch with GPU in Jupyter Lab using container-based kernel
The easiest way to kick start deep learning is to use our Jupyter Lab feature with container kernel. This article shows how this is achieved using the OAsis web portal. Jupyter Lab It is an exceptional tool for interactive development, particularly in deep l...
Run NVIDIA-Merlin MovieLens Example in Jupyter Lab
NVIDIA-Merlin NVIDIA Merlin is an open source library that accelerates recommender systems on NVIDIA GPUs. The library enables data scientists, machine learning engineers, and researchers to build high-performing recommenders at scale. Merlin includes tools t...
Multinode PyTorch Model Training using MPI and Singularity
Why multiple nodes? Multinode training in PyTorch allows for the distribution of the computational workload across multiple nodes, which results in faster model training and increased scalability. By leveraging multiple nodes, each with its own set of resourc...