<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Slurm - Tag - Xiaopeng Xu</title><link>https://xu-xp.com/tags/slurm/</link><description>Slurm - Tag - Xiaopeng Xu</description><generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</managingEditor><webMaster>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</webMaster><lastBuildDate>Sat, 24 Oct 2020 00:00:00 +0000</lastBuildDate><atom:link href="https://xu-xp.com/tags/slurm/" rel="self" type="application/rss+xml"/><item><title>Common Slurm Commands</title><link>https://xu-xp.com/posts/slurm_commons/</link><pubDate>Sat, 24 Oct 2020 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/slurm_commons/</guid><description><![CDATA[<h2 id="相关培训">Related Training</h2>
<p><a href="https://www.hpc.kaust.edu.sa/content/data-science-training" target="_blank" rel="noopener noreffer ">https://www.hpc.kaust.edu.sa/content/data-science-training</a></p>
<p>Topics covered:</p>
<ol>
<li>
<p>Trillion-parameter scale model training and inference with DeepSpeed</p>
<ol>
<li>Code: <a href="https://github.com/kaust-rccl/deepspeed_workshop/tree/master/HelloDeepSpeed" target="_blank" rel="noopener noreffer ">https://github.com/kaust-rccl/deepspeed_workshop/tree/master/HelloDeepSpeed</a>, <a href="https://github.com/microsoft/Megatron-DeepSpeed" target="_blank" rel="noopener noreffer ">https://github.com/microsoft/Megatron-DeepSpeed</a></li>
</ol>
</li>
<li>
<p>Introduction to Containers on KSL Platforms</p>
</li>
<li>
<p>High Throughput Hyperparameter Optimization on KSL platforms</p>
</li>
<li>
<p>Distributed Deep Learning on KSL platforms</p>
</li>
<li>
<p>Data Science on-boarding on KSL platforms</p>
</li>
</ol>
<h2 id="基础操作">Basic Operations</h2>
<h3 id="登录-ibex">Logging in to iBex</h3>
<p>From outside the KAUST network, connect to the KAUST VPN first, typically with Cisco AnyConnect.</p>
<p>Log in to iBex:</p>]]></description></item></channel></rss>