<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Bioinformatics - Tag - Xiaopeng Xu</title><link>https://xu-xp.com/tags/bioinformatics/</link><description>Bioinformatics - Tag - Xiaopeng Xu</description><generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</managingEditor><webMaster>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</webMaster><lastBuildDate>Thu, 08 Feb 2024 00:00:00 +0000</lastBuildDate><atom:link href="https://xu-xp.com/tags/bioinformatics/" rel="self" type="application/rss+xml"/><item><title>Ig-seq 操作说明</title><link>https://xu-xp.com/posts/ig-seq-mixcr/</link><pubDate>Thu, 08 Feb 2024 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/ig-seq-mixcr/</guid><description><![CDATA[<h2 id="数据下载">数据下载</h2>
<p>从 NCBI SRA 平台下载对应的 Ig seq 项目数据。</p>
<ul>
<li><a href="https://www.ncbi.nlm.nih.gov/Traces/study/" target="_blank" rel="noopener noreffer ">https://www.ncbi.nlm.nih.gov/Traces/study/</a></li>
</ul>
<p>先导入 google cloud 的 bucket 中，然后通过 gsutils 下载到本地或 ibex 文件夹中。</p>
<h2 id="mixcr-数据分析">MiXCR 数据分析</h2>
<h3 id="安装--mixcr">安装  MiXCR</h3>
<div class="code-block code-line-numbers open" style="counter-reset: code-block 0">
    <div class="code-header language-powershell">
        <span class="code-title"><i class="arrow fas fa-angle-right" aria-hidden="true"></i></span>
        <span class="ellipses"><i class="fas fa-ellipsis-h" aria-hidden="true"></i></span>
        <span class="copy" title="Copy to clipboard"><i class="far fa-copy" aria-hidden="true"></i></span>
    </div><div class="highlight"><pre tabindex="0" class="chroma"><code class="language-powershell" data-lang="powershell"><span class="line"><span class="cl"><span class="n">conda</span> <span class="n">install</span> <span class="n">-c</span> <span class="nb">imperial-college</span><span class="n">-research-computing</span> <span class="n">mixcr</span></span></span></code></pre></div></div>
<h3 id="ig-seq-快速使用">Ig-seq 快速使用</h3>
<div class="code-block code-line-numbers open" style="counter-reset: code-block 0">
    <div class="code-header language-powershell">
        <span class="code-title"><i class="arrow fas fa-angle-right" aria-hidden="true"></i></span>
        <span class="ellipses"><i class="fas fa-ellipsis-h" aria-hidden="true"></i></span>
        <span class="copy" title="Copy to clipboard"><i class="far fa-copy" aria-hidden="true"></i></span>
    </div><div class="highlight"><pre tabindex="0" class="chroma"><code class="language-powershell" data-lang="powershell"><span class="line"><span class="cl"><span class="c"># Align</span>
</span></span><span class="line"><span class="cl"><span class="n">Align</span><span class="err">:</span> <span class="n">mixcr</span> <span class="n">align</span><span class="err">–</span><span class="n">library</span> <span class="n">my_library</span> <span class="n">-t</span> <span class="mf">8</span> <span class="n">-r</span> <span class="n">align_log</span><span class="p">.</span><span class="py">txt</span> <span class="n">R1</span> <span class="n">R2</span> <span class="n">alignments</span><span class="p">.</span><span class="py">vdjca</span> <span class="n">-s</span> <span class="n">hs</span>
</span></span><span class="line"><span class="cl"><span class="c"># What is the my_library used here</span>
</span></span><span class="line"><span class="cl">
</span></span><span class="line"><span class="cl"><span class="c"># Assemble</span>
</span></span><span class="line"><span class="cl"><span class="n">Assemble</span><span class="err">:</span> <span class="n">mixcr</span> <span class="n">assemble</span> <span class="n">-r</span> <span class="n">assemble_log</span><span class="p">.</span><span class="py">txt</span> <span class="n">-OseparateByV</span> <span class="p">=</span> <span class="n">true</span> <span class="n">-OseparateByJ</span> <span class="p">=</span> <span class="n">true</span> <span class="n">-OseparateByC</span> <span class="p">=</span> <span class="n">true</span> <span class="n">align</span><span class="p">-</span>
</span></span><span class="line"><span class="cl"><span class="n">ments</span><span class="p">.</span><span class="py">vdjca</span> <span class="n">clones</span><span class="p">.</span><span class="n">clns</span></span></span></code></pre></div></div>
<h3 id="使用介绍">使用介绍</h3>
<p>典型的 MiXCR 工作流程主要由三个部分构成：</p>]]></description></item><item><title>Nanopore 分析流程</title><link>https://xu-xp.com/posts/nanopore_data_analysis/</link><pubDate>Thu, 08 Feb 2024 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/nanopore_data_analysis/</guid><description><![CDATA[<h2 id="基础的分析流程">基础的分析流程：</h2>
<ol>
<li>
<p>Base calling 从电信号中识别碱基序列。原始电信号是Fast5/Pod5格式，需要转为 Fastq 格式。 Fast5/Pod5-&gt; Fastq。</p>
</li>
<li>
<p>Alignment 比对到参考基因组。常用工具是 minimap2。</p>
</li>
<li>
<p>Call variants 获取突变数据，包括 GATK，VarScan 等。</p>]]></description></item><item><title>Nanopore 信号分析</title><link>https://xu-xp.com/posts/nanopore_signal_analysis/</link><pubDate>Wed, 07 Feb 2024 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/nanopore_signal_analysis/</guid><description><![CDATA[<h2 id="原始测序电信号数据格式">原始测序电信号数据格式</h2>
<h3 id="fast5-数据格式">FAST5 数据格式</h3>
<p><a href="https://github.com/nanoporetech/fast5_research" target="_blank" rel="noopener noreffer ">https://github.com/nanoporetech/fast5_research</a></p>
<p><a href="https://blog.csdn.net/Emmett_Bioinfo/article/details/113847543" target="_blank" rel="noopener noreffer ">https://blog.csdn.net/Emmett_Bioinfo/article/details/113847543</a></p>
<p><a href="https://zhuanlan.zhihu.com/p/137069950" target="_blank" rel="noopener noreffer ">https://zhuanlan.zhihu.com/p/137069950</a></p>
<p>FAST5格式（.fast5）实际上是在HDF5格式上的一种变体。HDF是Hierarchical Data Format的首字母缩写，从名字上就可以看出来这种文件格式储存信息的方式是层级嵌套的（hierarchical, nested）。它采用chunking（分解）的方式来存储多维数据，它内部表现出来的是类似于文件夹（树）的结构。由于这种文件层级分解的特性，想要获取某一部分信息，只需要获得该部分信息所在的chunk即可，这样就让这种文件格式非常的flexible，也非常适合用于多种编程语言来处理。FAST5格式是Oxford推出Nanopore测序之后在HDF5格式的基础上设计用于存储Nanopore测序信息的文件。</p>]]></description></item><item><title>组学数据下载</title><link>https://xu-xp.com/posts/omics_data_download/</link><pubDate>Sun, 01 Oct 2023 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/omics_data_download/</guid><description><![CDATA[<h2 id="原始数据下载">原始数据下载</h2>
<h3 id="european-nucleotide-archive-ena">European Nucleotide Archive (ENA)</h3>
<p><a href="https://www.ebi.ac.uk/ena/browser/view/PRJEB40872" target="_blank" rel="noopener noreffer ">https://www.ebi.ac.uk/ena/browser/view/PRJEB40872</a></p>
<p>选择后，会生成 bash 脚本，包括 wget 命令。直接运行脚本即可。</p>
<h3 id="sra-数据库">SRA 数据库</h3>
<p>大部分测序的原始数据都在 SRA。可以安装 SRA-tools 来下载相关的数据。</p>
<p>选择数据后，下载 access_list，会生成一个包含样本编号的 txt 文件。</p>]]></description></item><item><title>三维结构分析</title><link>https://xu-xp.com/posts/3d-struct-analysis/</link><pubDate>Fri, 11 Feb 2022 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/3d-struct-analysis/</guid><description><![CDATA[<h2 id="rdkit-使用">Rdkit 使用</h2>
<h3 id="小技巧">小技巧</h3>
<h4 id="禁用-log">禁用 log</h4>
<div class="code-block code-line-numbers open" style="counter-reset: code-block 0">
    <div class="code-header language-powershell">
        <span class="code-title"><i class="arrow fas fa-angle-right" aria-hidden="true"></i></span>
        <span class="ellipses"><i class="fas fa-ellipsis-h" aria-hidden="true"></i></span>
        <span class="copy" title="Copy to clipboard"><i class="far fa-copy" aria-hidden="true"></i></span>
    </div><div class="highlight"><pre tabindex="0" class="chroma"><code class="language-powershell" data-lang="powershell"><span class="line"><span class="cl"><span class="n">from</span> <span class="n">rdkit</span> <span class="n">import</span> <span class="n">RDLogger</span>
</span></span><span class="line"><span class="cl"><span class="n">RDLogger</span><span class="p">.</span><span class="py">DisableLog</span><span class="p">(</span><span class="s1">&#39;rdApp.*&#39;</span><span class="p">)</span></span></span></code></pre></div></div>
<h2 id="pymol-使用">PyMol 使用</h2>
<p>参考：<a href="http://pymol.chenzhaoqiang.com/index.html" target="_blank" rel="noopener noreffer ">http://pymol.chenzhaoqiang.com/index.html</a></p>
<h3 id="3d-模型操作">3D 模型操作</h3>
<h4 id="模型的对齐align">模型的对齐（align）</h4>
<h5 id="和-pdb-中已有模型做对齐">和 PDB 中已有模型做对齐</h5>
<div class="code-block code-line-numbers open" style="counter-reset: code-block 0">
    <div class="code-header language-powershell">
        <span class="code-title"><i class="arrow fas fa-angle-right" aria-hidden="true"></i></span>
        <span class="ellipses"><i class="fas fa-ellipsis-h" aria-hidden="true"></i></span>
        <span class="copy" title="Copy to clipboard"><i class="far fa-copy" aria-hidden="true"></i></span>
    </div><div class="highlight"><pre tabindex="0" class="chroma"><code class="language-powershell" data-lang="powershell"><span class="line"><span class="cl"><span class="n">pymol</span> <span class="n">grafting</span><span class="p">/</span><span class="n">model</span><span class="p">-</span><span class="mf">0</span><span class="p">.</span><span class="py">relaxed</span><span class="p">.</span><span class="py">pdb</span>
</span></span><span class="line"><span class="cl">
</span></span><span class="line"><span class="cl"><span class="c"># Input in pymol command line</span>
</span></span><span class="line"><span class="cl"><span class="n">fetch</span> <span class="n">4m5y</span>  <span class="c"># fetch 4m5y as framework to compare</span>
</span></span><span class="line"><span class="cl"><span class="n">alignto</span>  <span class="c"># alignto 4m5y for close comparison </span></span></span></code></pre></div></div>
<h5 id="和本地模型做对齐">和本地模型做对齐</h5>
<p>将已有的两个模型同时打开，并比较其结构的相似性。</p>]]></description></item><item><title>生物信息学会议与期刊</title><link>https://xu-xp.com/posts/bioinfo_conf_journal/</link><pubDate>Sat, 24 Apr 2021 00:00:00 +0000</pubDate><author>xiaopeng.xu@kaust.edu.sa (Xiaopeng Xu)</author><guid>https://xu-xp.com/posts/bioinfo_conf_journal/</guid><description><![CDATA[<h2 id="生物信息学两大顶会">生物信息学两大顶会：</h2>
<ol>
<li>
<p>ISMB</p>
</li>
<li>
<p>RECOMB</p>
</li>
</ol>
<p>ISMB会转投Bioinformatics杂志， RECOMB会转发Journal of Computational Biology</p>
<p> </p>
<h2 id="杂志">杂志：</h2>
<ol>
<li>
<p>Bioinformatics 好文章基本都发这上面了</p>
</li>
<li>
<p>Nature Method 偶尔有生物信息的文章</p>
</li>
<li>
<p>Genome Research (Method)</p>]]></description></item></channel></rss>