Bioinformatics (Oxford, England)2020Apr01Vol.36issue(7)

NextPolish：長い読み取りアセンブリのための高速で効率的なゲノム研磨ツール

PMID：31778144DOI：10.1093/bioinformatics/btz891

文献タイプ：

Journal Article

5大医学誌の要約と
著名医師による解説が無料で読めます

会員登録(医師のみ)してログイン
すると翻訳の精度が向上します

概要

Abstract

動機：長い読み取りシーケンステクノロジーは、長い連続性を持つゲノムを生成する可能性がありますが、エラー率が高くなります。したがって、NextPolishを開発しました。これは、長い読み取りで組み立てられたゲノムのシーケンスエラーを効率的に修正するツールです。この新しいツールは、高品質の短い読み取りからK-Mersをスコアリングしてカウントするように設計された2つの相互リンクされたモジュールと、多数のベースエラーを含むゲノムアセンブリを磨くように設計されています。結果：ヒトおよび植物（シロイヌナズナ）ゲノムを使用して速度と効率を評価すると、次のポリッシュはシーケンスエラーをより速く補正し、より高い補正精度でピアンを上回りました。可用性と実装：NextPolishはCおよびPythonで実装されています。ソースコードは、https：//github.com/nextomics/nextpolishから入手できます。補足情報：補足データは、バイオインフォマティクスオンラインで入手できます。

MOTIVATION: Although long-read sequencing technologies can produce genomes with long contiguity, they suffer from high error rates. Thus, we developed NextPolish, a tool that efficiently corrects sequence errors in genomes assembled with long reads. This new tool consists of two interlinked modules that are designed to score and count K-mers from high quality short reads, and to polish genome assemblies containing large numbers of base errors. RESULTS: When evaluated for the speed and efficiency using human and a plant (Arabidopsis thaliana) genomes, NextPolish outperformed Pilon by correcting sequence errors faster, and with a higher correction accuracy. AVAILABILITY AND IMPLEMENTATION: NextPolish is implemented in C and Python. The source code is available from https://github.com/Nextomics/NextPolish. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

医師のための臨床サポートサービス

ヒポクラ x マイナビのご紹介

無料会員登録していただくと、さらに便利で効率的な検索が可能になります。

Translated by Google