About: Untargeted accurate strain-level classification of a priori unidentified organisms using tandem mass spectrometry is a challenging task. Reference databases often lack taxonomic depth, limiting peptide assignments to the species level. However, the extension with detailed strain information increases runtime and decreases statistical power. In addition, larger databases contain a higher number of similar proteomes. We present TaxIt, an iterative workflow to address the increasing search space required for MS/MS-based strain-level classification of samples with unknown taxonomic origin. TaxIt first applies reference sequence data for initial identification of species candidates, followed by automated acquisition of relevant strain sequences for low level classification. Furthermore, proteome similarities resulting in ambiguous taxonomic assignments are addressed with an abundance weighting strategy to improve candidate confidence. We apply our iterative workflow on several samples of bacterial and viral origin. In comparison to non-iterative approaches using unique peptides or advanced abundance correction, TaxIt identifies microbial strains correctly in all examples presented (with one tie), thereby demonstrating the potential for untargeted and deeper taxonomic classification. TaxIt makes extensive use of public, unrestricted and continuously growing sequence resources such as the NCBI databases and is available under open-source license at https://gitlab.com/rki

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Untargeted accurate strain-level classification of a priori unidentified organisms using tandem mass spectrometry is a challenging task. Reference databases often lack taxonomic depth, limiting peptide assignments to the species level. However, the extension with detailed strain information increases runtime and decreases statistical power. In addition, larger databases contain a higher number of similar proteomes. We present TaxIt, an iterative workflow to address the increasing search space required for MS/MS-based strain-level classification of samples with unknown taxonomic origin. TaxIt first applies reference sequence data for initial identification of species candidates, followed by automated acquisition of relevant strain sequences for low level classification. Furthermore, proteome similarities resulting in ambiguous taxonomic assignments are addressed with an abundance weighting strategy to improve candidate confidence. We apply our iterative workflow on several samples of bacterial and viral origin. In comparison to non-iterative approaches using unique peptides or advanced abundance correction, TaxIt identifies microbial strains correctly in all examples presented (with one tie), thereby demonstrating the potential for untargeted and deeper taxonomic classification. TaxIt makes extensive use of public, unrestricted and continuously growing sequence resources such as the NCBI databases and is available under open-source license at https://gitlab.com/rki_bioinformatics. Goto Sponge NotDistinct Permalink

An Entity of Type : fabio:Abstract, within Data Space : wasabi.inria.fr associated with source document(s)

Attributes	Values
type	abstract
value	Untargeted accurate strain-level classification of a priori unidentified organisms using tandem mass spectrometry is a challenging task. Reference databases often lack taxonomic depth, limiting peptide assignments to the species level. However, the extension with detailed strain information increases runtime and decreases statistical power. In addition, larger databases contain a higher number of similar proteomes. We present TaxIt, an iterative workflow to address the increasing search space required for MS/MS-based strain-level classification of samples with unknown taxonomic origin. TaxIt first applies reference sequence data for initial identification of species candidates, followed by automated acquisition of relevant strain sequences for low level classification. Furthermore, proteome similarities resulting in ambiguous taxonomic assignments are addressed with an abundance weighting strategy to improve candidate confidence. We apply our iterative workflow on several samples of bacterial and viral origin. In comparison to non-iterative approaches using unique peptides or advanced abundance correction, TaxIt identifies microbial strains correctly in all examples presented (with one tie), thereby demonstrating the potential for untargeted and deeper taxonomic classification. TaxIt makes extensive use of public, unrestricted and continuously growing sequence resources such as the NCBI databases and is available under open-source license at https://gitlab.com/rki_bioinformatics.
Subject	Proteomics Peptides National Institutes of Health Tandem mass spectrometry Biological databases Biological classification Online databases Forensic genetics Biological nomenclature Taxonomy (biology)
part of	An iterative and automated computational pipeline for untargeted strain-level identification using MS/MS spectra from pathogenic samples
is abstract of	An iterative and automated computational pipeline for untargeted strain-level identification using MS/MS spectra from pathogenic samples
is hasSource of	covid:ann/target/416cc519a02da381ae4165a71d4a4e6acf734978 covid:ann/target/75903b509927d2f2f24f023c3260e6d202a3893a covid:ann/target/2278876cf2fe80d8d5253666d862339cc63160f9 covid:ann/target/3e9734bb5f6e8bc588bdb81f72547f8f67574f09 covid:ann/target/63cf4c09b9ccd5b551ce2f9490e968bfc42bcff3 covid:ann/target/97473dcef35157d9ed7e70e0a1ebae390a5ab781 covid:ann/target/048b02d7d721f22811fbfedca996fad3cfff83d3 covid:ann/target/4a4add47f4a20054b3fd516d5b4ada18b5482150 covid:ann/target/67ea06d6a39a538ad6e316446a3db316c783dcd9 covid:ann/target/298e1893d816f5dced89236819228afee45f6e2e covid:ann/target/30461aea3910b97f1fce7744fc2fea3e909004a2 covid:ann/target/72181e0cff0f62c60e5b722d13d8535cee7b0d55 covid:ann/target/aa78013429cf4c793b748ac737bc71e3251faa38 covid:ann/target/cf404ae4d208f65dec4b6e7bb99d77714a2380c4 covid:ann/target/90413ec5ea881c0604df6fd80ae3e0a537997cf1 covid:ann/target/6eb0cccc1643e9a2cabec95befd87a0488372b71 covid:ann/target/156c37bc5f4d1fa83d8a092f5c4f6d7076fb2694 covid:ann/target/3bf7c57d21caaaeebcc306cc86cacf33d4af365f covid:ann/target/5ff485535fdc2e8e7441ca37f3dddb3c898a65f5 covid:ann/target/86c8bce6937c63c495468b449e04654528a97308 covid:ann/target/b714a4e9c3711d7bdc186ad3829176fd67d4f1ed covid:ann/target/c03198f0aa8f503220fe09935d89c9fab0da3731 covid:ann/target/4261fc5f81590ba6b46695e140ff349091857605 covid:ann/target/efe43039f1a56e730f60fffee97051dc2b6b01dd covid:ann/target/84f04a6bf9edcccb15e5109feed7e779d29e88c1 covid:ann/target/b1f06fb471f6a11ec209cf0395d84da25d2022dd covid:ann/target/de19035e28bd8f57c7f49a001a9425cdc4fcda7d covid:ann/target/db5a437b5cc51bd2543e15a48c397f60610684dd covid:ann/target/5e6aa3f78baa4eea78465e47e89c391a01b4e41a covid:ann/target/67ce5739b8681c4d12989988edc532acd2332479 covid:ann/target/32ac889d801493e3b9945fdf4730cfbce679013f covid:ann/target/97cc3c4c950179c138594defc6daf0713a7866c1 covid:ann/target/fcfe71c76e4f2dcab71d8a4f5b936fca513c9004 covid:ann/target/909a1ba3b923068cbe576d622ea14d1298f93a97 covid:ann/target/ab7b8120f64b94c8ac0551d6246773ce4af59376 covid:ann/target/497d281b031ad08a91d964136f65f4158df10e6c covid:ann/target/521f6663ec3b2a1cdbee17b036b113a2664d6a5f covid:ann/target/953e2db9992a1d05ff828007dd7ae63725feae67 covid:ann/target/b3d494e17f2dd405dbfc822b04f7da1836d12a3d covid:ann/target/1be16bc9592dec0863873d0839e29039b1a9bb6d covid:ann/target/3b6e56e70020395a90bba81d32176f5626ef2385 covid:ann/target/3d1ce16bdde5ba634f47c04c6a5840c63b543d82 covid:ann/target/93d90fe9a1f64777a565d2351f31e9ab79f20c43 covid:ann/target/e7daa9013f00c659361fd8f2fbfe90069c1a4dc7 covid:ann/target/61bcbfe338b3ee1628f864071ee17f572254702b covid:ann/target/9f2ce54b47dcd4430a5d34cb700ecc013577dfbf covid:ann/target/ca6978fd5d2e9e97d24a7106b0c34ac024fd2dac covid:ann/target/d9f6427594f9d70fb639f3ca713d5c7fc8dc17b0 covid:ann/target/92be099554f4f285010c78b310812206a6d06ef8 covid:ann/target/41aeaadc1fd078625b933f42d41c0c8205cea1ec covid:ann/target/465bab7b2cce6424f6b67a55f724043b53eee729 covid:ann/target/375a84bf6ada938e2de39a13d8fd505eccda502c covid:ann/target/a2dd15d834b269090c8aa07f77ff090be09c3f0f covid:ann/target/cee03fb70dcb2b7af5504f1d722c0a25a7cb2a07 covid:ann/target/f5d26e92907fd8aa0bada55dae6055003c211ff4 covid:ann/target/f9890227a8575f40cebdbec0f5c6ad40e285a0a5 covid:ann/target/083a9e7f20b2a05985afd9c3aa5598bf1a89ca76 covid:ann/target/0932e1f49df5067976b70b5e155dba1b61a98770 covid:ann/target/bd30ad09e08e58d7095c8ce5a9a72bd80499ff0c covid:ann/target/c4a72b921af01ff8ea7e45b68d113f01ef29ac28 covid:ann/target/e9eb5e0fb6ab574893acacb746e5b3516f31b8e3 covid:ann/target/e55e1fde2780a88ee09367181bb353615a8d469b covid:ann/target/8b3ba0192897302ed6dcc1479c656234a91358fe covid:ann/target/4d7698507958ed9184f3a56842814f52f0aa5c70 covid:ann/target/4d964acb5e6b49b54c63d7421ef5d9f160c89a2e covid:ann/target/73751ac0bb18ec8e30b7a7b8373b6fc734195c70 covid:ann/target/95a826e52812469441712b23dbed128c55d07042 covid:ann/target/0f9fed0b8ed2ef925593e0b20c6335514f4ee2c4 covid:ann/target/ff91a189f2718cd307a9b58fb31282c748458787 covid:ann/target/18df4e5b9d4353bf19ca7bc16b37333da174d746 covid:ann/target/60d4cdf3e805c57a8181c8b555499c75090cbbe5 covid:ann/target/b30e4161d39ba2668bad992a22af1c740c38814e covid:ann/target/1aab9d2395e842c802c9e71ec26ba380d68201a6 covid:ann/target/bbcf72f5be9434d1d74e3a73d41d43bb5942ba0a covid:ann/target/e7b179c9687cefebcf08793981430b762f0b59b0 covid:ann/target/4463ea011d0c68314124ea868bf3f1a6248daf41 covid:ann/target/ff3b5e1dad50140ff8e9f81b72e21eba22bf8f61 covid:ann/target/d58735ed8ba7595240ab0ebf47812b31bb3cda1b covid:ann/target/1a3a6aaf07b5920bcab504241d2861849c650d01 covid:ann/target/37cd598e85d2a0b4815cd2b6184ff0359f7a19b4 covid:ann/target/8c84471178c07a2ace2e774cb181735254ca8aae covid:ann/target/81e53c7a5af320919e10fecf3f4949cb781bb20f covid:ann/target/f1f33c4dc6e853151042e4fa4fe26a6b1334810f covid:ann/target/07ca3b33bbaf9f69c9f611ac85bb693652f86367 covid:ann/target/7cb8395fa3b64650b74829ed8cc02f65111371e7 covid:ann/target/2cc3c13dc5d0188fcb9e08dc34dd96c7f565c0a7

Faceted Search & Find service v1.13.91 as of Mar 24 2020

Alternative Linked Data Documents: Sponger | ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3229 as of Jul 10 2020, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (94 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software