Doubletdetector: a method to detect doublet cells in open chromatin regions

buir.advisor: Çiçek, A. Ercüment
dc.contributor.author: Eroğlu, Alper
dc.date.accessioned: 2020-10-12T10:03:51Z
dc.date.available: 2020-10-12T10:03:51Z
dc.date.copyright: 2020-09
dc.date.issued: 2020-09
dc.date.submitted: 2020-09-25
dc.description: Cataloged from PDF version of article. [en_US]
dc.description: Thesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2020. [en_US]
dc.description: Includes bibliographical references (leaves 44-47). [en_US]
dc.description.abstract: Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) is a simple and effective technique in genomic studies that profiles chromatin accessibility across the genome. The open regions of the genome play an important role in DNA replication and transcription. ATAC-seq has many practical applications, such as nucleosome mapping, identification of regulatory elements, cancer research, and the study of immune system aging. With advances in the underlying technology, the technique is now applied at the single-cell level in the form of single-nucleus ATAC-seq (snATAC-seq). Single-cell resolution extends the reach of ATAC-seq by helping to detect rare cell types that play roles in regulatory networks. Like other single-cell technologies, snATAC-seq suffers from doublets, which occur when multiple cells are captured and sequenced together and which confound downstream analyses. A unique property of snATAC-seq data is that, in a single diploid cell, at any given locus in the genome there can be at most two overlapping reads, one from the maternal and the other from the paternal chromosome. When a locus has more than two reads, this can be due to doublets or to alignment or sequencing errors. We propose a count-based method, DoubletDetector, that uses this property to detect doublets. It identifies doublets by counting the number of loci within a cell that have more than two ATAC-seq reads. It also determines the types of the cells that formed each doublet, to help understand their nature. DoubletDetector achieved recall near 90% for detecting simulated doublets in human PBMC and islet snATAC-seq samples. Artificial doublets were then traced back to their cells of origin with near 78% recall using a marker peak-based algorithm. DoubletDetector is the first method to effectively identify both homotypic and heterotypic doublets from snATAC-seq. [en_US]
dc.description.provenance: Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2020-10-12T10:03:51Z No. of bitstreams: 1 Thesis.pdf: 5153415 bytes, checksum: 7b274220cf1f3c06e3fd6454c980415b (MD5) [en]
dc.description.provenance: Made available in DSpace on 2020-10-12T10:03:51Z (GMT). No. of bitstreams: 1 Thesis.pdf: 5153415 bytes, checksum: 7b274220cf1f3c06e3fd6454c980415b (MD5) Previous issue date: 2020-09 [en]
dc.description.statementofresponsibility: by Alper Eroğlu. [en_US]
dc.embargo.release: 2021-03-25
dc.format.extent: xiii, 47 leaves : illustrations (color), charts (color) ; 30 cm. [en_US]
dc.identifier.itemid: B160511
dc.identifier.uri: http://hdl.handle.net/11693/54197
dc.language.iso: English [en_US]
dc.rights: info:eu-repo/semantics/openAccess [en_US]
dc.subject: snATAC-seq [en_US]
dc.subject: Doublet cell [en_US]
dc.subject: Cell type annotation [en_US]
dc.title: Doubletdetector: a method to detect doublet cells in open chromatin regions [en_US]
dc.title.alternative: Doubletdetector: açık kromatin bölgelerinde ikili hücre tespit etme aracı [en_US]
dc.type: Thesis [en_US]
thesis.degree.discipline: Computer Engineering
thesis.degree.grantor: Bilkent University
thesis.degree.level: Master's
thesis.degree.name: MS (Master of Science)
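
The counting idea described in the abstract can be illustrated with a minimal Python sketch. This is not the thesis implementation: the function names (`count_overcovered_loci`, `flag_doublets`), the input layout (tuples of cell barcode, chromosome, start, end with half-open coordinates), and the cutoff `min_loci` are all assumptions made for illustration. The abstract does not state how the count is turned into a doublet call, so the fixed threshold below is only a stand-in.

```python
from collections import defaultdict


def count_overcovered_loci(fragments, max_depth=2):
    """Count, per cell barcode, the maximal regions covered by more than
    `max_depth` fragments.

    `fragments` is an iterable of (barcode, chrom, start, end) tuples with
    half-open coordinates [start, end). This layout is an assumption.
    """
    # (barcode, chrom) -> {position: net change in coverage at that position}
    deltas = defaultdict(lambda: defaultdict(int))
    for barcode, chrom, start, end in fragments:
        deltas[(barcode, chrom)][start] += 1  # a fragment begins here
        deltas[(barcode, chrom)][end] -= 1    # a fragment ends here

    counts = {}
    for (barcode, _chrom), by_pos in deltas.items():
        counts.setdefault(barcode, 0)  # cells with no such locus report 0
        depth = 0
        in_region = False
        # Sweep positions left to right, tracking the running coverage depth.
        for pos in sorted(by_pos):
            depth += by_pos[pos]
            if depth > max_depth:
                if not in_region:
                    counts[barcode] += 1  # entered a new over-covered region
                    in_region = True
            else:
                in_region = False
    return counts


def flag_doublets(counts, min_loci):
    """Return barcodes whose over-covered locus count reaches `min_loci`.

    `min_loci` is a hypothetical cutoff, not a value from the thesis.
    """
    return {barcode for barcode, n in counts.items() if n >= min_loci}
```

In practice some over-covered loci are expected even in singlets because of alignment or sequencing errors, which is why the sketch compares the per-cell count against a cutoff rather than flagging any cell with a nonzero count.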

Files

Original bundle

Name: Thesis.pdf
Size: 4.91 MB
Format: Adobe Portable Document Format
Description: Full printable version