Institute of Bioinformatics, University of Georgia, Athens, GA, United States.
Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, GA, United States.
Front Cell Infect Microbiol. 2021 Jan 14;10:608298. doi: 10.3389/fcimb.2020.608298. eCollection 2020.
Cryptosporidium is a protist parasite that has been identified as the second leading cause of moderate to severe diarrhea in children younger than two and a significant cause of mortality worldwide. Cryptosporidium has a complex, obligate, intracellular but extracytoplasmic lifecycle in a single host. How genes are regulated in this parasite remains largely unknown. Long non-coding RNAs (lncRNAs) play critical regulatory roles, including in the regulation of gene expression, across a broad range of organisms. Parasite lncRNAs have been reported to enter the host cell nucleus and affect the host response. However, no systematic study has been conducted in Cryptosporidium to identify additional lncRNAs. In this study, we analyzed a strand-specific RNA-seq developmental time series covering both asexual and sexual stages to identify lncRNAs associated with parasite development. In total, we identified 396 novel lncRNAs, mostly antisense, 86% of which were differentially expressed. Surprisingly, nearly 10% of annotated mRNAs have an antisense transcript. lncRNAs occur most often at the 3' end of their corresponding sense mRNA. Putative lncRNA regulatory regions were identified, and many appear to encode bidirectional promoters. A positive correlation between lncRNA and upstream mRNA expression was observed. Evolutionary conservation and expression of lncRNA candidates were observed across three species. Ten protein-encoding genes with antisense transcripts have orthologs that also have antisense transcripts. Three lncRNAs with exceptional properties (e.g., intron splicing) were experimentally validated using RT-PCR and RT-qPCR. This initial characterization of the non-coding transcriptome facilitates further investigations into the roles of lncRNAs in parasite development and host-pathogen interactions.