Suppr超能文献

由 F1(BALB/c×C57BL/6)小鼠长读 VDJ-C 序列的单倍型分析定义的 BALB/c IGHV 参考集。

A BALB/c IGHV Reference Set, Defined by Haplotype Analysis of Long-Read VDJ-C Sequences From F1 (BALB/c x C57BL/6) Mice.

机构信息

Immunology Division, The Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.

Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, United States.

出版信息

Front Immunol. 2022 Jun 3;13:888555. doi: 10.3389/fimmu.2022.888555. eCollection 2022.

Abstract

The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB/c x C57BL/6) hybrid animals. Strain variations were identified in the and genes, and analysis of VDJ rearrangements led to the inference of 278 germline IGHV alleles. 169 alleles are not present in the C57BL/6 genome reference sequence. To establish a set of expressed BALB/c IGHV germline gene sequences, we computationally retrieved IGHV haplotypes from the IgM dataset. Haplotyping led to the confirmation of 162 BALB/c IGHV gene sequences. A musIGHV398 pseudogene variant also appears to be present in the BALB/cByJ substrain, while a functional musIGHV398 gene is highly expressed in the BALB/cJ substrain. Only four of the BALB/c alleles were also observed in the C57BL/6 haplotype. The full set of inferred BALB/c sequences has been used to establish a BALB/c IGHV reference set, hosted at https://ogrdb.airr-community.org. We assessed whether assemblies from the Mouse Genome Project (MGP) are suitable for the determination of the genes of the IGH loci. Only 37 (43.5%) of the 85 confirmed IMGT-named BALB/c IGHV and 33 (42.9%) of the 77 confirmed non-IMGT IGHV were found in a search of the MGP BALB/cJ genome assembly. This suggests that current MGP assemblies are unsuitable for the comprehensive documentation of germline IGHVs and more efforts will be needed to establish strain-specific reference sets.

摘要

常用于抗体介导的人类疾病模型的近交系小鼠品系的免疫球蛋白基因特征描述较差,这会影响数据分析。为了推断 BALB/c 小鼠的免疫球蛋白基因,我们使用长读长 SMRT 测序技术从 BALB/c×C57BL/6 的 F1 杂交动物中扩增 VDJ-C 序列。在 和 基因中发现了品系变异,VDJ 重排分析导致推断出 278 个胚系 IGHV 等位基因。169 个等位基因不存在于 C57BL/6 基因组参考序列中。为了建立一组表达的 BALB/c IGHV 胚系基因序列,我们从 IgM 数据集计算上检索 IGHV 单倍型。单倍型分析证实了 162 个 BALB/c IGHV 基因序列。在 BALB/cByJ 亚系中似乎也存在 musIGHV398 假基因变体,而在 BALB/cJ 亚系中高度表达功能性 musIGHV398 基因。只有四个 BALB/c 等位基因也存在于 C57BL/6 单倍型中。推断出的完整 BALB/c 序列集已用于建立 BALB/c IGHV 参考集,网址为 https://ogrdb.airr-community.org。我们评估了小鼠基因组计划(MGP)的组装是否适合确定 IGH 基因座的基因。在对 MGP BALB/cJ 基因组组装的搜索中,仅发现了 85 个经 IMGT 命名的 BALB/c IGHV 中 37 个(43.5%)和 77 个经非 IMGT 命名的 IGHV 中 33 个(42.9%)。这表明当前的 MGP 组装不适合全面记录胚系 IGHV,需要做更多的工作来建立特定于品系的参考集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eefe/9205180/482b897534b6/fimmu-13-888555-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验