Falk C T, Gilchrist J M, Pericak-Vance M A, Speer M C
Lindsley F. Kimball Research Institute of The New York Blood Center, New York, NY 10021, USA.
Am J Hum Genet. 1998 Apr;62(4):941-9. doi: 10.1086/301780.
Studies of the genetics of certain inherited diseases require expertise in the determination of disease status even for single-locus traits. For example, in the diagnosis of autosomal dominant limb-girdle muscular dystrophy (LGMD1A), it is not always possible to make a clear-cut determination of disease, because of variability in the diagnostic criteria, age at onset, and differential presentation of disease. Mapping such diseases is greatly simplified if the data present a homogeneous genetic trait and if disease status can be reliably determined. Here, we present an approach to determination of disease status, using methods of artificial neural-network analysis. The method entails "training" an artificial neural network, with input facts (based on diagnostic criteria) and related results (based on disease diagnosis). The network contains weight factors connecting input "neurons" to output "neurons," and these connections are adjusted until the network can reliably produce the appropriate outputs for the given input facts. The trained network can be "tested" with a second set of facts, in which the outcomes are known but not provided to the network, to see how well the training has worked. The method was applied to members of a pedigree with LGMD1A, now mapped to chromosome 5q. We used diagnostic criteria and disease status to train a neural network to classify individuals as "affected" or "not affected." The trained network reproduced the disease diagnosis of all individuals of known phenotype, with 98% reliability. This approach defined an appropriate choice of clinical factors for determination of disease status. Additionally, it provided insight into disease classification of those considered to have an "unknown" phenotype on the basis of standard clinical diagnostic methods.
对某些遗传性疾病的遗传学研究,即使对于单基因性状,也需要在确定疾病状态方面具备专业知识。例如,在常染色体显性肢带型肌营养不良症(LGMD1A)的诊断中,由于诊断标准、发病年龄和疾病表现的差异,并不总是能够明确地确定疾病。如果数据呈现出均匀的遗传性状,并且疾病状态能够可靠地确定,那么对这类疾病进行基因定位就会大大简化。在此,我们提出一种利用人工神经网络分析方法来确定疾病状态的途径。该方法需要用输入事实(基于诊断标准)和相关结果(基于疾病诊断)对人工神经网络进行“训练”。网络包含将输入“神经元”与输出“神经元”相连的权重因子,这些连接会不断调整,直到网络能够根据给定的输入事实可靠地产生适当的输出。经过训练的网络可以用第二组事实进行“测试”,这些事实的结果是已知的,但未提供给网络,以检验训练的效果如何。该方法应用于一个LGMD1A家系的成员,该家系现已定位到5号染色体长臂。我们使用诊断标准和疾病状态来训练一个神经网络,以便将个体分类为“患病”或“未患病”。经过训练的网络对所有已知表型个体的疾病诊断重现率达到了98%的可靠性。这种方法确定了用于疾病状态判定的临床因素的合适选择。此外,它还为那些基于标准临床诊断方法被认为具有“未知”表型的个体的疾病分类提供了见解。