Gers FA, Schmidhuber J
IDSIA, 6928 Manno, Switzerland.
IEEE Trans Neural Netw. 2001;12(6):1333-40. doi: 10.1109/72.963769.
Previous work on learning regular languages from exemplary training sequences showed that long short-term memory (LSTM) outperforms traditional recurrent neural networks (RNNs). We demonstrate LSTM's superior performance on context-free language benchmarks for RNNs, and show that it works even better than previous hardwired or highly specialized architectures. To the best of our knowledge, LSTM variants are also the first RNNs to learn a simple context-sensitive language, namely a^n b^n c^n.
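The context-sensitive benchmark a^n b^n c^n consists of strings with equal-length runs of a's, b's, and c's. As a minimal sketch (not the authors' setup), the following Python snippet generates such strings and frames them as next-symbol-prediction examples; the start/end markers "S"/"T" and the integer encoding are illustrative assumptions.

```python
# Hypothetical data-generation sketch for the language a^n b^n c^n.
# The markers "S"/"T" and the encoding are assumptions, not the paper's exact protocol.
import random

SYMBOLS = ["S", "a", "b", "c", "T"]           # start marker, alphabet, end marker
INDEX = {s: i for i, s in enumerate(SYMBOLS)}

def make_string(n):
    """Return one string of a^n b^n c^n, framed by start/end markers."""
    return ["S"] + ["a"] * n + ["b"] * n + ["c"] * n + ["T"]

def make_example(n):
    """Return (input, target) index sequences for next-symbol prediction."""
    seq = [INDEX[sym] for sym in make_string(n)]
    return seq[:-1], seq[1:]                  # target is the input shifted by one step

if __name__ == "__main__":
    random.seed(0)
    for n in random.sample(range(1, 11), 3):  # a few training lengths
        x, y = make_example(n)
        print(n, x, y)
```

Such sequences could then be fed to any recurrent sequence model; the key difficulty is that predicting when the b's and c's end requires counting the a's, which is what makes the task context-sensitive.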