Skip to content

wikty/nlp-quickstart

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Language Model

单词级 n-gram 前向神经网络语言模型 A Neural Probabilistic Language Model (Bengio et al., 2001; 2003)

字符级 RNN 语言模型介绍 The Unreasonable Effectiveness of Recurrent Neural Networks

字符级 n-gram 语言模型跟 RNN 对比 The unreasonable effectiveness of Character-level Language Models

好玩的文本生成

Word Embedding

review:

word2vec tutorial:

word2vec paper:

embedding for downstream tasks:

distributional semantic model:

sentence embedding:

Convolutional Neural Networks for Sentence Classification

A Convolutional Neural Network for Modelling Sentences

paragraph vector

Skip-Thought Vectors

Semi-supervised Sequence Learning

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

https://github.com/Separius/awesome-sentence-embedding

https://www.dataiku.com/product/plugins/sentence-embedding/

Are distributional representations ready for the real world?

more aboue embedding:

Enriching Word Vectors with Subword Information

https://ruder.io/word-embeddings-2017/

A Survey of Cross-lingual Word Embedding Models

visual embedding:

Machine Learning

Linear Regression

Logistic Regression

Softmax Regression

https://cs231n.github.io/linear-classify/

Linear model by Andrew Ng

An overview of gradient descent optimization algorithms

Neural Network

神经网络入门介绍 Neural Networks and Deep Learning by Nielsen

RNN:

CNN:

RecNN:

Seq2Seq

From SMT to NMT

Attention

Transformer:

Natural Language Processing

基于 NLTK 库的自然语言处理实践教程 The NLTK Book

基于神经网络的自然语言处理方法的历史演进 A Review of the Neural History of Natural Language Processing by Sebastian Ruder

基于神经网络的自然语言处理常用方法简介 A Primer on Neural Network Models for Natural Language Processing by Yoav Goldberg

基于深度学习的自然语言处理技术最佳实践 Deep Learning for NLP Best Practices

基于 Pytorch 深度学习和自然语言处理入门

Pretrained language models

NLP's ImageNet moment has arrived

The Illustrated BERT, ELMo, and others

Semi-supervised Sequence Learning

ELMo

ULMFiT

OpenAI-Transformer

BERT

BERT Word Embeddings

http://ruder.io/multi-task/

auxiliary task

https://github.com/huggingface/transformers

https://github.com/hanxiao/bert-as-service

Text Classification

Convolutional Neural Networks for Sentence Classification

A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional Neural Networks for Sentence Classification

http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/

https://github.com/yoonkim/CNN_sentence

http://albertxiebnu.github.io/fasttext/

Text embedding search

https://github.com/facebookresearch/faiss

https://engineering.fb.com/data-infrastructure/faiss-a-library-for-efficient-similarity-search/

https://www.elastic.co/blog/text-similarity-search-with-vectors-in-elasticsearch

https://hanxiao.io/2019/11/22/Video-Semantic-Search-in-Large-Scale-using-GNES-and-TF-2-0/

Tools

fasttext: https://github.com/facebookresearch/fastText

textcnn: https://github.com/dennybritz/cnn-text-classification-tf

Resources

深度学习 500 问

中文自然语言处理相关资料

中文自然语言处理各任务最新进展 by 滴滴人工智能实验室

http://ruder.io/

http://www.hankcs.com/

liuhuanyong 常见自然语言处理任务项目

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors