HanLP: Han Language Processing

Live Demo

The demo accepts any multilingual sentence of up to 120 characters and analyzes it with the multilingual model. The output for one example sentence follows.

Linguistics

Dep Tree   Token            Relation Lemma            PoS
────────── ──────────────── ──────── ──────────────── ─────
       ┌─► In               case     in               ADP
    ┌─►├── 2021             obl      2021             NUM
    │  └─► ,                punct    ,                PUNCT
    │  ┌─► HanLPv2.1        nsubj    HANlpv2.1        PROPN
┌┬┬─┴──┴── delivers         root     deliver          VERB
│││  ┌───► state-of-the-art amod     state-of-the-art ADJ
│││  │┌──► multilingual     amod     multilingual     ADJ
│││  ││┌─► NLP              compound NLP              PROPN
││└─►└┴┴── techniques       obj      technique        NOUN
││    ┌──► to               case     to               ADP
││    │┌─► production       compound production       NOUN
│└───►└┴── environments     obl      environment      NOUN
└────────► .                punct    .                PUNCT

Tok              NER Type        Tok              SRL PA1
──────────────── ─────────────── ──────────────── ────────────
In                               In               ◄─┐
2021             ───►DATE        2021             ◄─┴►ARGM-TMP
,                                ,
HanLPv2.1        ───►WORK_OF_ART HanLPv2.1        ───►ARG0
delivers                         delivers         ╟──►PRED
state-of-the-art                 state-of-the-art
multilingual                     multilingual
NLP                              NLP
techniques                       techniques
to                               to               ◄─┐
production                       production         ├►ARG2
environments                     environments     ◄─┘
.                                .

Tok              PoS      3       4       5       6
──────────────── ──────────────────────────────────
In               ADP ───────────┐
2021             NUM ────►NP ───┴────────►PP ───┐
,                PUNCT──────────────────────────┤
HanLPv2.1        PROPN───────────────────►NP────┤
delivers         VERB ──────────────────┐       │
state-of-the-art ADJ ───┐               │       │
multilingual     ADJ    │               │       │
NLP              PROPN  ├────────►NP────┼►VP────┼►S
techniques       NOUN ──┘               │       │
to               ADP ───────────┐       │       │
production       NOUN ──┐       ├►PP ───┘       │
environments     NOUN ──┴►NP ───┘               │
.                PUNCT──────────────────────────┘
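
The constituency columns (PoS 3 4 5 6) above can also be read as a bracketed tree. The following is a minimal sketch, hand-transcribed from that diagram rather than produced by HanLP; the nested-tuple layout and the helper names `bracket`, `leaves`, and `phrases` are assumptions made for illustration only.

```python
# Nested tuples: (label, child, child, ...); a leaf is (PoS, token).
# Hand-transcribed from the constituency diagram above (an assumption,
# not HanLP output).
TREE = (
    "S",
    ("PP", ("ADP", "In"), ("NP", ("NUM", "2021"))),
    ("PUNCT", ","),
    ("NP", ("PROPN", "HanLPv2.1")),
    ("VP",
        ("VERB", "delivers"),
        ("NP", ("ADJ", "state-of-the-art"), ("ADJ", "multilingual"),
               ("PROPN", "NLP"), ("NOUN", "techniques")),
        ("PP", ("ADP", "to"),
               ("NP", ("NOUN", "production"), ("NOUN", "environments")))),
    ("PUNCT", "."),
)


def is_leaf(node):
    """A leaf pairs a PoS tag with a token string."""
    return len(node) == 2 and isinstance(node[1], str)


def bracket(node):
    """Render a node in bracketed (Penn Treebank style) notation."""
    if is_leaf(node):
        return f"({node[0]} {node[1]})"
    return "(" + node[0] + " " + " ".join(bracket(c) for c in node[1:]) + ")"


def leaves(node):
    """Return the tokens covered by a node, left to right."""
    if is_leaf(node):
        return [node[1]]
    return [tok for child in node[1:] for tok in leaves(child)]


def phrases(node, label):
    """Collect the text of every constituent carrying the given label."""
    if is_leaf(node):
        return []
    found = [" ".join(leaves(node))] if node[0] == label else []
    for child in node[1:]:
        found += phrases(child, label)
    return found


print(bracket(TREE))
print(phrases(TREE, "NP"))
```

Reading the noun phrases off the tree this way gives the same spans that the NP arrows mark in the diagram.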

Lexical

In 2021 , HanLPv2.1 delivers state-of-the-art multilingual NLP techniques to production environments .
T1 ADP 0 2 In
#1	AnnotatorNotes	T1	adposition
T2 NUM 3 7 2021
#2	AnnotatorNotes	T2	numeral
T3 PUNCT 8 9 ,
#3	AnnotatorNotes	T3	punctuation
T4 PROPN 10 19 HanLPv2.1
#4	AnnotatorNotes	T4	proper noun
T5 VERB 20 28 delivers
#5	AnnotatorNotes	T5	verb
T6 ADJ 29 45 state-of-the-art
#6	AnnotatorNotes	T6	adjective
T7 ADJ 46 58 multilingual
#7	AnnotatorNotes	T7	adjective
T8 PROPN 59 62 NLP
#8	AnnotatorNotes	T8	proper noun
T9 NOUN 63 73 techniques
#9	AnnotatorNotes	T9	noun
T10 ADP 74 76 to
#10	AnnotatorNotes	T10	adposition
T11 NOUN 77 87 production
#11	AnnotatorNotes	T11	noun
T12 NOUN 88 100 environments
#12	AnnotatorNotes	T12	noun
T13 PUNCT 101 102 .
#13	AnnotatorNotes	T13	punctuation
T14 DATE 3 7 2021
T15 WORK_OF_ART 10 19 HanLPv2.1
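
The T lines above are standoff annotations: an ID, a label, start and end character offsets, and the covered text. As a rough sketch, assuming the offsets are 0-based and end-exclusive (which matches `T1 ADP 0 2 In`), a few of these lines can be parsed and checked against the sentence as follows. The `Span` type and `parse_standoff` function are hypothetical names, not part of HanLP.

```python
from typing import NamedTuple


class Span(NamedTuple):
    id: str
    label: str
    start: int  # 0-based, inclusive (assumed)
    end: int    # exclusive (assumed)
    text: str


def parse_standoff(lines):
    """Parse 'T<id> <label> <start> <end> <text>' lines; skip '#' note lines."""
    spans = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        ident, label, start, end, text = line.split(maxsplit=4)
        spans.append(Span(ident, label, int(start), int(end), text))
    return spans


SENTENCE = ("In 2021 , HanLPv2.1 delivers state-of-the-art multilingual "
            "NLP techniques to production environments .")

# A subset of the annotation lines shown above.
ANNOTATIONS = """\
T1 ADP 0 2 In
#1\tAnnotatorNotes\tT1\tadposition
T4 PROPN 10 19 HanLPv2.1
T14 DATE 3 7 2021
T15 WORK_OF_ART 10 19 HanLPv2.1
"""

spans = parse_standoff(ANNOTATIONS.splitlines())

# Every span's offsets should slice its own text out of the sentence.
for s in spans:
    assert SENTENCE[s.start:s.end] == s.text

entities = [(s.label, s.text) for s in spans
            if s.label in {"DATE", "WORK_OF_ART"}]
print(entities)
```

Because part-of-speech tags and named entities share one ID space here, the entity filter relies on the label set rather than on the ID.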

Syntactics

1	In	In	ADP	ADP	_	2	case	_	_
2	2021	2021	NUM	NUM	_	5	obl	_	_
3	,	,	PUNCT	PUNCT	_	2	punct	_	_
4	HanLPv2.1	HanLPv2.1	PROPN	PROPN	_	5	nsubj	_	_
5	delivers	delivers	VERB	VERB	_	0	root	_	_
6	state-of-the-art	state-of-the-art	ADJ	ADJ	_	9	amod	_	_
7	multilingual	multilingual	ADJ	ADJ	_	9	amod	_	_
8	NLP	NLP	PROPN	PROPN	_	9	compound	_	_
9	techniques	techniques	NOUN	NOUN	_	5	obj	_	_
10	to	to	ADP	ADP	_	12	case	_	_
11	production	production	NOUN	NOUN	_	12	compound	_	_
12	environments	environments	NOUN	NOUN	_	5	obl	_	_
13	.	.	PUNCT	PUNCT	_	5	punct	_	_
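
The rows above use the ten-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC), where HEAD 0 marks the root. The sketch below, which is an illustration and not HanLP's own reader, parses the rows and groups dependents by head. The rows are written with spaces for readability; `split()` handles both spaces and tabs here only because no field in this sentence contains whitespace.

```python
from collections import defaultdict

CONLLU = """\
1 In In ADP ADP _ 2 case _ _
2 2021 2021 NUM NUM _ 5 obl _ _
3 , , PUNCT PUNCT _ 2 punct _ _
4 HanLPv2.1 HanLPv2.1 PROPN PROPN _ 5 nsubj _ _
5 delivers delivers VERB VERB _ 0 root _ _
6 state-of-the-art state-of-the-art ADJ ADJ _ 9 amod _ _
7 multilingual multilingual ADJ ADJ _ 9 amod _ _
8 NLP NLP PROPN PROPN _ 9 compound _ _
9 techniques techniques NOUN NOUN _ 5 obj _ _
10 to to ADP ADP _ 12 case _ _
11 production production NOUN NOUN _ 12 compound _ _
12 environments environments NOUN NOUN _ 5 obl _ _
13 . . PUNCT PUNCT _ 5 punct _ _
"""

FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


def parse_conllu(text):
    """Turn CoNLL-U rows into dicts; skip blank and comment lines."""
    tokens = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        tok = dict(zip(FIELDS, line.split()))
        tok["id"], tok["head"] = int(tok["id"]), int(tok["head"])
        tokens.append(tok)
    return tokens


tokens = parse_conllu(CONLLU)

# Map each head ID to its (relation, dependent form) pairs, in token order.
children = defaultdict(list)
for t in tokens:
    children[t["head"]].append((t["deprel"], t["form"]))

root = children[0][0][1]  # the single token whose HEAD is 0
print(root, children[5])
```

Grouping by head makes it easy to read the verb's arguments directly: the subject, object, and both oblique modifiers all hang off token 5.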

Semantics

Semantic dependency graph (head -> dependent, label):
1 In -> 2 2021	ARG2
5 delivers -> 4 HanLPv2.1	ARG1
5 delivers -> 9 techniques	ARG2
5 delivers -> 12 environments	ARG3
6 state-of-the-art -> 9 techniques	ARG1
7 multilingual -> 9 techniques	ARG1
8 NLP -> 9 techniques	compound
11 production -> 12 environments	compound
orphan edges: 1 In -> 3 , 4 HanLPv2.1, 5 delivers, 6 state-of-the-art, 7 multilingual, 8 NLP, 10 to, 11 production, 13 .
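
Unlike the dependency tree, a semantic dependency graph lets a token have more than one head. A small sketch, with the labelled (non-orphan) edges copied by hand from the graph above, shows which tokens are multi-headed; the variable names are illustrative only.

```python
from collections import defaultdict

TOKENS = ("In 2021 , HanLPv2.1 delivers state-of-the-art multilingual "
          "NLP techniques to production environments .").split()

# (head, dependent, label) with 1-based token IDs; orphan edges are left out.
EDGES = [
    (1, 2, "ARG2"),
    (5, 4, "ARG1"),
    (5, 9, "ARG2"),
    (5, 12, "ARG3"),
    (6, 9, "ARG1"),
    (7, 9, "ARG1"),
    (8, 9, "compound"),
    (11, 12, "compound"),
]

# Collect every (head form, label) pair pointing at each dependent.
heads = defaultdict(list)
for head, dep, label in EDGES:
    heads[dep].append((TOKENS[head - 1], label))

# Tokens with more than one incoming edge cannot be drawn as a tree.
multi_headed = {TOKENS[dep - 1]: hs for dep, hs in heads.items() if len(hs) > 1}
for form, hs in multi_headed.items():
    print(form, hs)
```

In this sentence, techniques and environments each receive several incoming edges, which is why the output is a graph rather than a tree.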

Introduction

HanLP is a multilingual NLP library for researchers and companies, built on PyTorch and TensorFlow 2.x, that aims to advance state-of-the-art deep learning techniques in both academia and industry. It was designed from day one to be efficient, user-friendly, and extendable.

Thanks to open-access corpora such as Universal Dependencies and OntoNotes, HanLP 2.1 now offers 10 joint tasks across 104 languages: tokenization, lemmatization, part-of-speech tagging, token feature extraction, dependency parsing, constituency parsing, semantic role labeling, semantic dependency parsing, and abstract meaning representation (AMR) parsing. See also the project on GitHub.

HanLP versions