An Empirical Study of Chinese Name matching and Applications

Ivan Zapreev
Mind Map by Ivan Zapreev, updated more than 1 year ago
Ivan Zapreev
Created by Ivan Zapreev about 5 years ago
21
0

Description

paper by Nanyun Peng, Mo Yu, Mark Dredze

Resource summary

An Empirical Study of Chinese Name matching and Applications
1 Introduction
1.1 Name matching
1.1.1 Important
1.1.1.1 Downstream tasks
1.1.1.1.1 Entity linking
1.1.1.1.1.1 Includes context of mentions
1.1.1.1.2 Entity clustering
1.1.1.1.2.1 Includes context of mentions
1.1.1.1.3 ?
1.1.1.1.4 Entity coreference
1.1.1.1.4.1 ?
1.1.1.1.5 Name transliteration
1.1.1.1.6 Identifying names for mining paraphrases
1.1.1.1.6.1 ?
1.1.1.1.7 Standalone name matching
1.1.1.1.7.1 Context independen
1.1.1.2 Entity disambiguation
1.1.1.2.1 Determine if two mentioned strings refer to the same entity
1.1.2 Methods
1.1.2.1 Language type
1.1.2.1.1 Alphabetic languages
1.1.2.1.1.1 Focused on
1.1.2.1.1.1.1 Example
1.1.2.1.1.1.1.1 English
1.1.2.1.1.1.2 Indo-European
1.1.2.1.2 Logogram languages
1.1.2.1.2.1 Example
1.1.2.1.2.1.1 Chinese
1.1.2.1.2.1.1.1 Hanzi
1.1.2.1.2.1.1.1.1 Challenge
1.1.2.1.2.1.1.1.1.1 A small number of hanzi represents an entire name
1.1.2.1.2.1.1.1.1.2 There are X*10.000 hanzi in use
1.1.2.1.2.1.1.1.1.3 Current methods
1.1.2.1.2.1.1.1.1.3.1 Largely UNTESTED
1.1.2.1.2.1.1.1.1.3.2 Coreference resolution errors
1.1.2.1.2.1.1.1.1.3.2.1 Caused by
1.1.2.1.2.1.1.1.1.3.2.1.1 Chinese name matching errors
1.1.2.2 Focus on persons names
1.1.3 Challenge
1.1.3.1 Issue: Name variations
1.1.3.1.1 Nicknames
1.1.3.1.2 Aliases
1.1.3.1.3 Acronyms
1.1.3.1.4 Differences in translation
1.1.3.2 Exact string matching
1.1.3.2.1 POOR results!
1.1.4 Determine whether two strings refer to the same entity based on the strings above.
2 Research
2.1 Evaluate Name Matching methods
2.1.1 In Chineese
2.1.2 Approaches
2.1.2.1 Existing
2.1.2.1.1 String matching
2.1.2.1.1.1 ?
2.1.2.1.2 Learnig
2.1.2.1.2.1 ?
2.1.2.2 New
2.1.2.2.1 New Representation for Chinese
2.1.3 Experiments
2.1.3.1 New Representation for Chinese
2.1.3.1.1 Improves
2.1.3.1.1.1 name matching
2.1.3.1.1.2 Entity clustering
2.1.3.1.2 No details?!
2.2 Newly developed data sets
2.2.1 Matched Chinese name pairs
2.3 Mingpipe
2.3.1 Name matching tool
2.3.1.1 Python package
2.3.1.1.1 Usage
2.3.1.1.1.1 As stand alone
2.3.1.1.1.2 Integrated in a larger system
Show full summary Hide full summary

Similar

Luo-Connecting Points
elisegreaves
Chinese HSK -1 Characters Flashcards
ASHISH AWALGAONKAR
Chinese HSK -1 Characters Flashcards
Diem Huong Bui
Chinese HSK -1 Characters Flashcards
Ali Mendoza
Chinese HSK -1 Characters Flashcards
Jose Horta
Chinese 2 Dictation
Diem Huong Bui
Chinese 2 Dictation
Yumi Suazo
Chinese(Traditional)-Adjectives
yvonnechan
Chinese Characters Lesson 4
Bruno Herrero No
Mandarin useful phrases
A K
Chinese Character Lesson 5
Bruno Herrero No