XU Yizhou

Master's Student in NLP at INaLCO

Education

present

Master in Natural Language Processing

Text, Computing and Multilingualism Department

INaLCO

June 2017

Diplôme Universitaire d'Etudes Françaises

Département Didactique du Français Langue Etrangère

Université Sorbonne Nouvelle

June 2013

Master in Translation and Interpreting (English/Chinese)

School of Translation and Interpreting

Beijing Language and Culture University

June 2011

B.E. in Computer Science

College of Information Science

Beijing Language and Culture University

Skills

Programming Languages

  • Python

    • Experienced in Python programming (more with Python 3 than Python 2)
    • Solid knowledge of Python syntax and advanced features such as generators, decorators, and the collections and itertools modules
    • Basic knowledge of functional programming in Python
    • Working knowledge of OOP in Python
    • NLP: 3 course projects using NLTK, spaCy and Gensim
    • Machine Learning and Data Mining:
      1 course project and 1 production-level project using scikit-learn, NumPy, SciPy, pandas, Matplotlib and Seaborn
      Basic knowledge of Deep Learning and TensorFlow
    • Experienced in processing data files in different formats (JSON, XML/HTML, CSV, PDF, ...) and encodings
    • Hands-on knowledge of web crawling / data scraping with urllib + bs4, Selenium and Scrapy
    • Basic knowledge of web development frameworks like Django and Flask
    • Familiar with Python best practices such as PEP 8 (coding style) and PEP 257 (docstring conventions)
    • Experienced in PyCharm
  • Perl

    • Hands-on knowledge and experience of Perl
    • Basic knowledge of (Linux) system administration with Perl
    • NLP: 2 course projects
  • Java

    • Developing proficiency in Java programming
    • Working knowledge of OOP
    • NLP: 1 course project using CoreNLP; familiar with OpenNLP
    • Experienced in Eclipse
  • C++

    • Basic knowledge of C++ syntax and STL

Natural Language Processing

  • Natural Language Understanding

    • Word level:
    • Sentence level:
    • Document level:
  • Natural Language Generation

    • Text to text:
    • Data to text:
  • Software

    • GATE
    • TXM
    • Unitex
  • English

  • French

  • Chinese

    • Sound knowledge of the Chinese NLP pipeline
    • Toolkits:
      jieba (2 projects in Python), THULAC (1 project in Python), CoreNLP (1 project in Java)
    • Corpus:
    • 1 project in Bash, 2 projects in Perl, 1 project in Java and 3 projects in Python

Machine Learning / Data Mining / Information Retrieval

  • Machine Learning

Database / Data Warehouse

  • Database-SQL

    • Sound knowledge of SQL syntax
  • Database-NoSQL

  • Data Warehouse

Web and Semantic Web

  • Front-end

  • Semantic Web

  • Web Crawling

Linux

  • Operating System

    • Hands-on knowledge and experience of Unix-like operating systems
  • Bash Programming

    • Skilled in Bash Programming

Miscellaneous

  • Git/GitHub

  • Cloud

    • AWS: Experience with EC2, S3 and Route 53
  • LaTeX

Languages

Publications

Translation

The Smartest Places on Earth: Why Rustbelts Are the Emerging Hotspots of Global Innovation

Publisher: CITIC Press

ISBN: 9787508656090

Authors: Antoine van Agtmael, Fred Bakker

Translator: XU Yizhou


English Pronunciation in Use Intermediate

Publisher: Beijing Language and Culture University Press

ISBN: 9787561933220

Author: Mark Hancock

Translator: XU Yizhou