CV
Updated on 28 Sept 2020
Education
Master of Science, University of Melbourne, 2019-2021
Major: Computer Science
Core Modules: Statistical Machine Learning, Distributed Systems, Natural Language Processing, Quantum Computing
Research Topic: Machine Learning
Supervisors: Ben Rubinstein, Renata Borovica-Gajic
Bachelor of Engineering, Jiangsu University, 2015-2019
Major: Information Security
Core Modules: Database Security, IoT Security, Cryptography Principles, Networks and Systems Security
GPA: 3.89/5.0, Ranking: 2/56
Supervisor: Conghua Zhou
Skills
- Familiar with Java (Swing, JUnit), C++ (sockets, MFC) and Python (scikit-learn, TensorFlow)
- Familiar with multithreaded, object-oriented and functional programming; good coding style
- Familiar with Linux commands, Git operations and basic usage of gcc and vim
- Familiar with data structures and algorithms (linked list, stack, queue, binary tree, divide and conquer, dynamic programming, etc.), as well as network protocols (TCP, UDP, HTTP, etc.)
- Familiar with MySQL, with basic knowledge of Redis and of common database optimization strategies
- Familiar with cryptography (symmetric and asymmetric encryption), and with common web attacks and their defence strategies (SQL injection, XSS, etc.)
- Basic knowledge of machine learning (KNN, SVM, etc.) and distributed systems (vector clocks, leader election, mutual exclusion, 2PC, etc.)
Projects
MVCC Algorithm Implementation (Feb - June 2020)
Task: Simulate a database and implement the MVCC algorithm from scratch to ensure data consistency.
Details:
- The isolation level is Read Committed: each transaction can only access data committed by other transactions;
- When a read operation of one transaction conflicts with a write operation of another transaction, the reader gets the previous version of the data, and the other transaction fails and rolls back automatically.
Results: The system ensures data consistency and prevents dirty reads.
Responsibility: Data transmission module and core concurrency control algorithm
Climate Change Misinformation Detection System (Apr - May 2020)
Task: Judge whether a given article is climate change misinformation using natural language processing tools.
Details:
- Used requests to collect articles from various websites, and labelled each article according to whether it is climate change misinformation;
- Preprocessed the articles and fed them into a neural network to train the model;
- Built a 3-layer fully-connected neural network and tuned its hyper-parameters to predict whether a given article is climate change misinformation.
Results: The model achieved 80% accuracy on the test dataset.
Geolocation of Tweets (Oct - Nov 2019)
Task: Design a system to predict the location from which a tweet is sent.
Details:
- Used requests to collect tweets from Twitter and labelled their locations;
- Fed the tweets to machine learning models, such as KNN and random forest, to make predictions.
Results: The model achieved 85% precision on the test dataset.
Distributed Shared Whiteboard System (Sept - Nov 2019)
Task: Design a distributed system so that multiple users can draw and chat simultaneously.
Details:
- Recorded and broadcast each drawing-board operation as JSON whenever a client draws a pixel, to ensure data consistency across the distributed system;
- Encrypted messages with AES during transmission. When reading records from a file, the system uses SHA-256 to verify the file's digest.
Results: The system met the expected requirements and can handle fewer than 50 concurrent clients.
Responsibility: Drawing board design, data encryption and digest verification
Electronic Document Management Platform (Feb - June 2019)
Task: Design a platform to store and manage electronic documents, with multiversion management functions.
Details:
- The server saves all versions of each document. Data is transmitted over TCP, and documents are transferred over FTP;
- Messages and documents are encrypted with AES. When documents are downloaded, SHA-256 is used to verify their integrity;
- RBAC and backup and recovery mechanisms are used to enhance platform security.
Skills: WinForms + SQL Server
Results: The project implemented a secure multiversion document management platform.
Online Communication Software (May - June 2017)
Task: Design software for communication between users.
Details:
- The system uses TCP to send messages between users;
- Encryption (symmetric and asymmetric) and digital signatures are applied to secure transmission.
Results: The software supports secure chat between users, including private and group communication.
Awards
- Study Scholarship and Triple-A Student for Academic Year 2017-2018, 2nd Prize (2018.12)
- Lanqiao Software and Information Technology Competition, 3rd Prize (2018.04)
- Study Scholarship and Triple-A Student for Academic Year 2016-2017, 1st Prize (2017.12)
- National English Competition for College Students, 3rd Prize (2017.05)
- Cyber Security Competition at Jiangsu University, ranked 3rd (2016.12)
- 13th Mathematics Competition, 3rd Prize (2016.08)
Professional Qualifications
- Languages: Chinese (native); English (IELTS overall 7.0; CET-6 score 559)
- IT: Level 3, National Computer Rank Examination