
Education
Pennsylvania State University – University Park
2016 - 2020
Bachelor of Science: Information Sciences
Bachelor of Science: Risk Analysis & Cybersecurity
GPA: 3.84/4.0
Experience
Software Engineer Intern, JPMorgan Chase
June 2019 - August 2019
Worked on a fully agile, full-stack development team
Cassandra data models
Software Engineer Intern, Lockheed Martin
May 2018 - February 2019
Infrastructure Health Management team
Undergraduate Researcher, CiteSeerX Research Group
April 2017 - April 2018
Worked with Dr. Lee Giles and Dr. Jian Wu
Developed machine learning models to clean digital metadata
Learning Assistant, Penn State University
January 2018 - Present
Professors include: Dr. Jian Wu, Dr. David Fusco, Dr. Rick Winscot
Projects
Face Detection
Developed using the deep learning library Keras
Originally conceived to track classroom attendance
Later modified to dynamically identify faces with computer webcams
Written in Python
Libraries used include face_recognition, sys, dlib, and skimage
Sparked by my interest in neural networks and in transforming human data into useful applications
API Based Web Application
Web application developed using Play Framework 2.7.2
Used USGS, Bing Maps, and OpenCage APIs to locate and visually represent earthquake histories
Written in Java, Scala, HTML, CSS, and CoffeeScript; uses both an in-memory database and PostgreSQL
Deployed to Heroku, a cloud application service
Learned how web requests operate within a service platform
I chose this project because I want to track how people move away from natural disasters using social media geotagging
Predictive Modeling
Created multiple machine learning models to test the accuracy of replacing missing metadata tags with existing data
Used an open-source extractor called GROBID to extract metadata from PDFs
Applied 5-fold cross-validation, which randomly partitions the samples into five folds, to each model
Written in Python, with data stored in Microsoft SQL Server
Learned how to create models and apply them to a larger research idea
This was the project that got me interested in data science
Publications
Athar Sefid, Jian Wu, Jing Zhao, Lu Liu, Allen C. Ge, Cornelia Caragea, Prasenjit Mitra, C. Lee Giles. "Cleaning Noisy and Heterogeneous Metadata for Record Linking Across Scholarly Big Datasets." In: Proceedings of the 31st Innovative Applications of Artificial Intelligence Conference (IAAI 2019), January 29-31, 2019, Honolulu, Hawaii, USA.
Jian Wu, Athar Sefid, Allen C. Ge, and C. Lee Giles. "A Supervised Learning Approach to Entity Matching Between Scholarly Big Datasets." In: Proceedings of the 9th International Conference on Knowledge Capture (K-CAP 2017), December 4-6, 2017, Austin, Texas, USA.
Skills & Languages
Java
4 Years
Python
AngularJS
1 Year
3 Years
SQL
3 Years
CSS
2 Years
R
1 Year
HTML/JavaScript
2 Years

Allen Ge
BS Information Sciences
BS Risk Analysis
Pennsylvania State University



