Zekun's Zone

Experiences

Text Detection and Recognition for Historical Maps

Spatial Sciences Institute, USC

Dec 2019 - Aug 2020, Los Angeles

Responsibilities:

Built a deep neural network for detecting text of various font sizes, styles, and orientations on historical map patches. The network handles text regions of arbitrary shape
Designed the network to highlight probable text regions and then predict accurate bounding boxes given both map features and text probability distributions in a coarse-to-fine manner

Generating Historical Maps from Online Maps

Spatial Sciences Institute, USC

Aug 2018 - Dec 2019, Los Angeles

Responsibilities:

Synthesized historical maps from OpenStreetMap tiles with conditional generative adversarial networks
The network generated background and foreground separately, using different targets, to solve the content mismatch between online maps and historical maps
Used the synthesized historical maps as base maps and automatically placed text labels on them to provide a large amount of training data for text detection networks

Synthetic Face Generation for Facial Landmark Detection

Amazon

Responsibilities:

Built a pipeline to generate synthetic face images with landmark annotations using the 3D modeling application MakeHuman and the rendering application Blender
Rendered the images from 3D models with various poses, camera settings, lighting conditions, and backgrounds
Verified that the 2D landmark detection task and the 3D mesh prediction task can both benefit from the large amount of generated synthetic images

Automated Visual Data Extraction from Chart Images

Microsoft Research Asia

Responsibilities:

Built a pipeline to automatically infer the numerical values shown in column chart images
Applied TridentNet to extract the heights of chart objects. Designed a ruler encoding module that interprets the y-axis information and converts the objects from pixel space to ruler space to generate readings
The ruler encoding module focuses on the minimum and maximum values of the ruler to decide the numerical range that the charts represent
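
As a rough illustration of the pixel-space to ruler-space conversion described above, the sketch below assumes a linear y-axis whose minimum and maximum tick values, and the pixel rows of those ticks, are already known. The function and parameter names are hypothetical, and this is not the ruler encoding module itself; it only shows the arithmetic such a conversion implies.

```python
def pixel_to_value(y_pixel, y_pixel_min, y_pixel_max, value_min, value_max):
    """Linearly map a pixel row to a chart value (assumes a linear axis).

    y_pixel_min is the pixel row of the tick labelled value_min and
    y_pixel_max is the pixel row of the tick labelled value_max.
    Image rows usually grow downward, so y_pixel_min is often the
    larger number; the formula handles either direction.
    """
    span = y_pixel_max - y_pixel_min
    if span == 0:
        raise ValueError("the two reference ticks must lie on different rows")
    # Fraction of the way from the minimum tick to the maximum tick.
    fraction = (y_pixel - y_pixel_min) / span
    return value_min + fraction * (value_max - value_min)


# Hypothetical example: the tick "0" sits at row 400, the tick "100" at row 100,
# and the detected top edge of a bar sits at row 250.
reading = pixel_to_value(250, y_pixel_min=400, y_pixel_max=100,
                         value_min=0, value_max=100)
print(reading)  # 50.0
```

Only the two extreme ticks are needed for a linear axis, which matches the focus on the ruler's minimum and maximum values; a logarithmic axis would need a different mapping.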

Publications

SpaBERT: Pretrained Language Models on Geographic Data for Geo-Entity Representation

EMNLP 2022

Zekun Li, Jina Kim, Yao-Yi Chiang, Muhao Chen

The paper proposes a novel spatial language model called SpaBERT that captures the spatial context of named geographic entities (geo-entities) in geospatial data. The model is based on the hypothesis that the characteristics of a geo-entity can be inferred by its surrounding entities, similar to word meanings in linguistic context.

language model spatial data

ACE: Anchor-free Corner Evolution for Real-time Arbitrarily-oriented Object Detection

IEEE Transactions on Image Processing (TIP) 2022

Pengwen Dai, Siyuan Yao, Zekun Li, Sanyi Zhang, Xiaochun Cao

The paper proposes a novel model for detecting arbitrarily oriented objects, such as text, hands, or objects in aerial images. The model evolves an axis-aligned bounding box into an oriented quadrilateral box using contour information.

object detection

ChartOCR: Data Extraction from Charts Images via a Deep Hybrid Framework

WACV 2021

Junyu Luo, Zekun Li, Jinpeng Wang, Chin-Yew Lin

The paper proposes a unified method, called ChartOCR, to extract data from various types of chart images, including bar charts, line charts, and pie charts. It combines deep learning and rule-based methods to generalize across chart types and to obtain accurate, semantically rich intermediate results.

object detection

An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images

KDD 2020

Zekun Li, Yao-Yi Chiang, Sasan Tavakkol, Basel Shbita, Johannes H. Uhl, Stefan Leyk, Craig A. Knoblock

We present an end-to-end approach that automatically processes historical map images to extract their text content and generate a set of metadata linked to large geographic databases. The approach combines OCR with geocoding to accurately identify location phrases and assign geospatial coordinates.

maps object detection

Synthetic Map Generation to Provide Unlimited Training Data for Historical Map Text Detection

SIGSPATIAL GeoAI Workshop 2021

Zekun Li, Runyu Guan, Qianmu Yu, Yao-Yi Chiang, Craig A. Knoblock

Many text detection algorithms have been proposed to locate text regions in map images automatically, but most of the algorithms are trained on out-of-domain datasets. This paper introduces a method to automatically generate an unlimited amount of annotated historical map images for training text detection models.

maps text detection

Combining Remote-Sensing-Derived Data and Historical Maps for Long-Term Back-Casting of Urban Extents

Remote Sensing 2021

Johannes H. Uhl, Stefan Leyk, Zekun Li, Weiwei Duan, Basel Shbita, Yao-Yi Chiang, Craig A. Knoblock

The paper proposes a framework that uses remote sensing data (the Global Human Settlement Layer, GHSL) and georeferenced historical maps to generate historical urban extents for the early 20th century.

object detection

Weighted Feature Pooling Network in Template-Based Recognition

ACCV 2019

Zekun Li, Yue Wu, Wael Abd-Almageed, Prem Natarajan

The paper proposes a template-based learning approach for computer vision tasks, where multiple instances of a concept are available. The method dynamically predicts weights that consider noise and redundancy to aggregate image-level features into a single template-level representation.

object detection
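
To make the aggregation step in this entry concrete, here is a minimal sketch of pooling several image-level feature vectors into one template-level vector with per-image weights. It assumes the per-image scores are already given and normalizes them with a softmax; in the paper the weights are predicted by the network, so the scores, the softmax choice, and all names below are illustrative assumptions rather than the published method.

```python
import math


def softmax(scores):
    """Turn arbitrary real-valued scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pool_template(features, scores):
    """Weighted sum of image-level features (one list of floats per image).

    Images with low scores (for example noisy or redundant ones)
    contribute less to the template-level representation.
    """
    if len(features) != len(scores):
        raise ValueError("need exactly one score per feature vector")
    weights = softmax(scores)
    dim = len(features[0])
    return [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]


# Hypothetical template with two images and 2-dimensional features.
features = [[1.0, 0.0], [0.0, 1.0]]
print(pool_template(features, scores=[0.0, 0.0]))  # equal weights: [0.5, 0.5]
```

With equal scores the result reduces to plain average pooling, which is the baseline that learned weights are meant to improve on.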

Selected Projects

mapKurator system

Team Lead March 2020 - Present

A deep learning tool for processing scanned historical maps. It performs text detection and recognition, post-OCR correction, and entity linking.

map processing deep learning

SpaBERT

First Author Jan 2021 - Nov 2022

SpaBERT extends BERT to capture linearized spatial context, while incorporating a spatial coordinate embedding mechanism to preserve spatial relations of entities in the 2-dimensional space.

nlp geospatial
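
One way to picture "linearized spatial context" is to turn a geo-entity and its neighbors into an ordered sequence, much like a sentence. The sketch below orders neighbors by Euclidean distance from a pivot entity; that ordering rule, the planar coordinates, and the entity names are assumptions made for illustration, not a description of SpaBERT's actual code.

```python
import math


def linearize(pivot, neighbors):
    """Return entity names as a pseudo-sentence: pivot first, then
    neighbors sorted from nearest to farthest.

    Each entity is a dict with a "name" and planar "xy" coordinates.
    """
    px, py = pivot["xy"]

    def distance(entity):
        ex, ey = entity["xy"]
        return math.hypot(ex - px, ey - py)

    ordered = sorted(neighbors, key=distance)
    return [pivot["name"]] + [e["name"] for e in ordered]


# Hypothetical entities on a small planar grid.
pivot = {"name": "museum", "xy": (0.0, 0.0)}
neighbors = [
    {"name": "cafe", "xy": (3.0, 4.0)},     # distance 5
    {"name": "library", "xy": (1.0, 0.0)},  # distance 1
    {"name": "park", "xy": (0.0, 2.0)},     # distance 2
]
print(linearize(pivot, neighbors))  # ['museum', 'library', 'park', 'cafe']
```

A plain sequence loses the actual distances and directions, which is why the entry above also mentions a spatial coordinate embedding to preserve relations in 2-dimensional space.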

Synthetic Map Generation

First Author Jun 2019 - Aug 2020

Used CycleGAN to convert OpenStreetMap (OSM) images into a historical map style.

historical maps

Accomplishments

Ordnance Survey Award

British Cartographic Society September 2022

Mishmash: A Mix of Old and New won the Ordnance Survey Award 2022. I adopted CycleGAN to generate synthetic historical maps from OpenStreetMap (OSM) vector data.

AI for Critical Mineral Assessment Competition

DARPA December 2022

The Map Feature Extraction Challenge required us to identify and label the map features (lines, polygons, and points) that appear in the legends of historical maps. Our team, isi-umn, won first place.

Hi, I am Zekun

Zekun Li

Ph.D. Student at University of Minnesota, Twin Cities

Experiences

Synthetic Face Generation for Facial Landmark Detection

Amazon

Automated Visual Data Extraction from Chart Images

Microsoft Research Asia

Recent Posts

Two Github Accounts with Two SSH Keys

ChatGPT for Novel Translation

Raspberry Pi Timelapse Video