🌈Hello, I’m Jo-Ku Cheng!

šŸ§‘ā€šŸŽ“ I’m a final year master’s student at Peking University, School of Mathematical Sciences, majoring in Applied Mathematics under the supervision of Prof. Jinwen Ma. I am also the TA for the undergraduate course ā€˜Mathematical Modelling’ at PKU.

🧐 My research interests include multimodal reasoning and Large Language Model applications.

šŸ˜„ I am from Taichung, Taiwan and grew up in Beijing.

šŸ„ŠšŸ‹ļø I am interested in doing sports. I do boxing and crossfit.

Selected Experience

GeoUni: A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions

Project Homepage | Paper

  • Proposed the first unified multi-modal geometry expert model, GeoUni,capable of solving geometry problems, generating precise geometric diagrams using both formal and natural language, and creating geometry problems based on knowledge points.
  • Proposed Geo-MAGVIT, a module specifically designed for the tokenization of geometric diagrams. By introducing topo-structural awareness loss and text region loss, it significantly improves the precision of geometry structure and text reconstruction.
  • Combined GRPO and LoRA to train the Geo-Reasoning-Adapter, which effectively boosts geometric reasoning capability and seamlessly integrates into the unified model architecture.
  • Established a novel diagram generation evaluation metrics, which includes the Geometry Semantic Matching Scores (GSMSs) and Geometry Pixel Matching Score (GPMS) to comprehensively evaluate the diagram generation task.

Diagram Formalization Enhanced Geometry Problem Solver

pipeline

Project Homepage | Hugging Face Dataset | Paper

  • Designed a multimodal framework integrating visual features and geometric formal languages for solving complex geometry problems.
  • Proposed a synthetic data approach (SynthGeo228K dataset) for improving model training and diagram interpretation.
  • Achieved an accuracy of 82.38% on the publicly available FormalGeo7k dataset, significantly outperforming existing multi-modal and language models, including GPT-4.