Mingyu Zheng
@SpursGoZmy
PhD candidate at the Institute of Information Engineering, Chinese Academy of Sciences.
Language Breakdown
Lines of code distribution across 7 owned repositories
I-Shaped Developer
I-shaped Specialist: deep expertise in Python
Collaboration Network
Global Impact visualization
Repos: 21
PRs: 0
Growth: +18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Théo Biasutto--Lervat
@NiziL
Yaowei Zheng
@hiyouga
wlr737
@wlr737
Naihao Deng
@dnaihao
Heaven-zhw
@Heaven-zhw
Top Repositories
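This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it, thereby strengthening LLMs' understanding of tabular data and ultimately building a large language model specialized for table intelligence tasks.
We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLMs for the Table QA task. A curated list of "Table + LLM" papers.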
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.
Dataset and Code for ACL 2023 paper: "IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures". We proposed a new TQA problem which aims at real application scenarios, together with a supporting dataset and a baseline method.
Practical coursework for a compiler principles course: implementations of a scanner, a parser, and a semantic analyzer for the C-Minus language.
Code and data for the Findings of ACL 2025 paper: "TableDreamer: Progressive and Weakness-guided Data Synthesis from Scratch for Table Instruction Tuning"
Code for the Findings of EMNLP 2023 paper: "Chain-of-Thought Reasoning in Tabular Language Models"
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
Open Source Impact
Contributions to external projects
No external contributions found.