TZstats Convergence Lab @ Columbia


Where disciplines collaborate and research meets education: tackling real-world problems with data, tools, methods — and preparing the next generation of polymath researchers.


Lab lead - Professor Tian Zheng

Latest Posts

Untangling AI Education Narratives

Discussions on “AI Education” often feel tangled, confusing, and mixed. I spend A LOT of time thinking about what could help structure such dialogues. I wrote up my thoughts and posted them on arXiv.

Untangling AI Education Narratives
AI+X Faculty Leadership Fellows Program - Year Beta

Grounded in the belief that faculty are the drivers of transformative educational change, this emerging initiative seeks to accelerate progress in AI education at scale by empowering faculty to lead innovative, cross-disciplinary efforts.

AI+X Faculty Leadership Fellows Program - Year Beta
Frontiers of Statistics in Science and Engineering: 2035 and Beyond

I am honored to serve on the National Academies Committee on the Frontiers of Statistics in Science and Engineering: 2035 and Beyond

The committee will produce a forward-looking assessment of the state of the statistical sciences, emerging opportunities, and the evolving needs of allied fields. Our work spans:

Frontiers of Statistics in Science and Engineering: 2035 and Beyond
An Interview with Columbia News (April 2024)

I shared the best things about being a statistician. Read more

An Interview with Columbia News (April 2024)

Featured Projects

AI for Social Good and Society

AI for Social Good and Society

The AI for Social Good and Society (AI4SGS) Initiative is a bold interdisciplinary effort to apply artificial intelligence to some of the world’s most pressing social and public health challenges.

AutoClimDS: Climate Data Science Agentic AI

AutoClimDS: Climate Data Science Agentic AI

We explore genAI tools to lower the barriers in Climate Data Science in collaboration with AWS.

Currect subprojects include: Knowledge graph construction and expansion. Development of agents for data acquision, analysis, modeling, visualization, etc. Evaluation through case studies.

NSF-STC: Learning the Earth with AI and Physics

NSF-STC: Learning the Earth with AI and Physics

Learning the Earth with Artificial Intelligence and Physics (LEAP) is an NSF Science and Technology Center (STC) launched in 2021. LEAP’s mission is to increase the reliability, utility, and reach of climate projections through the integration of climate and data science.

Statistical Machine Learning for Ecology

Statistical Machine Learning for Ecology

A long-time collaboration between Professors Tian Zheng and Professor Maria Uriarte on using machine learning to unlock potentials of new data types to understand the impact of climate change on tropical forests.

Collaboratory at Columbia - An Aspen Grove of Data Science Education

Collaboratory at Columbia - An Aspen Grove of Data Science Education

The Collaboratory is both a set of “data science in context” educational approaches, as well as a meta-model for an accelerator program that allows different institutions to respond flexibly to their own disciplinary heterogeneity in terms of data science educational needs. The novelty of the Collaboratory lies in its crowd-sourcing approach to creating new data science pedagogy and its ability to kindle transdisciplinary collaboration in doing so. Read our Havard Data Science Review article to learn more.

Applied Data Science at Columbia

Applied Data Science at Columbia

Applied Data Science at Columbia is a project-based learning course that started in 2016. It employs the common task framework and runs 5 mini project cycles during one semester to give students a broad exposure to various areas in data science. Projects are developed and updated each year, drawing inspirations from active research, challenges and interesting public datasets.

Statistical Methods for Aggregated Relational Data

Statistical Methods for Aggregated Relational Data

A core and active research area of the TZstats lab.