Text-Driven 3D Human Generation

Summary

Our project aims to develop a text-driven 3D human generation system. Users will be able to describe their desired character using natural language, and the system will automatically generate a realistic and detailed 3D model in response.

Team members

Jian Yu, Xiaoyu Zhu, Xinzhe Wei, Zimo Fan

Problem Discription

Problem: Creating high-quality 3D human models is a time-consuming and skill-intensive process. Traditional modeling software requires technical expertise, making it inaccessible for many users and hindering rapid iteration.

Importance: Realistic 3D humans are essential in various industries. Gaming, animation, virtual/augmented reality, and even personalized medicine rely on these models. A faster, more intuitive model creation process would streamline development and broaden accessibility.

Challenge: The complexity of human form poses a major challenge. Capturing diverse body types, clothing, and subtle details requires both precision and artistic skill. What's more, reconstruction often consumes a lot of time and creates a barrier to the rollout of the application.

Proposed Solution: We will develop a text-driven 3D human generation system that leverages the new Gaussian splatting method for accelerated rendering. This system will use natural language processing (NLP) to understand user descriptions and generate corresponding 3D models, simplifying the creation process significantly.

Goals and Deliverables

Baseline Plan

Aspirational Plan

Schedule

Resources