Hi, I am a researcher at Zhipu AI. Before that, I was a research scientist at 4Paradigm Inc. and the leader of OpenRL Lab. I received my B.E. and Ph.D. degrees (the latter co-advised by Prof. Jun Zhu and Prof. Ting Chen) from the Department of Computer Science and Technology, Tsinghua University, in July 2017 and June 2022, respectively. My research focuses on deep reinforcement learning, multi-agent reinforcement learning, distributed reinforcement learning, RL for robotics, LLMs as agents, artificial general intelligence (AGI), and generative artificial intelligence (GAI). I have also spent time working at RealAI Inc., Huawei Noah's Ark Lab, Tencent AI Lab, Carnegie Mellon University, and Sensetime Inc. I am also the founder of the OpenRL Lab and the TARTRL group.
We are looking for self-motivated interns and full-timers who have a strong background in mathematics/computer science and are eager to get involved in cutting-edge, fundamental AI research. Please feel free to drop me an email if you are interested in collaborating with me.
I have read your paper, in which you mention building a dataset called the Precarious Pedestrian dataset. Could you share the dataset for research purposes?
Also, may I ask a question in Chinese?
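Does Algorithm 1 in the paper mean training a binary classifier, with the generated data and the real data labeled 0 and 1 respectively, and then returning those generated samples that receive a relatively high probability of being class 1?
How is the adversarial ** reflected?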
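To make the question concrete, here is a minimal sketch of how I currently read Algorithm 1. It is only my interpretation, not the paper's implementation: the function name `select_realistic_samples`, the parameters `top_k`, `steps`, and `lr`, and the use of plain logistic regression on feature vectors are all assumptions made for illustration.

```python
import numpy as np


def select_realistic_samples(real, generated, top_k, steps=500, lr=0.1):
    """Hypothetical reading of Algorithm 1 (not the paper's code).

    Trains a binary classifier with generated samples labeled 0 and real
    samples labeled 1, then returns the indices and scores of the top_k
    generated samples that the classifier scores as most likely real.
    """
    real = np.asarray(real, dtype=float)
    generated = np.asarray(generated, dtype=float)

    # Build the training set: generated -> label 0, real -> label 1.
    X = np.vstack([generated, real])
    y = np.concatenate([np.zeros(len(generated)), np.ones(len(real))])

    # Standardize features so that plain gradient descent behaves well.
    mean = X.mean(axis=0)
    std = X.std(axis=0) + 1e-8
    Xn = (X - mean) / std

    # Logistic regression trained by batch gradient descent
    # (an assumed stand-in for whatever classifier the paper uses).
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(Xn @ w + b)))
        grad = p - y
        w -= lr * (Xn.T @ grad) / len(y)
        b -= lr * grad.mean()

    # Score only the generated samples: estimated probability of "real".
    gen_n = (generated - mean) / std
    scores = 1.0 / (1.0 + np.exp(-(gen_n @ w + b)))

    # Keep the generated samples with the highest "real" probability.
    order = np.argsort(-scores)[:top_k]
    return order, scores[order]
```

In this reading the classifier is trained once and the generator is never updated in response to it, which is what prompts the second question above.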