DreamDojo is trained on what the team calls a large-scale video dataset of human activity, captured from an egocentric (first-person) perspective. This dataset, named DreamDojo-HV, comprises 44,000 ...