Calico calico-1226

RL researcher

23 followers · 10 following

ZJU
Hangzhou, Zhejiang, China
UTC +08:00

Pinned

PKU-Alignment/safe-rlhf (Public)

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python · 1.2k stars · 108 forks

PKU-Alignment/omnisafe (Public)

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python · 873 stars · 128 forks

PKU-Alignment/beavertails (Public)

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Makefile · 88 stars · 3 forks

PKU-Alignment/safe-sora (Public)

SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models…

Python · 15 stars · 3 forks