quanshr

Pinned

  1. QwenLM/Self-Lengthen (Public)

    Python, 92 stars, 9 forks

  2. AugCon (Public)

    [AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

    Python, 26 stars, 3 forks

  3. DMoERM (Public)

    [ACL 2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

    Python, 18 stars

  4. DPOOJ/dpooj (Public)

    Data-point-oriented online judge system for an OO course

    Python, 35 stars

  5. KbsdJames/Awesome-LLM-Preference-Learning (Public)

    The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"

    185 stars, 4 forks