Repository search results

0 files

mrunalmania/Direct-Preference-Optimization

In this repo, I've implemented Direct Preference Optimization (DPO), an LLM alignment technique, and used it to align Microsoft's Phi-2 model.

Topics: machine-learning, deep-learning, alignment, large-language-models, llm

Python

Updated on Aug 21, 2024

repositories Search Results · repo:mrunalmania/Direct-Preference-Optimization language:Python
