Updates to datasets II code #7

greglandrum · 2022-12-19T16:47:27Z

Things in this PR:

Get the scripts which operate on datasets II working in python 3
Add additional scorers for: XGB, balanced random forests, LMNB
Swap the RF scorer to use vanilla scikit-learn RFs instead of our monkey-patched implementation of balanced random forests.

Notes:

I have not done as much work with the datasets I scripts. Those datasets are, with some years of perspective, less interesting and useful, so I'm not feeling strongly compelled to spend time working on them
There's significant room for refactoring and removing duplicate code in the scoring scripts. I'll think about doing this.
The scoring scripts are quite verbose in their output (generating huge amounts of data). I think it wouldn't be terrible to make the output more compact, but that's a longer term project.

greglandrum · 2022-12-19T16:48:16Z

@sriniker : if you have time and inclination to look at this, I'd lover your comments. I have a bit more work to do before marking it as "done", but I wanted to give you a heads up.

greglandrum added 9 commits August 12, 2017 07:12

save

fe9ceb9

get most everything working with python3

f0763b2

add reversible (Crude!)

4029f33

update

84d67e7

seems to work

cef81bf

additional validation funcs

23e826d

add BRF scorer

a1225b0

update NB scorer

7f6df3c

basics

59fd09b

greglandrum marked this pull request as draft December 19, 2022 16:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updates to datasets II code #7

Updates to datasets II code #7

Uh oh!

greglandrum commented Dec 19, 2022

Uh oh!

greglandrum commented Dec 19, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Updates to datasets II code #7

Are you sure you want to change the base?

Updates to datasets II code #7

Uh oh!

Conversation

greglandrum commented Dec 19, 2022

Uh oh!

greglandrum commented Dec 19, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant