Allow rsample to sample with replacement #2
In preparation for adding more options, namely bootstrap sampling (with replacement) and samples from various pseudorandom distributions.
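To make the distinction concrete, here is a minimal Python sketch of sampling lines with and without replacement. It is not the rsample code from this PR: `sample_lines` and its `replace` and `seed` parameters are hypothetical names chosen for illustration, and it only assumes the standard-library `random` module.

```python
import random


def sample_lines(lines, k, replace=False, seed=None):
    """Return k items drawn from lines (hypothetical helper, not rsample itself).

    replace=False: each input line is used at most once, so k must not
                   exceed len(lines).
    replace=True:  bootstrap-style draw; a line may appear several times
                   and k may exceed len(lines).
    """
    rng = random.Random(seed)  # private generator so a seed gives repeatable draws
    if replace:
        return rng.choices(lines, k=k)
    return rng.sample(lines, k)


if __name__ == "__main__":
    data = ["a", "b", "c", "d"]
    print(sample_lines(data, 3, seed=1))                 # three distinct lines
    print(sample_lines(data, 8, replace=True, seed=1))   # repeats are expected
```

Without replacement, asking for more items than there are lines raises `ValueError`; with replacement any sample size is allowed, which is what a bootstrap resample needs.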
|
Prefer to avoid |
|
Umm, so I was planning to add support for arbitrary distributions here. This was mostly me making room / splitting work into pieces. I want to do things like:
The use case being, "I have no idea how strange this graph for my data is, I should see what it looks like with some normal data". My experience suggests that this will become unreadable without
This has some impact on documentation and discoverability, but does result in simpler programs that are more readable by non- and semi-programmers.

Philosophical muttering

Opinions? I have a general misgiving that one might end up reimplementing R / numpy with pipes instead of broadcasting. There's a question about what this library represents in the shadow of tools like R and numpy. I mostly like the idea because I am loath to leave the shell, and am not terribly keen on all the state that comes along with using ipython notebook / babel. I've hacked up a tool called There's a similar tool called pyline that does a similar thing with python. |
|
Anyway, here's a branch where https://github.com/talwrii/datatools/tree/talwrii--normal-data--2016-09-20
|
|
So for a while I had a package of scripts in parallel with What this reads to me is that you think randtools would be worthwhile. That's great! It turned out that I didn't need it, but you might, so go and build it (maybe I'll send some PRs!).
Yes, I agree. You like datatools for the same reason I do, staying inside the shell. However, R/Python are so good that baking too much into datatools isn't worth it because if what you're doing is complex enough it's better to do it in that context. This is my overriding motivation for keeping
Not in datatools please. |
|
Cool cool. My motivation for the pull requests is "here's a library for command line data analysis, it doesn't have the tools I want, I shall implement them, now I've implemented them I may as well give you a pull request".

Umm, so I'm going to implement a version of Do you want sampling with replacement in

More generally, I'm probably going to carry on tweaking these tools here and making complementary tools as I go about my day-to-day activities. I don't know how you want to interact with them: your goal of minimality may be at odds with my goal of "create tools for all the things I do". I could:
|