I would like to implement some version of a function I created in a blog post a while back.
This is how I started the post: I was contacted about the possibility of creating a simple function in simstudy to generate a large data set that could include possibly 10’s or 100’s of potential predictors and an outcome. In this function, only a subset of the variables would actually be predictors. The idea is to be able to easily generate data for exploring ridge regression, Lasso regression, or other “regularization” methods. Alternatively, this can be used to very quickly generate correlated data (with one line of code) without going through the definition process.
In the post, I created function genMultPred. I would like to implement something similar to this in simstudy.
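To make the request concrete, here is a minimal base R sketch of the kind of generator described above. It is not the genMultPred function from the post: the name `sketch_mult_pred`, its arguments, the compound-symmetry correlation structure, and the single shared coefficient `beta` are all assumptions made for illustration.

```r
# Hypothetical helper for illustration only; not genMultPred from the blog post.
sketch_mult_pred <- function(n, n_vars, n_true, rho = 0.3, beta = 1, sd_err = 1) {
  stopifnot(n_true <= n_vars, rho >= 0, rho < 1)

  # Assumed correlation structure: compound symmetry with correlation rho.
  sigma <- matrix(rho, nrow = n_vars, ncol = n_vars)
  diag(sigma) <- 1

  # Correlated standard normal variables via the Cholesky factor of sigma.
  z <- matrix(rnorm(n * n_vars), nrow = n, ncol = n_vars)
  x <- z %*% chol(sigma)
  colnames(x) <- paste0("x", seq_len(n_vars))

  # Only a random subset of the variables gets a non-zero coefficient.
  true_idx <- sort(sample.int(n_vars, n_true))
  coefs <- numeric(n_vars)
  coefs[true_idx] <- beta

  # Continuous outcome: linear in the true predictors plus normal noise.
  y <- as.vector(x %*% coefs) + rnorm(n, sd = sd_err)

  list(
    data = data.frame(id = seq_len(n), x, y = y),
    true_predictors = colnames(x)[true_idx]
  )
}

set.seed(123)
sim <- sketch_mult_pred(n = 500, n_vars = 50, n_true = 5)
dim(sim$data)
sim$true_predictors
```

A call like this returns 50 correlated candidate variables, of which only 5 actually drive the outcome, which is the setup one would want for trying out ridge or Lasso regression. A simstudy version would presumably return a data.table and support other outcome distributions.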
I have skimmed the blog post and it looks interesting. I'm guessing that they wanted to use this in an ML context?
My only issue with this is that it does not adhere to the usual API/workflow of simstudy. That is of course possible, but we should think about how to handle these non-definition-table functions so that we don't add a bunch of different functions that all work differently and are hard to remember and maintain.
I agree - but that cat is already out of the bag, with functions like genOrdCat, genMarkov, and genSplines. I totally get your point, but this is something that could be quite useful to folks. Are you thinking it would be better in a different package, like simstudyExtra?
😹 That is true, and I'm not sure how to improve that situation. I think an extra package is too much at this point; maybe we can homogenize the API of these functions in some way for simstudy 2.0? I'll think about it, but I think it is a useful function!