by Skye
Last Updated September 14, 2018 15:19 PM

This is somewhat both a programming question, and a stats question. So sorry for the overlap (although it seems that there is a large overlap between the realm of stats and programming of professionals). I have a dataset with ~1000 cases matched to ~100,000 controls (each case matched to multiple controls). However, the matches are not mutually exclusive. That is, some of the controls that are matched to one case, may also be matched to another case. However, I do not consider it many-to-many matching because I can't match the cases to each other (an explanation of the reason for this would make this post unbearably long, so I'll try to spare it if possible)

What I would like to do is compute a mean difference (essentially a grand mean difference), with confidence intervals. Lets assume the variable of interest is normally distributed. How can I construct these confidence intervals in R and properly account for this matching scheme? I'm open to bootstrapping if necessary, but would like to avoid it if possible (mainly in the interest of time).

- ServerfaultXchanger
- SuperuserXchanger
- UbuntuXchanger
- WebappsXchanger
- WebmastersXchanger
- ProgrammersXchanger
- DbaXchanger
- DrupalXchanger
- WordpressXchanger
- MagentoXchanger
- JoomlaXchanger
- AndroidXchanger
- AppleXchanger
- GameXchanger
- GamingXchanger
- BlenderXchanger
- UxXchanger
- CookingXchanger
- PhotoXchanger
- StatsXchanger
- MathXchanger
- DiyXchanger
- GisXchanger
- TexXchanger
- MetaXchanger
- ElectronicsXchanger
- StackoverflowXchanger
- BitcoinXchanger
- EthereumXcanger