Skip to content

Commit d33a31f

Browse files
committed
tiny_sample_size
1 parent 5149dce commit d33a31f

File tree

1 file changed

+11
-1
lines changed

1 file changed

+11
-1
lines changed

notes/main.typ

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -124,7 +124,17 @@ But it now an empirical observation, can we theoretically explain this phenomeno
124124

125125
#question("Theoretical explanation for overparameterized models with small sample size")[For sample siez $n = 200$, outcome $Y in RR$ and predictors $X in RR^(p)$, $p / n = c in (0, infinity)$, $Y$ is independent of $X$, under what condition? We will see that a machine learning algorithm can still predict $Y$ well from $X$?.]
126126

127-
Well, this is just the global null hypothesis testing problem in high dimension generalized linear model.
127+
Well, this is just the global null hypothesis testing problem in high dimension models. We can use a nonparametric regression view to see this problem.
128+
129+
$ Y = f(X) + epsilon, f in cal(H) $
130+
131+
$ "H"_0 : f = 0 , "H"_1 : f eq.not 0 $
132+
133+
Using a base function of $cal(H)$ and truncating at $k$ terms, it should be answered well as a hypothesis testing problem, the signal noise ratio, sparsity and the constant $p/n$ will give the detection boundary, also the local minimax rate.
134+
135+
Well, that's asymptotic theory, there still is the question for tiny $n$, say $n = 10$ or $20$. Can we give any answer for this? Can we know anything useful form so tiny sample size? This case may be called #link("https://en.wikipedia.org/wiki/Knightian_uncertainty")[Knightian uncertainty]?
136+
137+
#quote("In economics, Knightian uncertainty is a lack of any quantifiable knowledge about some possible occurrence, as opposed to the presence of quantifiable risk (e.g., that in statistical noise or a parameter's confidence interval). The concept acknowledges some fundamental degree of ignorance, a limit to knowledge, and an essential unpredictability of future events.")
128138

129139
= On the undistinguishable or identification of statistical models
130140

0 commit comments

Comments
 (0)