mattblackwell / gov2002-book Goto Github PK
View Code? Open in Web Editor NEWAn Introduction to Statistical Inference and Regression
Home Page: https://mattblackwell.github.io/gov2002-book/
An Introduction to Statistical Inference and Regression
Home Page: https://mattblackwell.github.io/gov2002-book/
This aforementioned second last sentence is:
If any variable in
This should instead be:
If any variable in
since the thing of interest (that we want to estimate) is
The first (left) display equation after Section 5.2.2 is:
This should be replaced with:
(added a missing plus sign)
The first sentence of the second paragraph of Section 6.6 starts with:
Let $\mathcal{C}(\mathbb{X})={ \mathbb{X}\mathbf{b}\colon \mathbb{b}\in \mathbb{R}^2} be the column space...
This should be:
Let $\mathcal{C}(\mathbb{X})={ \mathbb{X}\mathbf{b}\colon \mathbb{b}\in \mathbb{R}^{k+1}} be the column space...
(replaced
In the paragraph before the "Chi-squared critical values" box, there is a sentence that starts with:
After recentering ad rescaling by the covariance matrix, ...
This should be
After recentering and rescaling by the covariance matrix, ...
(replaced "ad" with "and")
The middle sentence in Theorem 7.3 is:
and its conditional sampling variance issue
This should be:
and its conditional sampling variance is
(changed "issue" to "is")
The third point (3.) in the "Residual regression approach" box is:
Use OLS to regression
This should be:
Use OLS to regress
(replaced "regression" with "regress")
There is a sentence in Chapter 7 which starts with:
which also typically uses the the homoskedastic variance estimator ...
This should be replaced with:
which also typically uses the homoskedastic variance estimator ...
(removed an extra "the")
The second equation of Example 3.3 is:
$\sum_{i=1}^n (X_i - \bar{X}n)^2 = \sum{i=1}^n X_i^2 + n\bar{X}_n$
but should be
$\sum_{i=1}^n (X_i - \bar{X}n)^2 = \sum{i=1}^n X_i^2 + n\bar{X}_n^2$
which is consistent with the later derivation.
The first sentence of the third last paragraph before Section 5.4 is:
Thus, we can write the CEF with two binary covariates as linear when the linear specification includes and multiplicative interaction between them
This should instead be:
Thus, we can write the CEF with two binary covariates as linear when the linear specification includes a multiplicative interaction between them
(replaced "and" with "a" after "includes")
Theorem 3.3 states:
Let
which should probably be replaced by
Let
(removing the extra "a" and keeping the iid shorthand consistent with the rest of the book)
The last line of Example 4.3 ends with:
... and if we want an asymptotically level of 0.05, we can reject when
This should instead be:
... and if we want an asymptotic level of 0.05, we can reject when
(replace "asymptotically" with "asymptotic")
Before Section 2.4.2, there is the sentence:
Maximum likelihood estimators have very nice properties, especially in large samples. Unfortunately, it also requires the correct knowledge of the parametric model, which is often difficult to justify.
The second sentence should probably refer to MLEs in the plural as so:
Maximum likelihood estimators have very nice properties, especially in large samples. Unfortunately, they also require the correct knowledge of the parametric model, which is often difficult to justify.
The third sentence of Chapter 5 is:
For example, we may want to know how wait voting poll wait times vary as a function of some socioeconomic features of the precinct, like income and racial composition.
This should instead be:
For example, we may want to know how voting poll wait times vary as a function of some socioeconomic features of the precinct, like income and racial composition.
(removed an extra "wait" before "voting poll")
In point #2 on this page, the sentence:
So many methods are either use regression estimators like ordinary least squares or extend it in some way.
should be modified to
So many methods either use regression estimators like ordinary least squares or extend it in some way.
i.e. there's an extra "are" term.
The first part of the second display equation in Section 6.9.1 is:
This should be replaced with:
to keep the notation of the projection matrix consistent with what is used elsewhere in the chapter
The sentence after Figure 6.6 is:
One measure of influence is called DFBETA$_i$ measures how much
This should be:
One measure of influence, called DFBETA$_i$, measures how much
A sentence in the paragraph before Section 6.9.1 starts with:
Thus, by definition, This means that when an observation ...
This should be replaced by:
Thus, by definition, this means that when an observation ...
The second sentence of the second paragraph of the Warning box before Section 4.9 is:
Of course, this doesn’t make sense from our definition because the p-values conditions on the null hypothesis—it cannot tell us anything about the probability of that null hypothesis.
This should instead be:
Of course, this doesn’t make sense from our definition because the p-value conditions on the null hypothesis—it cannot tell us anything about the probability of that null hypothesis.
(replace "p-values" with "p-value")
One sentence after the Section 7.1 header reads:
Remember that since
This should be:
Remember that since
(removed the "then")
A sentence right before Section 7.4.1 starts with:
Recall the heteroskedastic-consistent variance estimator is ...
This should be:
Recall the heteroskedastic-consistent variance estimator ...
(removed the "is" because right after the display equation the sentence starts with "is")
Near the end of the page:
Wooldridge, Jeffrey. Econometric Analysis of Cross Section and Panel Data
should be replaced by
Wooldridge, Jeffrey. Econometric Analysis of Cross Section and Panel Data. The MIT Press.
for consistency with earlier bullet points?
The end of the second last sentence of the first paragraph in Section 6.4 is:
... linear independence means that if
This should be:
... linear independence means that
(removed the "if" before the "if and only if" statement)
The first display equation after Section 5.3 of Chapter 5 is (using the LaTeX shorcuts you defined in the source code):
This should instead be:
(replaced "f" in the argmin expression with "g" to match the "g" the argmin is being taken over)
The statement of Definition 4.4 is:
The p-value of a test is the probability of observing a test statistic is at least as extreme as the observed test statistic in the direction of the alternative hypothesis.
This should instead be:
The p-value of a test is the probability of observing a test statistic at least as extreme as the observed test statistic in the direction of the alternative hypothesis.
(the "is" before "at least" should be removed)
The last sentence in the paragraph after Section 5.3 of Chapter 5 starts with:
In particular, if we label
This should instead be:
In particular, if we label
(added "to" before "be")
The paragraph after Definition 4.3 in Chapter 4 is:
A test with a significance level of
This should instead be (with differences in brackets []):
A test with a significance level of
(a missing "see" word and an addition of "at most" to make the last statement more precise)
The first two points (1. and 2.) of the "Linear projection assumptions" box is:
These should be replaced by:
(removed the period in (1.) and fixed the subscript typo in (2.))
Here a few typos I've collected while consulting the textbook (the corrections are in bold):
Thank you @mattblackwell for writing this helpful resource and making it publicly available for free!
In Example 2.8, the random variable size = 4
, which doesn't match the original rv. It would probably be more appropriate to model using size = 5
for all the subsequent simulations in the example.
In the paragraph preceding the large Jacobian matrix display equation, there is a sentence which ends like:
... and be continuously differentiable (we make the function bold since it ).
The remark in the parenthesis should probably be completed.
In the first paragraph after Section 2.2, it is said that:
They are independent in that the random vectors
However, this is pairwise independence, which is strictly weaker than mutual independence (which is required in the iid assumption). I checked this here:
In the first warning box, there is the sentence:
Then the statistic is a random variable that has a distribution over the numbers from {2, , 12} that describes our uncertainty over what the sum will be before we roll the dice.
However, the {2, , 12} seems like an odd choice to format the set, and could possibly be replaced with nicer LaTeX formatting, i.e. {2,\dots,12} (formatted in LaTeX)?
The sentence right before the Jacobian matrix is:
It will help us use more compact matrix notation if we introduce a Jacobian matrix of all partial derivatives
This should be replaced with
It will help us to use more compact matrix notation if we introduce a Jacobian matrix of all partial derivatives
(adding a "to")
The second sentence in Chapter 5 is:
In particular, these tools show how the conditional mean of
This should be:
In particular, these tools show how the conditional mean of
(added an "of" near the end)
Example 2.10 starts as:
Example 2.10 (Sampling variance of the sample mean:)
but this should probably be replaced with
Example 2.10 (Sampling variance of the sample mean)
to be consistent with the other examples before this
The sentence right before Section 2.6.3 is:
Given the above derivation, the standard error of the sample mean under iid sampling is
This is missing a period at the end of the sentence, i.e. should be replaced with:
Given the above derivation, the standard error of the sample mean under iid sampling is
The paragraph before the numbered statements in the Blue box after Section 5.2.3 is:
Without some assumptions on the joint distribution of the data, The following “regularity conditions” will ensure the existence of the BLP:
This should be replaced by:
Without some assumptions on the joint distribution of the data, the following “regularity conditions” will ensure the existence of the BLP:
(changed "T" to "t" in "the")
The sentence right before Figure 3.3 reads:
... establishes how the standard 95% confidence interval for the sample mean above asymptotically valid.
This should be:
... establishes how the standard 95% confidence interval for the sample mean above is asymptotically valid.
(adding an "is")
The first sentence after the first display equation in Section 5.4 is:
so that the change in the predicted outcome for increasing
Following the logical structure of the argument, this should be:
so that the change in the predicted outcome for increasing
(replaced "isn't" with "is")
The start of a paragraph in Chapter 6 is:
Why does this rank condition matter for the OLS estimator? A key property of full column rank matrices is that
This should be replaced by:
Why does this rank condition matter for the OLS estimator? A key property of full column rank matrices is that
(replaced wrongly used "if" with "is")
The first sentence of Footnote #2 in Chapter 4 reads:
Different people and different textbooks describe what to do when do not reject the null hypothesis in different ways.
This should instead be:
Different people and different textbooks describe what to do when we do not reject the null hypothesis in different ways.
In the proof of Theorem 3.1, the notation
The display equation right before Theorem 7.1 is:
This should instead be:
since beta is a vector and not a scalar.
The first display equation after Section 5.2.3 is:
This should instead be:
(the last x should be lowercase)
The word after the
The last sentence of the second last paragraph before Section 6.6 is:
Then coefficient on the West dummy will be
This should be:
Then, the coefficient on the West dummy will be
(added a "the")
The sentence after the
As we discussed in the earlier, an
This should instead be:
As we discussed earlier, an
(removed the "in the" before "earlier")
The first sentence of the footnote reads:
This approach to inference is often called a model-based approach since we assuming a probability model in the cdf,
and should instead be:
This approach to inference is often called a model-based approach since we are assuming a probability model in the cdf,
(a missing "are" term)
In the last paragraph of Example 3.3, it is said that the bias shrinks as a function of the sample size, so as long as the sampling variance shrinks as a function of the sample size (which it does), the estimator is consistent by Theorem 3.5.
However, Theorem 3.5 only says that an estimator is consistent if it is unbiased and its sampling variance shrinks as a function of the sample size, and no mention was made about the slightly stronger version used in Example 3.3.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.