-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathMITx 6.431x -- Probability - The Science of Uncertainty and Data + Unit_3.Rmd
704 lines (567 loc) · 25.7 KB
/
MITx 6.431x -- Probability - The Science of Uncertainty and Data + Unit_3.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
---
title: "MITx 6.431x -- Probability - The Science of Uncertainty and Data + Unit_3.Rmd"
author: "John HHU"
date: "2022-11-05"
output: html_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
## R Markdown
This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see <http://rmarkdown.rstudio.com>.
When you click the **Knit** button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:
```{r cars}
summary(cars)
```
## Including Plots
You can also embed plots, for example:
```{r pressure, echo=FALSE}
plot(pressure)
```
Note that the `echo = FALSE` parameter was added to the code chunk to prevent printing of the R code that generated the plot.
# Course / Unit 3: Counting / Solved problems
# 1. The birthday problem
The birthday problem. Consider n people who are attending a party. We assume that every person has an equal probability of being born on any day during the year, independently of everyone else, and ignore the additional complication presented by leap years (i.e., nobody is born on February 29). What is the probability that each person has a distinct birthday?
Teaching assistant: Kuang Xu


In this exercise, we'll be looking at a problem,
so-called the birthday paradox, where we have n
people and each person has a random birthday
out of the 365 days.
And the probability we want to measure is the probability of
the event that no two birthdays coincide.
To set up the problem, we'll define the sample space omega
as a set of all possible birthday combinations.
Let's see how big that is.
Well, we have a collection of n people.
And each person has 365 choices.
And therefore, the total number choices for all
possible birthday combinations will be this number raised to
the power n.
So this will be the size of our sample space omega.
Now, out of the sample space we'll look at all choices, all
combinations where no two people have
exactly the same birthday.
To count that, we can start from the first person, person
number one.
Well, this person has 365 choices because so far there
has not been any other birthdays to
be conflicting with.
However, for the second person we know that whatever the
first person chose, we cannot choose the
same birthday again.
And so we're left with 364 choices.
Keep going like this, we get 363, and so on and so forth.
Until we reach the last person, which will give us 365
minus n plus 1.
Since we've repeated this process n times.
Now, looking at these two numbers we know that the
probability of no two birthdays coincide is equal to
the size of the event, which is this problem here--
365 times 364, so on and so forth, times 365 minus n plus
1, divided by the size of the sample space, 365 raised to
the nth power.
Now, you might wonder how big really is this probability?
Well, if we were to plot the probability of having no two
birthdays coincide as a function of the size of the
group n, where n goes from 0 all the way to 80, we see that
the probability drops very quickly.
In particular, if we were to draw a line right here from
having 50 [percent]
chance of seeing no two birthdays coincide.
We see that roughly around 23 we're already are seeing a
chance smaller than half.
If the group size is about 40 and the chance of having a
unique birthday for everyone is only about 10%.
Now, here on the right we are plotting the same probability
but with a different scaling.
So on the y-axis we're plotting with respect to a
logarithmic scaling, showing a finer granularity of
the value near zero.
Now we can see right here beyond 60 and the plot on the
left that we solved before, you can barely
tell the value p-n.
So here if we were to track 60 around right here, we see that
the values start dipping below 1% and all the way to 10 to
the negative tenth.
So that's a tiny number, even though we only have a
group of size 120.
# 2. Rooks on a chessboard
Rooks on a chessboard. Eight rooks are placed in distinct squares of an 8 x 8 chessboard, with all possible placements being equally likely. Find the probability that all the rooks are safe from one another, i.e., that there is no row or column with more than one rook.
Teaching assistant: Katie Szeto



Today, we're going to do a fun problem called rooks on a
chessboard.
And rooks on a chessboard is a problem that's going to test
your ability on counting.
So hopefully by now in class, you've learned a few tricks to
approach counting problems.
You've learned about permutations, you've learned
about k-permutations, you've learned about combinations,
and you've learned about partitions.
And historically for students that we've taught in the past
and many people, counting can be a tricky topic.
So this is just one drill problem to help you get those
skills under your belt.
So what does the rooks on a chessboard problem ask you?
Well, you're given an 8-by-8 chessboard, which I've tried
to draw here.
It's not very symmetrical.
Sorry about that.
And you're told that you have eight rooks.
I'm sure most of you guys are familiar with chess.
But if any of you aren't, chess is a
sophisticated board game.
And one of the types of pieces you have in this game is
called a rook.
And in this particular problem,
there are eight rooks.
And your job is to place all eight rooks onto this 8-by-8
chessboard.
Now, you're told in the problem statement that all
placements of rooks are equally likely.
And you are tasked with finding the probability that
you get a safe arrangement.
So that is to say, you place your eight rooks on the board.
What is the probability that the way you
placed them is safe?
So what do I mean by "safe"?
Well, if you're familiar with the way chess works, so if you
place a rook here, it can move vertically or it can move
horizontally.
Those are the only two legal positions.
So if you place a rook here and you have another piece
here, then this is not a safe arrangement, because the rook
can move this way and kill you.
Similarly, if you have a rook here and another piece here,
the rook can move horizontally and kill you that way.
So two rooks on this board are only safe from each other if
they are neither in the same column nor in the same row.
And that's going to be key for us to solve this problem.
So let's see-- where did my marker go?
I've been talking a lot, and I haven't really
been writing anything.
So our job is again, to find the probability that you get a
safe arrangement.
So I'm just going to do "arrange" for short.
Now, I talked about this previously, and you guys have
heard it in lecture.
Hopefully you remember something called the discrete
uniform law.
So the discrete uniform law is applicable when your sample
space is discrete and all outcomes are equally likely.
So let's do a quick check here.
What is our sample space for this problem?
Well, a logical choice would be that the set of all
possible outcomes is the set of all possible spatial
arrangements of rooks.
And hopefully it's clear to you that that is discrete.
And the problem statement furthermore gives us that
they're equally likely.
So the discrete uniform law is in fact
applicable in our setting.
So I'm going to go ahead and write what this means.
So when your sample space is discrete and all outcomes are
equally likely, then you can compute the probability of any
event, A, simply by counting the number of outcomes in A
and then dividing it by the total number of outcomes in
your sample space.
So here we just have to find the number of total safe
arrangements and then divide it by the total number of
arrangements.
So again, as you've seen in other problems, the discrete
uniform law is really nice, because you reduce the problem
of computing probabilities to the problem of counting.
And so here's where we're going to exercise those
counting skills, as I promised earlier.
Now, I would like to start with computing the
denominator, or the total number of arrangements,
because I think it's a slightly easier computation.
So we don't care about the arrangements being safe.
We just care about how many possible
arrangements are there.
Now, again, we have eight rooks, and we need to place
all of them.
And we have this 8-by-8 board.
So pretty quickly, you guys could probably tell me that
the total number of square is 64, because this is
just 8 times 8.
Now, I like to approach problems sequentially.
That sort of really helps me think clearly about them.
So I want you to imagine a sequential process during
which we place each rook one at a time.
So pick a rook.
The chessboard is currently empty.
So how many squares can you place that rook in?
Well, nobody's on the board.
You can place it in 64 spots.
So for the first rook that you pick, there are 64 spots.
Now, once you place this rook, you need to place the second
rook, because again, we're not done until
all eight are placed.
So how many possible spots are left.
Well, I claim that there are 63, because one rule of chess
is that if you put a piece in a particular square, you can
no longer put anything else on that square.
You can't put two or more things.
So the first rook is occupying one spot, so there's only 63
spots left.
So the second rook has 63 spots that it could go in.
Similarly, the third rook has 62 spots.
Hopefully you see the pattern.
You can continue this down.
And remember, we have to place all eight rooks.
So you could do it out yourself or just
do the simple math.
You'll figure out that the eighth rook only has 57 spots
that it could be in.
So this is a good start.
We've sort of figured out if we sequentially place each
rook, how many options do we have.
But we haven't combined these numbers in any useful way yet.
We haven't counted the number of total arrangements.
And this may already be obvious to some, but it wasn't
obvious to me when I was first learning this material, so I
want to go through this slowly.
You have probably heard in lecture by now about the
counting principle.
And what the counting principle tells you is that
whenever you have a process that is done in stages and in
each stage, you have a particular number of choices,
to get the total number of choices available at the end
of the process, you simply multiply the number of choices
at each stage.
This might be clear to you, again, simply from the
statement, for some of you.
But for others, it might still not be clear.
So let's just take a simple example.
Forget about the rook problem for a second.
Let's say you're at a deli, and you
want to make a sandwich.
And to make a sandwich, you need a choice of bread and you
need a choice of meat.
So we have a sandwich-building process,
and there's two stages.
First, you have to pick the bread, and then you have to
pick the meat.
So let's say for the choice of bread, you can
choose wheat or rye.
So again, you can always use a little decision tree--
wheat or rye.
And then let's say that for the meats,
you have three options.
You have ham, turkey, and salami.
So you can have ham, turkey, or salami--
ham, turkey, or salami.
How many total possible sandwiches can you make?
Well, six.
And I got to that by 2 times 3.
And hopefully this makes sense for you, because there's two
options in the first stage.
Freeze an option.
Given this choice, there's three options
at the second stage.
But you have to also realize that for every other option
you have at the first stage, you have to add an additional
three options for the second stage.
And this is the definition of multiplication.
If you add three two times, you know that's 3 times 2.
So if you extrapolate this example to a larger, more
general picture, you will have derived for yourself the
counting principle.
And we're going to use the counting principle here to
determine what the total number of arrangements are.
So we have a sequential process, because we're placing
the first rook and then the second rook, et cetera.
So at the first stage, we have 64 choices.
At the second stage, we have 63 choices.
At the third stage, we have 62 choices, et cetera.
And so I'm just multiplying these numbers together,
because the counting principle says I can do this.
So my claim is that this product is equal to the total
number of arrangements.
And we could stop here, but I'm going to actually write
this in a more useful way.
You guys should have been introduced to
the factorial function.
So you can express this equivalently as 64 factorial
divided by 56 factorial.
And this is not necessary for your problem solution, but
sometimes it's helpful to express these types of
products in factorials, because you can see
cancellations more easily.
So if it's OK with everybody, I'm going to erase this work
to give myself more room.
So we'll just put our answer for the denominator up here,
and then we're going to get started on the numerator.
So for the numerator, thanks to the discrete uniform law,
we only need to count the number of safe arrangements.
But this is a little bit more tricky, because now, we have
to apply our definition of what "safe" means.
But we're going to use the same higher-level strategy,
which is realizing that we can place rooks sequentially.
So we can think of it as a sequential process.
And then if we figure out how many choices you have in each
stage that sort of maintain the "safeness" of the setup,
then you can use the counting principle to multiply all
those numbers together and get your answer.
So we have to place eight rooks.
Starting the same way we did last time, how many spots are
there for the first rook that are safe?
Nobody is on the board yet, so nobody can harm the first rook
we put down.
So I claim that it's just our total of 64.
Now, let's see what happens.
Let's pick a random square in here.
Let's say we put our first rook here.
Now, I claim a bunch of spots get invalidated because of the
rules of chess.
So before, I told you a rook can kill anything in the same
column or in the same row.
So you can't put a rook here, because they'll kill each
other, and you can't put a rook here.
So by extension, you can see that everything in the column
and the row that I'm highlighting in blue, it's no
longer an option.
You can't place a rook in there.
Otherwise, we will have violated
our "safety" principle.
So where can our second rook go?
Well, our second rook can go in any of the blank spots, any
of the spots that are not highlighted by blue.
And let's stare at this a little bit.
Imagine that you were to take scissors to your chessboard
and cut along this line and this line and this
line and this line.
So you essentially sawed off this cross that we created.
Then you would have four free-floating chessboard
pieces-- this one, this one, this one, and this one.
So this is a 3-by-4 piece, this is 3-by-3, this is
4-by-3, and this is 4-by-4.
Well, because you cut this part out, you can now slide
those pieces back together.
And hopefully you can convince yourself that that would leave
you with a 7-by-7 chessboard.
And you can see that the dimensions match up here.
So essentially, the second rook can be placed anywhere in
the remaining 7-by-7 chessboard.
And of course, there are 49 spots in a 7-by-7 chessboard.
So you get 49.
So let's do this experiment again.
Let me rewrite the reduced 7-by-7 chessboard.
You're going to have to forgive me if the lines are
not perfect--
one, two, three, four, five, six, seven; one, two, three,
four, five, six, seven.
Yep, I did that right.
And then we have one, two, three, four, five, six, seven.
That's not too bad for my first attempt.
So again, how did I get this chessboard from this one?
Well, I took scissors and I cut off of the blue strips,
and then I just merged the remaining four pieces.
So now, I'm placing my second rook.
So I know that I can place my second rook in any of these
squares, and it'll be safe from this rook.
Of course, in reality, you wouldn't really cut up your
chessboard.
I'm just using this as a visual aid to help you guys
see why there are 49 spots.
Another way you could see 49 spots is literally just by
counting all the white squares, but I think it takes
time to count 49 squares.
And this is a faster way of seeing it.
So you can put your second rook anywhere here.
Let's actually put in the corner, because the corner is
a nice case.
If you put your rook in the corner, immediately, all the
spots in here and all the spots in here become invalid
for the third rook, because otherwise, the rooks can hurt
each other.
So again, you'll see that if you take scissors and cut off
the blue part, you will have reduced the dimension of the
chessboard again.
And you can see pretty quickly that what you're left with is
a 6-by-6 chessboard.
So for the third rook, you get a 6-by-6 chessboard, which has
36 free spots.
And I'm not going to insult your intelligence.
You guys can see the pattern--
64, 49, 36.
These are just perfect squares decreasing.
So you know that the fourth rook will have 25 spots.
I'm going to come over here because I'm out of room.
The fifth rook will have 16 spots.
The sixth rook will have nine spots.
The seventh rook will have four spots.
And the eighth rook will just have one spot.
And now, here we're going to invoke the
counting principle again.
Remember the thing that I just defined to you by talking
about sandwiches.
And we'll see that to get the total number of safe
arrangements, we can just multiply
these numbers together.
So I'm going to go ahead and put that up here.
You get 64 times 49 times 36 times 25 times 16
times 9 times 4.
And in fact, this is our answer.
So we're all done.
So I really like this problem, because we don't normally ask
you to think about different spatial arrangements.
So it's a nice exercise, because it lets you practice
your counting skills in a new and creative way.
And in particular, the thing that we've been using for a
while now is the discrete uniform law.
But now, I also introduced the counting principle.
And we used the counting principle twice--
once to compute the numerator and once to compute the
denominator.
Counting can take a long time for you to absorb it.
So if you still don't totally buy the counting
principle, that's OK.
I just recommend you do some more examples and try to
convince yourself that it's really counting the right
number of things.
So counting principle is the second takeaway.
And then the other thing that is just worth mentioning is,
you guys should get really comfortable with these
factorials, because they will just show up again and again.
So that's the end of the problem, and I'll
see you next time.
A written solution to this problem can be found here. It includes a second approach, different than the one in the video.
https://courses.edx.org/assets/courseware/v1/a7e46484586f3ea79bd6d5845dee3418/asset-v1:MITx+6.431x+2T2022+type@asset+block/recitation_U03-rooks_on_a_chessboard-sol2.pdf
# 3. Hypergeometric probabilities
Hypergeometric probabilities. An urn contains n balls, out of which exactly m are red. We select k of the balls at random, without replacement (i.e., selected balls are not put back into the urn before the next selection). What is the probability that i of the selected balls are red?
Teaching assistant: Kuang Xu



In this problem, we're given an urn with n balls in it, out
of which m balls are red balls.
To visualize it, we can draw a box that represents the set of
all n balls.
Somewhere in the middle or somewhere else we have a cut,
such that to the left we have all the red balls (there are
m), and non-red balls.
Let's for now call it black balls.
That is n minus m.
Now, from this box, we are to draw k balls, and we'd like to
know the probability that i out of those k
balls are red balls.
For the rest of the problem, we'll refer to this
probability as p-r, where r stands for the red balls.
So from this picture, we know that we're going to draw a
subset of the balls, such that i of them are red, and the
remaining k minus i are black.
And we'll like to know what is the probability that this
event would occur.
To start, we define our sample space, omega, as the set of
all ways to draw k balls out of n balls.
We found a simple counting argument -- we know that size
of our sample space has n-choose-k, which is the total
number of ways to draw k balls out of n balls.
Next, we'd like to know how many of those samples
correspond to the event that we're interested in.
In particular, we would like to know c, which is equal to
the number of ways to get i red balls after
we draw the k balls.
To do so, we'll break c into a product of two numbers --
let's call it a times b --
where a is the total number of ways to select i red balls out
of m red balls.
So the number of ways to get i out of m red balls.
Going back to the picture, this corresponds to the total
number of ways to get these balls.
And similarly, we define b as the total number of ways to
get the remaining k minus i balls out of the set n minus m
black balls.
This corresponds to the total number of ways to select the
subset right here in the right side of the box.
Now as you can see, once we have a and b, we multiply them
together, and this yields the total number of ways
to get i red balls.
To compute what these numbers are, we see that a is equal to
m-choose-i number of ways to get i red balls, and b is n
minus m, the total number of black balls, choose k minus i,
the balls that are not red within those k balls.
Now putting everything back, we have p-r, the probability
we set out to compute, is equal to c, the size of the
event, divided by the size of the entire sample space.
From the previous calculations, we know that c
is equal to a times b, which is then equal to m-choose-i
times (n minus m)-choose-(k minus i).
And on the denominator, we have the entire sample space
is a size n-choose-k.
And that completes our derivation.
Now let's look at a numerical example of this problem.
Here, let's say we have a deck of 52 cards.
And we draw a box with n equals 52, out of which we
know that there are 4 aces.
So we'll call these the left side of the box, which is we
have m equals 4 aces.
Now if we were to draw seven cards--
call it k equal to 7--
and we'd like to know what is the probability that out of
the 7 cards, we have 3 aces.
Using the notation we did earlier, if we were to draw a
circle representing the seven cards, we want to know what is
the probability that we have 3 aces in the left side of the
box and 4 non-aces for the remainder of the deck.
In particular, we'll call i equal to 3.
So by this point, we've cast the problem of drawing cards
from the deck in the same way as we did earlier of drawing
balls from an urn.
And from the expression right here, which we computed
earlier, we can readily compute the probability of
having 3 aces.
In particular, we just have to substitute into the expression
right here the value of m equal to 4, n equal to 52, k
equal to 7, finally, i equal to 3.
So we have 4-choose-3 times n minus m, in this case would be
48, choose k minus i, will be 4, and on the denominator, we
have 52 total number of cards, choosing 7 cards.
That gives us [the]
numerical answer [for]
the probability of getting 3 aces when we draw 7 cards.
# 4. Multinomial probabilities



## Course / Unit 3: Counting / Problem Set 3
# 1. Customers arriving at a restaurant


# 2. A three-sided die






# 3. Forming a committee





# 4. Proving binomial identities via counting





[][Use your brain, above question can be solved in your brain]
# 5. Hats in a box









## Course / Unit 4: Discrete random variables / Unit overview