{"id":579,"date":"2019-12-02T14:08:44","date_gmt":"2019-12-02T14:08:44","guid":{"rendered":"http:\/\/adrianbell.me\/?p=579"},"modified":"2019-12-02T14:10:45","modified_gmt":"2019-12-02T14:10:45","slug":"statistics-assignment-1","status":"publish","type":"post","link":"https:\/\/adrianbell.me\/?p=579","title":{"rendered":"Statistics Assignment 1"},"content":{"rendered":"<p>Very happy with the high mark I achieved for this first assignment. Though, as usual, there's a decent amount to improve. Let's start by looking at some of the more major things my tutor pointed out.<\/p>\n<h1>Variance<\/h1>\n<p>First thing is variance. How do you calculate it? Well it turns out there are a couple of ways. I just decided to use the most cumbersome way...<\/p>\n<p>Calculating the mean <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-05d9eae892416bd34247a25207f8b718_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#109;&#117;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"11\" style=\"vertical-align: -4px;\"\/> (expectation <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-c48b12e1e1855dca961c88c1ff63d5ab_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#69;&#40;&#88;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"43\" style=\"vertical-align: -5px;\"\/>) is easy. 
Multiply each value by its probability and sum the results:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-4c65bf283e9989bfa8577bb8573ceb71_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#109;&#117;&#32;&#61;&#32;&#69;&#40;&#120;&#41;&#32;&#61;&#32;&#92;&#115;&#117;&#109;&#32;&#120;&#92;&#58;&#32;&#112;&#40;&#120;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"164\" style=\"vertical-align: -5px;\"\/><\/p>\n<p>The variance can then be calculated in one of two different ways:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-35571b9bd7cd526283e89efd65febf93_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#123;&#50;&#125;&#61;&#69;&#91;&#40;&#88;&#45;&#92;&#109;&#117;&#41;&#94;&#123;&#50;&#125;&#93;&#61;&#92;&#115;&#117;&#109;&#32;&#40;&#120;&#45;&#92;&#109;&#117;&#41;&#94;&#123;&#50;&#125;&#92;&#58;&#32;&#112;&#40;&#120;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"20\" width=\"278\" style=\"vertical-align: -5px;\"\/><br \/>\nor<br \/>\n<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-8221922171b7dcb842c4dc1eee970886_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#123;&#50;&#125;&#61;&#69;&#40;&#88;&#94;&#123;&#50;&#125;&#41;&#45;&#40;&#69;&#40;&#88;&#41;&#41;&#94;&#123;&#50;&#125;&#61;&#92;&#108;&#101;&#102;&#116;&#40;&#92;&#115;&#117;&#109;&#32;&#120;&#94;&#123;&#50;&#125;&#92;&#58;&#32;&#112;&#40;&#120;&#41;&#92;&#114;&#105;&#103;&#104;&#116;&#41;&#45;&#92;&#109;&#117;&#94;&#123;&#50;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"22\" width=\"337\" style=\"vertical-align: -7px;\"\/><\/p>\n<p>When they're written out like this, it's fairly obvious which method is more like the 
method used to calculate the mean and, as such, would be <em>far<\/em> less hassle. (<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-7e5fbfa0bbbd9f3051cd156a0f1b5e31_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#120;\" title=\"Rendered by QuickLaTeX.com\" height=\"8\" width=\"10\" style=\"vertical-align: 0px;\"\/> is an integer, whereas <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-c203c21815797e61f72d05db8b0dc974_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#112;&#40;&#120;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"33\" style=\"vertical-align: -5px;\"\/> and <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-05d9eae892416bd34247a25207f8b718_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#109;&#117;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"11\" style=\"vertical-align: -4px;\"\/> can be reals\/rationals, so squaring the integer is far easier than squaring its deviation from the mean).<\/p>\n<p>&nbsp;<\/p>\n<h1>Multivariate Poisson Process<\/h1>\n<p>This was the question in which I lost the most marks:<\/p>\n<p>Customers arrive at a shoe shop according to a Poisson process with a rate of 20 per hour.<br \/>\n15% of customers buy men's shoes.<br \/>\n60% buy women's shoes.<br \/>\n25% buy children's shoes.<br \/>\nCalculate the probability that exactly eight customers arrive in half an hour, exactly three of whom wish to purchase children's shoes.<\/p>\n<p>This is such a typical mistake for me to make in statistics. I'm sure I've made this kind of mistake before...<\/p>\n<p>What I ended up doing was working out the probability of the number of customers being 8 using the Poisson distribution's probability function. 
This was fine.<\/p>\n<p>Then I used the same probability function to find the probability of 3 people wanting to buy children's shoes and multiplied the two together. Wrong. At this point I needed to find the <em>conditional<\/em> probability that, of the 8 customers, exactly 3 bought children's shoes. So here I shouldn't have used the Poisson distribution; I should've used the Binomial distribution instead, i.e. the number of children's shoe buyers among the 8 customers is Binomial with n = 8 and p = 0.25.<\/p>\n<p>Looking back at the question, the correct answer seems slightly more obvious now, especially given the \"...exactly three of whom...\" part of the question. I struggle to be mindful of stuff like this in the moment of answering a stats question. I suppose this part of the \"translating English into maths\" issue gets better with more practice...<\/p>\n<h1>Index Of Dispersion<\/h1>\n<p>Again, my issue here was not picking up on subtleties in the question. Given information about the associated distributions, I was initially meant to calculate the mean and variance of the total number of books bought in 9 hours. I managed to get this first part right, but the second part of the question asked me to calculate the index of dispersion for \"this process\". It turns out that \"this process\" refers to the process in the main question generally and not the process of books being bought within 9 hours. In this instance, ignoring the total number of books bought in 9 hours (kind of) simplifies the answer too.<\/p>\n<h1>Other Issues<\/h1>\n<p>In this first assignment, I lost half a mark here and there for incorrect arithmetic (GASP!). Upon completing my draft submission, instead of just reading through it, I should sit down and verify all my working. It will take more time, but if it scrapes 2 marks back, it could be worth it.<\/p>\n<p>Another issue that occurred more than once was a lack of units when talking about rates of things happening. 
So there's a requirement to state \"<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-adc79107f21b094c1ea7bd243e7ae6b3_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#108;&#97;&#109;&#98;&#100;&#97;&#61;&#50;&#48;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"52\" style=\"vertical-align: 0px;\"\/> per hour\" instead of just \"<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/adrianbell.me\/wp-content\/ql-cache\/quicklatex.com-adc79107f21b094c1ea7bd243e7ae6b3_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#108;&#97;&#109;&#98;&#100;&#97;&#61;&#50;&#48;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"52\" style=\"vertical-align: 0px;\"\/>\".<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Very happy with the high mark I achieved for this first assignment. Though, as usual, there's a decent amount to improve. Let's start by looking at some of the more major things my tutor pointed out. Variance First thing is variance. How do you calculate it? 
Well it turns out there are a couple &hellip; <a href=\"https:\/\/adrianbell.me\/?p=579\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Statistics Assignment 1<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24,15],"tags":[],"class_list":["post-579","post","type-post","status-publish","format-standard","hentry","category-critique","category-statistics"],"_links":{"self":[{"href":"https:\/\/adrianbell.me\/index.php?rest_route=\/wp\/v2\/posts\/579","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/adrianbell.me\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/adrianbell.me\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/adrianbell.me\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/adrianbell.me\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=579"}],"version-history":[{"count":19,"href":"https:\/\/adrianbell.me\/index.php?rest_route=\/wp\/v2\/posts\/579\/revisions"}],"predecessor-version":[{"id":598,"href":"https:\/\/adrianbell.me\/index.php?rest_route=\/wp\/v2\/posts\/579\/revisions\/598"}],"wp:attachment":[{"href":"https:\/\/adrianbell.me\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=579"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/adrianbell.me\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=579"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/adrianbell.me\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=579"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}