
Introduction

The NPU was a project designed to be an extremely difficult timed verbal battery that discriminates primarily in the range above IQ 130. The NPU was constructed by selecting antonyms and analogies from 20+ GRE forms from the 1980s. Antonyms and analogies were the focus of this test because previous ETS reporting from the 80s showed that these item types discriminate best at the 130+ level. Additionally, the hardest items from the 1980s GRE are universally antonyms and analogies.

Using the ETS item response data, a range of items across a harsh difficulty curve (see Difficulty and g-loading) was selected. The items were chosen such that the average GRE taker would score approximately 20 out of 60. Attempts were made to create an even split between simple and complex vocabulary.

A preliminary norm was devised by setting 20 out of 60 at IQ 118. The test was then piloted on a number of people before public release to estimate the score distribution. Scores were then extrapolated using the same methodology as the Miller Analogies Test. This proved to match the data very closely (see new normalization vs original).

Data collection was initially gated by a series of passwords, requiring participants to submit previous IQ scores to me before attempting the test. After most of the data was gathered, the test was opened to the public.

Raw Data

82 people attempted the NPU. 66 people completed legitimate attempts. 49 people submitted previous VIQ results. The average score was 26.6 (SD 10.2). All of the people with incomplete attempts were performing very poorly before dropping out.

Antonyms vs Analogies

Regressions

NPU vs Average VIQ

The following plot shows the relationship between NPU raw score and Average VIQ. Average VIQ was calculated by taking the unweighted average of all VIQ scores submitted by a testee.
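The Average VIQ calculation and the uncorrected correlation described here can be sketched in a few lines of Python. This is only an illustrative sketch, not the analysis actually run for this report: the function names `average_viq` and `pearson_r` are hypothetical, and the four testees in `submissions` are invented placeholder values, not data from the sample.

```python
from math import sqrt
from statistics import mean


def average_viq(scores):
    """Unweighted mean of every VIQ score a single testee submitted."""
    if not scores:
        raise ValueError("testee submitted no VIQ scores")
    return mean(scores)


def pearson_r(xs, ys):
    """Plain (uncorrected) Pearson correlation between two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical placeholder data: (NPU raw score, list of submitted VIQ scores).
submissions = {
    "testee_a": (24, [125, 131]),
    "testee_b": (35, [140]),
    "testee_c": (18, [112, 118, 121]),
    "testee_d": (46, [155, 165]),
}

npu_raw = [raw for raw, _ in submissions.values()]
avg_viq = [average_viq(scores) for _, scores in submissions.values()]
r = pearson_r(npu_raw, avg_viq)
print(f"r = {r:.2f} at N = {len(npu_raw)}")
```

Because the average is unweighted, a testee who submitted three scores and a testee who submitted one each contribute a single Average VIQ value to the correlation.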
The following tests were allowed and utilized in the calculation:
• Any professional verbal test (WAIS, SB, RAIT, PPVT, etc.)
• Other accepted tests: Jouve Tests, 1980 SAT, Stratosphere VAI, MAT, CMT

Descriptive Statistics

                      N     Minimum   Maximum   Mean       Std. Deviation
NPU                   49    10.00     55.00     28.9796    10.02183
AvIQ                  49    90.00     174.00    133.5714   16.05070
Valid N (listwise)    49

• The correlation was 0.87 uncorrected at N = 49.

The average VIQ of people that submitted scores was 133.5 (SD 16.1).

NPU vs Professional VIQ

The following plot shows the relationship between NPU raw scores and Professional VIQ. Professional VIQ tests only include tests that are or have been used in a professional capacity. Tests submitted by users: WAIS (majority), SB, PPVT, RAIT, CMT, Slosson.

The correlation was 0.73 uncorrected at N = 22.

NPU vs SAT Verbal

The following plot shows the relationship between NPU raw scores and the 1980 SAT Verbal.

The correlation was 0.82 uncorrected at N = 23.

Time vs NPU

There was a correlation of 0.2 between time spent and score on NPU. The majority of test takers had no issue completing the test within the given time limit. Administration of the NPU as an untimed battery is likely valid for those with processing speed difficulties given the low correlation.

Correlation between forms

The correlation between antonym and analogy scores was 0.72.

Reliability

Measures of reliability were acceptable given the length of the battery and the small sample size acquired.

Reliability Statistics

Cronbach's Alpha    N of Items
.897                60

Reliability Statistics

Cronbach's Alpha                  Part 1   Value         .838
                                           N of Items    30 (a)
                                  Part 2   Value         .809
                                           N of Items    30 (b)
                                  Total N of Items       60
Correlation Between Forms                                .723
Spearman-Brown Coefficient        Equal Length           .839
                                  Unequal Length         .839
Guttman Split-Half Coefficient                           .838

a. The items are: q1 through q30.
b. The items are: q31 through q60.

Difficulty and g-loading

Below are the tabulated difficulties of each item from the official ETS data in comparison to the percentage answering correctly from the Reddit sample. G-loadings for each item are also listed.

Q     GRE Difficulty   Reddit Diff   G-loading
1     89%              86%           0.73
2     80%              77%           0.80
3     66%              92%           0.77
4     62%              85%           0.78
5     51%              73%           0.77
6     46%              59%           0.68
7     39%              39%           0.82
8     35%              39%           0.82
9     34%              48%           0.81
10    33%              41%           0.74
11    32%              17%           0.72
12    31%              52%           0.72
13    30%              56%           0.74
14    29%              36%           0.68
15    28%              35%           0.84
16    27%              30%           0.71
17    26%              26%           0.71
18    25%              38%           0.67
19    24%              24%           0.80
20    23%              45%           0.78
21    22%              53%           0.81
22    21%              36%           0.84
23    20%              32%           0.77
24    19%              32%           0.91
25    18%              52%           0.80
26    17%              33%           0.86
27    16%              24%           0.87
28    15%              14%           0.79
29    14%              33%           0.81
30    7%               22%           0.80
End Antonyms
31    91%              83%           0.85
32    80%              72%           0.88
33    70%              86%           0.84
34    62%              73%           0.80
35    50%              76%           0.77
36    45%              36%           0.83
37    41%              64%           0.75
38    35%              45%           0.74
39    34%              58%           0.81
40    33%              42%           0.76
41    32%              53%           0.83
42    31%              47%           0.73
43    30%              52%           0.85
44    29%              45%           0.84
45    28%              21%           0.86
46    27%              42%           0.73
47    26%              32%           0.87
48    25%              47%           0.71
49    24%              48%           0.74
50    23%              47%           0.73
51    22%              33%           0.76
52    21%              20%           0.77
53    20%              27%           0.81
54    19%              30%           0.74
55    18%              42%           0.83
56    17%              42%           0.76
57    16%              23%           0.73
58    15%              23%           0.76
59    14%              9%            0.80
60    13%              8%            0.73

Normalization Update (11/5/2021)

Raw   IQ
60    190
59    188
58    186
57    184
56    182
55    180
54    178
53    176
52    174
51    172
50    170
49    168
48    166
47    164
46    162
45    160
44    158
43    156
42    154
41    152
40    150
39    148
38    146
37    144
36    142
35    141
34    139
33    137
32    135
31    133
30    132
29    130
28    129
27    127
26    126
25    124
24    123
23    121
22    120
21    119
20    118
19    117
18    116
17    115 or less
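For scoring, the normalization table above can be applied as a simple lookup. The following is a minimal Python sketch under stated assumptions: the names `NORMS`, `FLOOR_RAW`, and `raw_to_iq` are hypothetical, and it assumes that the final row ("17, 115 or less") also covers every raw score below 17, which the table does not state explicitly.

```python
# IQ values copied from the normalization table, for raw scores 60 down to 18.
IQ_VALUES = [
    190, 188, 186, 184, 182, 180, 178, 176, 174, 172,  # raw 60-51
    170, 168, 166, 164, 162, 160, 158, 156, 154, 152,  # raw 50-41
    150, 148, 146, 144, 142, 141, 139, 137, 135, 133,  # raw 40-31
    132, 130, 129, 127, 126, 124, 123, 121, 120, 119,  # raw 30-21
    118, 117, 116,                                     # raw 20-18
]
assert len(IQ_VALUES) == 43  # one value per raw score from 60 down to 18

NORMS = dict(zip(range(60, 17, -1), IQ_VALUES))

# Assumption: raw scores of 17 and below all fall in the "115 or less" row.
FLOOR_RAW = 17
FLOOR_IQ = 115


def raw_to_iq(raw):
    """Return (iq, at_floor) for an NPU raw score between 0 and 60.

    at_floor is True when the table only says "115 or less", so the
    returned value is an upper bound rather than a point estimate.
    """
    if not isinstance(raw, int) or not 0 <= raw <= 60:
        raise ValueError("raw score must be an integer from 0 to 60")
    if raw <= FLOOR_RAW:
        return FLOOR_IQ, True
    return NORMS[raw], False


for raw in (60, 36, 35, 20, 17):
    iq, at_floor = raw_to_iq(raw)
    print(raw, f"{iq} or less" if at_floor else iq)
```

Returning the floor flag separately keeps callers from treating "115 or less" as an exact score. Note that the table keeps the preliminary anchor from the Introduction, with a raw score of 20 mapping to IQ 118.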