CJE 3663 Proficiency # 2 You are a crime analyst for the Orange County Sheriff’s office. You are requested to perform the following tasks. Access the final data set (OC Raw Data) and perform the tasks below. When done email your proficiency to me and title it Last Name Prof3. ** Write your (1) name, (2) and computer number on this answer sheet. _______________________________________________ ** Save the original data file in your newly opened workbook and then create a working file and be sure to save your work over the course of the proficiency (we all know the reason for this). Section I: File Recognition (1 pt) 1. What is the file type? _____________________________________________________ Section II: File Conversion & Data Acquisition (4 pt) 1. Import the raw data file into Excel. Save it as OCSO_Data in an excel format. 2. Describe the native file format. What data format did Orange County Sheriff's Office provide you with? _____________________________________________________________________________________________ Section III: Data Reconciliation and Analysis (70 pt) 1. Once you have the data imported and the excel table designed properly, the data must be cleaned to conform to the following rules (answers may be different so be sure to justify and explain what you have done where results could vary): 2. Insert variable names from data dictionary. a. Create a row above the data set (1st row) with a representative title (bold this title row with 14 font in Red Capitals). b. Bold and use a fill color for the row with your variable names (column headings). c. All non-necessary character elements, including zeroes, are to be removed from the data set (hint: pay close attention to signal codes in the data set and the data dictionary). d. Incidents recorded as occurring at an intersection must have the format, [Street][Space]&&[Space][Street]. In other words, the streets that intersect must be joined by a [space]&&[space]. e. The coded fields (columns) UCRCode and ArrSignal require labels. After each coded field, create a new field (column) to contain a label for the preceding coded field. Enter the correct label based on the information contained in the UCR and Police Arrest signal code data dictionaries. Be sure to create an appropriate variable heading. 3. Create a new field using the OffZone field by stripping off the first character which represents the police district in which the incident occurred. Rename this column District, and remove the OffZone field and its remaining data. 4. Create a new field using the OffTime and AM/PM fields. Title it OffTime1 and combine the two fields with a space so the new column will read for example in the first cell 635 PM. Leave OffTime and AM/PM in the columns preceding the new variable OffTime1. 5. Format the fields so that the information in each field’s cell is visible. 6. Be sure to update your data dictionary as some variable have been created and some have been removed. 7. Once the data is cleaned and properly formatted, you must ensure record non-redundancy for validity. (25 pt) a. If the dataset contains duplicate records, remove all such cases and place them in a separate spreadsheet titled Duplicate_Records. b. Calculate a frequency distribution for UCRCode. In other words, construct a new table in its own sheet with coinciding chart to identify the number of offenses that occurred in Orange County per UCR category. c. Calculate a frequency distribution and create a corresponding chart in its own sheet to indicate the number of offenses per police district. d. Create one pivot table and one pivot chart to look at the data in a manner different than done in sections b. and c. 8. Email me your completed spreadsheet and your updated data dictionary titled Proficiency 2_last name as standard protocol. Proficiency # 2 Data Dictionary Variables (Raw Data) Col 1 Col 2 Col 3 Col 4 Col 5 Col 6 Col 7 Col 8 Col 9 Col 10 Col 11 Col 12 Col 13 Col 14 Col 15 Col 16 CaseNo UCRCode OffDate OffTime AM/PM OFFZONE OffAdd VicDOB VicRace VicSex ArrChr PerpDOB PerpRace PerpSex ArrDate ArrSignal Data Dictionary of UCR Offenses UCR Code 10 20-21 30 31 40-45 50-59 60-69 70-79 80+ Description Murder Rape Armed Robbery Simple Robbery Assault Burglary Theft Auto Theft Other Data Dictionary of Arrest Signal Codes Code 18 21 30 34 35 37 51 59 60 63 64 65 67 68 Description Traffic Complaints Homicide Agg Battery Simple Battery Agg Assault Agg Arson Criminal Mischief Agg Burg Criminal Trespassing Armed Robbery Simple Robbery Theft Unauthorized Use of Movable 69 Possession of Stolen Property 95, 95G Illegal Carrying of Weapons 100 Hit and Run 103 Agg Assault 103F Fight 106 Obscenity 966 Drug Law Violations 103F Fight 17F Fugitive Attachment 17J Juvenile Attachment 34C Cutting 34S Shooting 62 Residential Burglary 64G Armed Robbery (Gun) 67A Auto Theft 67B Bike Theft 67P Pick Pocket 67S Shoplifting Any remaining Other Signals