Sound warning system and its effect on cyclists’ awareness in traffic Group 406 Group members: Anastassia Andreasen Amina Arshad Gediminas Stankevicius Yang Ou Ran Wang Sandy Phi Duong Supervisor: Amalia De Goetzen MED4 Project, Aalborg University Copenhagen, 2013 spring 1 Semester:: 4th Aalborg University Copenhagen Title: A.C. Meyers Vænge 15 Project Period: 4th february -24th may 2013 2450 København SV Telefon: 9940 2500 Semester Theme: Sonic Interaction: Design and Evaluation Semester Coordinator: Secretary: Supervisor(s): Project group no.: Members: Abstract: This report covers the 4th semester project, by six Medialogy students, group 406, made in the spring/summer of 2013, at Aalborg University Copenhagen. The project is created in relation to the semester theme “Sonic Interactions: Design and Evaluation”. The project seeks to answer the problem statement: To what extend could a sound warning system increase cyclists’ awareness of incoming traffic while listening to music through headphones? Testing of the warning system prototype shows, that with help of sound computing techniques, a warning system could potentially decrease time needed for the cyclist to know about approaching vehicles, which also means an increase of traffic awareness. The paper will considerations, guide ideas the and reader choices through made throughout analysis, design and implementation Copies: Pages: Finished: to testing and discussing the product, where ideas for improving the product in a future perspective will be introduced. 2 Table of Contents 1. Introduction ........................................................................................................................................................ 5 1.1. 2. Initial problem statement .................................................................................................................. 5 Pre-analysis......................................................................................................................................................... 6 2.1. Target group ............................................................................................................................................ 6 2.2. Target traffic environment ................................................................................................................ 7 2.3 Music listening and traffic awareness........................................................................................... 9 2.4. Headphones............................................................................................................................................ 10 2.5. State of the art ....................................................................................................................................... 13 2.5.1. Bone Conduction Headphone ......................................................................................................... 13 2.5.2. Microphones helping cyclists ......................................................................................................... 14 2.5.3. Sound-Based Traffic Incident Detection .................................................................................... 14 2.5.4. Traffic congestion estimation ......................................................................................................... 15 2.5. Possible solutions for a warning system ................................................................................... 15 2.5.1. System Input .......................................................................................................................................... 16 2.5.2. System Output ....................................................................................................................................... 21 2.6. Conclusion............................................................................................................................................... 23 2.7. Final Problem Statement .................................................................................................................. 24 3.0. Analysis ............................................................................................................................................................... 25 3.1. Sound properties from road traffic .............................................................................................. 25 3.1.1. Frequency domain............................................................................................................................... 25 3.1.2. Amplitude domain ............................................................................................................................... 26 3.1.3. Target frequency and amplitude detection range ................................................................. 27 3.2. Sound computing techniques ......................................................................................................... 28 3.2.1. Additive synthesis ............................................................................................................................... 28 3.2.2. Subtractive synthesis ......................................................................................................................... 28 3.2.3. General knowledge about filters ................................................................................................... 29 3.3. Envelope follower................................................................................................................................ 30 3.4. MAX MSP ................................................................................................................................................. 30 3.5. Audio feedback ....................................................................................................................................... 31 4.0. Design .................................................................................................................................................................. 34 4.1. Requirements ........................................................................................................................................ 34 4.2. Short description of the system ..................................................................................................... 35 4.2. Overall structure of the system ..................................................................................................... 36 4.2.1. Input .......................................................................................................................................................... 36 4.2.2. Filtering and envelope follower .................................................................................................... 36 4.3. Sound feedback........................................................................................................................................... 37 3 5. Implementation............................................................................................................................................... 38 5.1. Sound source delimitation ............................................................................................................... 38 5.2. Development.......................................................................................................................................... 39 5.2.1. Sound file play ....................................................................................................................................... 39 5.3. Soundscape............................................................................................................................................. 40 5.3.1. Lowpass filter ........................................................................................................................................ 41 5.4.3. Envelope follower ........................................................................................................................................ 42 5.4. 6. Feedback ....................................................................................................................................................... 43 Methods .............................................................................................................................................................. 45 6.1. 7. T-test ......................................................................................................................................................... 45 Testing................................................................................................................................................................. 48 7.1. Test objectives.............................................................................................................................................. 48 7.2. Environmental settings ............................................................................................................................ 49 7.3. Pilot testing............................................................................................................................................. 52 7.4. Hypotheses test .................................................................................................................................... 53 7.5. First hypothesis data analysis ........................................................................................................ 55 7.5.1. Calculation .............................................................................................................................................. 56 7.5.2. Interpretation ........................................................................................................................................ 56 7.6. Second hypothesis data analysis ................................................................................................... 57 7.6.1. Calculation .............................................................................................................................................. 58 7.6.2. Interpretation ........................................................................................................................................ 58 7.7. Presentation of hypothesis tests ................................................................................................... 59 8. Discussion .......................................................................................................................................................... 60 9. Conclusion ......................................................................................................................................................... 63 10. Future perspectives ....................................................................................................................................... 64 11. Appendix ............................................................................................................................................................ 65 11.1. Methods ........................................................................................................................................................ 65 11.2. Questionnaire ........................................................................................................................................ 67 Bibliography ............................................................................................................................................................... 69 4 1. Introduction Usage of portable electronic devices is rapidly increasing in the modern society. Such devices are mp3 music players, smartphones, tablets, etc. They gives us an opportunity to listen to music, talk to friends, and connect to the internet, social websites and many other things at any time and place. People tend to use them on the go, while cycling, driving a car or waiting for a bus. Such devices sometimes engage us to a degree that we pay less attention to what is going on around us what can be very dangerous in a traffic environment, especially while crossing the street or cycling. While driving a car, walking or cycling it is unsafe to use certain portable devices, as they distract attention. Attention in traffic could be distracted in a physical, visual, cognitive or auditory manner [1]. Listening to music through headphones could be an example for such, because the headphones can block the incoming sound and makes it more difficult to hear the traffic. This problem is especially relevant for cyclists, who are moving at the higher speeds than pedestrians and find it difficult to see the incoming traffic from behind. The research done by C. Goldenbeld in the Netherlands in 2009 [1] included 5505 participants, who used a bicycle minimum once a week and showed that almost 70 % of Dutch cyclists used some portable electronic device at least sometimes while cycling [1]. 17% of the participants used electronic devices during almost every bike trips [1]. Listening to music was on the top among other device usage, with 15% of the cyclists doing this almost each trip. Abuse of electronic devices in the traffic often results in road accidents. Further evidence of device use as a risk factor comes from the studies in the Netherlands by de Waard in 2010 [2]. In this research, 3.4% of the crash involved cyclists mentioned listening to music, 0.3% mentioned talking on a hand-held mobile phone, and 0.2% mentioned texting a message [2]. These serious facts draw our interest to find a solution that is able to prevent cyclists from becoming less aware of potential traffic dangers while still enjoying the music. Therefore we would like to research opportunities for creating a system that would enable users to hear music while cycling and get warnings about incoming traffic at the same time. 1.1. Initial problem statement To what extent is it possible to warn cyclists who listen to music through headphones about potentially dangerous situations in traffic? 5 2. Pre-analysis In the preliminary analysis chapter, the target group will be specified and the targeted traffic environment will be defined. Potential causes of cyclist accidents involving music listening will be investigated, as well as different types of headphones used by cyclists. State of the art shall contain research regarding existing products that help to increase traffic awareness of primarily cyclists. Furthermore, possible solutions of the warning system will be explored in regards to both hardware and software. A Final Problem Statement will be made based on the research of this chapter. 2.1. Target group According to the initial problem statement, the preliminary target group is all cyclists using headphones and listening to music. The next step is to investigate whether there is a pattern regarding these statistics in order to possibly narrowing down the problem statement further, which might result in a better goal-oriented project. This will be accomplished by researching the age and gender of cyclists who are involved in traffic accidents while listening to music through headphones. C. Goldenbeld’s made a research in the Netherlands in 2012, which included cyclists using electronic portable devices on a road not only headphones but also handheld devices [1]. He found out that the most frequent and indiscriminate portable device users when cycling in the Netherlands are teens between 12-17 and young adults between 18-34 (see Figure 2.1) [1]. The research also stated that there were no distinctive differences between genders [1]. Figure 2.1 Percentages of cyclists by age group who at sometimes listen to music or use phone while cycling [1]. 6 The higher percentage of cyclists who use electronic devices in those age groups mentioned above was more likely to be involved in traffic accidents where electronic devices were a factor (see Figure 2.2 below). Figure 2.2 Percentage of bicycle crashes which were preceded by portable electronic device use or other factors [1]. Even though minors are more likely to be involved in traffic accidents, it would be much more difficult to access this target group for the test purposes. Besides, children and teenagers do not have as much experience in traffic as adults and their attention abilities to concentrate are still developing, so that by age 20 they are able to concentrate up to 15-20 min, but not before [3]. Therefore based on this data, the target group could be narrowed down to young adults between 18 and 34 years old who listen to music through headphones while cycling. 2.2. Target traffic environment According to statistics published by Organization for Economic Co-operation and Development (OECD)1 more than 60% of all road fatalities in OECD countries occur on rural roads. This number indicates that being in traffic on rural roads are not safer than on urban roads, considering the fact that the traffic flow intensity is generally higher in urban areas compared to rural areas, due to being more populated. Specific research aimed for cyclists showed the same tendency in severe accident occurrence when comparing urban and rural environments. A study by UK Transport Research Laboratory found that the risk of being killed on rural roads is much higher than on other roads, partially due to higher speed limits [4]. Figure 2.3 shows that 1 An international economics organization consisting mostly of European countries including Denmark. 7 nearly half of fatal cyclist crashes occurred in the UK from 2005-2007 happened on rural roads, while the majority of the slightly injured cyclists was involved in accident in urban areas. Figure 2.3 Pedal cyclists casualties by road type 2005-2007 [4]. The study also suggests that in UK, the casualty rate for cyclists is greater in 10% most deprived areas than in 10% least deprived areas [4]. This finding is in accordance with the conclusion reached by Jacobsen (2003) [5] in his study about inter alia walking and cycling in 47 Danish towns. The study stated that a driver is less likely to collide with a person or cyclist when there are more people walking or cycling. When choosing the target traffic environment, road properties also have to be taken into consideration. On many Danish rural roads, the width of the roads is relatively narrow in comparison to city streets. Usually they do not offer separated bicycle lanes, which can be a dangerous combination for cyclists in the presence of bypassing vehicles. Furthermore, rural roads often follow the shape of the landscape, meaning that there are many elevation changes and - or twist and turns. The shortened distance of view due to these crests and blind corners is more likely to add to the danger factor of the country roads. From both the driver’s and the cyclist’s psychological point of view, traffic is generally less expected on rural minor roads than in urban areas. Therefore when traffic does occur, drivers and cyclists are less prepared and therefore more at risk. Due to the discoveries made by comparing urban and rural traffic accident risks, this project will focus on providing warning signals about incoming traffic for cyclists in a rural environment. 8 2.3 Music listening and traffic awareness 3.4 % of cyclists who listened to music through headphones were involved in the accidents, as mentioned in introduction to the report. Therefore the following question arises: in which possible way listening to music might hinder the cyclists´ awareness of incoming traffic? To be aware of what is happening in surrounding traffic, cyclists use two perceptional senses: vision and hearing. However, without paying attention, the two senses might fail to detect potentially dangerous situations. Attention could be described as a selective process in the brain, which involves several mechanisms that are able to restrict processing to a subject of things, places, ideas or moments in time [6]. Cyclists are paying attention on the incoming traffic primary using visual attention. Vision has the highest spatial resolution, meaning that it allows observing everything very quickly in small details [7]. However vision sense in traffic is used in combination with hearing in order to gather more information about the situation on the road. Therefore the auditory sense is also very important for the cyclist’s traffic safety. Attention is divided in case of other extra activities apart from sustained focus on traffic. Cycling and listening to music at the same time reduces the ability to respond to emergency situations, due to decreased attention on traffic. Dr. Richard Lichenstein in his study regarding the relation between headphones, handheld device usage and pedestrian death or injury in USA during 2004-2011, came up with two phenomena that are likely to be the contributors to the possible association between headphone usage and pedestrian injury. They are - distraction and sensory deprivation [8]. The first phenomenon - distraction has partially been caused by inattentional blindness, which basically means “failure to perceive a stimulus that isn’t attended, even if it is in full view” [7]. The second phenomenon described by Dr. Lichenstein’s study is sensory deprivation, which is directly related to the defined target group. Sensory deprivation in terms of hearing means “the inability to hear sounds emanating from the local surroundings” [8]. Sensory hearing deprivation would occur when ears are blocked with some devices such as hoods, earmuffs or headphones, or when surrounding sounds are masked by the music. The same study showed that in 29% of cases where cyclists have been involved in the traffic accidents a warning sound had been heard before crash [8]. The two phenomena, divided attention, which reduces sustained attention on the traffic, and sensory deprivation, which prevents from hearing traffic sounds, are probable causes of cyclist crashes while wearing headphones. 9 Further down headphones are going to be described in general, as they are the main cause of hearing deprivation for cyclists. 2.4. Headphones The target group has headphones on as it listens to music while cycling. Knowledge of different types of headphones would be useful for the testing purposes where a specific type of headphones will be chosen for the test subjects. As there are no facts stating, which type of headphones the target group is using it would be appropriate to get a general information how the headphones work and which types of headphones exist. In this section, different types of headphones are reviewed to find out if some of them block incoming sounds as much as the others. Knowledge of different types of headphones would be useful for the testing part where a specific type of headphones will be chosen for the test subjects. Headphones in general are a pair of loudspeakers that are placed very close to human ears in order to transmit sound into the ear canals. Some of the headphones are totally blocking the surrounding environment sounds, some of them are blocking it only partly and some of them allow the users to hear surrounding sounds perfectly well. Headphones can be connected to MP3 players, TV, home theatres, CD players, sound amplifiers etc. by the wire or they can be wireless. There are different types of headphones that can be found nowadays. Users determine the types of headphones that suite their requirements better. Some of the headphones are made specifically for Apple devices; some are made for Androids the others are of universal usage2. 1. In ear headphones are isolating the surrounding environment totally and do not leave a possibility to hear any other sounds except for the sounds produced from the device it is used with3. Figure 2.3 In-ear headphones 2 3 http://www.earphonesolutions.com/earphones-by-type.html http://www.earphonesolutions.com/in-ear-headphones.html 10 2. Earbuds are not very good mini headphones as they very often come with a cheaper price. Most of them do not block the whole surround environment. Sometime their bass system does not have the same quality as the other type of headphones4. Some of the earbuds have a special purpose of noise consolation, especially if the users need privacy like in the airplanes. Figure 2.4 Earbuds 3. Circumaural headphones5 or full size headphones fully surround ears and that is why they exclude almost all surrounding noises. Figure 2.5 Circumaural headphones 4. Supra aural headphones are smaller than circumaural headphones but in general they also block the surrounding noises. 4 5 http://www.earphonesolutions.com/earbuds.html http://www.headphone.com/selection-guide/full-size-headphones.php 11 Figure 2.6 Supra aural headphones Both circumaural and supra aural headphones can be with opened - and closed back6. Opened have the purpose to open the back of the ear cups giving the sound an opportunity to come out of the headphones making the soundscape more spacious by creating the feeling of the distance to the sound source. They do not isolate users totally from ambient noise, which give a possibility to be aware of the surrounding environment. On the other hand the closed type of the headphones have a smaller soundscape and give the feeling that the sound comes from the users head, as the sound is reflected back to the ears. They have acoustically sealed ear-cups. Noise cancellation can be achieved by two ways: a) Passive noise isolation (in ear headphones work like a plug) b) Active noise isolation (cancelling constant noise only, like a plane engine, but not people talking)7. These types of headphones usually are fully sized headphones.8 It could be summarized that ear buds block the incoming sound partly, while in-ear, circumaural and supra aural headphones block the sound almost totally. The last two headphones can also have opened back of the ear cups, letting some sound in. This means that there are some options for cyclists who want to enjoy the music while cycling and still be safe. However, such headphones also have some drawbacks. Most of them are expensive and sometimes produce a lower quality sounds because of the sounds that leave the ear cup. The blockage of surrounding sounds could contribute to cyclists’ unawareness of dangers. http://www.headphone.com/selection-guide/full-size-headphones.php http://www.earphonesolutions.com/noise-cancelling-headphones.html 8 http://www.headphone.com/selection-guide/full-size-headphones.php 6 7 12 2.5. State of the art In this section different possibilities that already exist on the market and are relevant to the problem will be analyzed. These products might give some inspiration for this project. 2.5.1. Bone Conduction Headphone Bone conduction headphones are specially designed headphones made for cyclists who listen to music in traffic. As a variety of headphones are to be found in the market, these headphones make it possible for the users not to only listen to the music but also to the surroundings. The Bone conduction headphones works by using ones cheekbones as the source to transfer the auditory signals to the cochlea. By doing so, the whole ear will be opened which means that the music will not be the only sound which will be possible to listen to, but the surrounding environment as well. Bone conduction headphones are designed to be placed around the upper ear with a neckband around the neck. It can be turned off and on as well as adjusting the volume. A clip is implemented on the wire of the headphones, to making it more manageable. Figure 2.7 Bone Conduction Headphone In an interview with Laura Laker9, conducted by The Guardian, pros and cons were mentioned. In the interview, Laker reported that, she got struck by how the headphones worked in regard to being able to hear the surroundings while listening to music, but although in overall she found it positive, unfortunately the sound quality produced by the headphones was negative; “The bass was lost”, “tickling sensation on my cheekbones”, “oddly, yawning muted the music momentarily” and “music was drowned out, even with the volume up” [9]. From this example, the ability of allowing cyclists to hear both music and a warning signal or traffic sounds could be taken into considerations. A participant who tested the Bone Conducted Headphones and gave a short review through the interview with the guardian.co.uk [http://www.guardian.co.uk/profile/laura-laker] 9 13 2.5.2. Microphones helping cyclists Through a study Timothy J. Landis in 1996 created an invention that reduces wind noise and wind drags and makes it possible for cyclists to hear the incoming sounds [10]. The invention is a pair of ear covers that is worn as a headband in association with another kind of head protective gear like a helmet. It has an opening towards the rear, where the sound is received. This gives the cyclist a possibility of hearing the sounds coming from the back. In order to minimize the wind noise and wind drags the ear covers are made aerodynamically and are divided into two compartments: an inner and outer compartment, separated by a barrier. Speaker and amplifier are further apart and hereby it reduces undesirable audio feedback. The outer compartment consists of an acoustic receiver, an amplifier and a battery, which is a power supply, while the inner compartment consists of earphones or speakers. When the sound waves are received, the sound waves go through the acoustic receiver where the sound waves get reflected to the microphone and further amplified and transmitted to the earphone in the inner compartment. This invention could be used in order to reduce wind noise and undesirable audio feedback for the cyclist, and making it possible to listen and pay more attention to the traffic especially behind the cyclist. 2.5.3. Sound-Based Traffic Incident Detection The researchers from Mitsubishi electric research laboratories developed the system that is able to accurately detect traffic anomalies such as accidents or near-accidents in road intersections automatically [11]. Because the sounds emitted by the vehicle accidents are fairly consistent, using them for accident detection is an easier task than using video processing. Three different approaches have been employed to this detection problem. The first one is a supervised learning methodology, where the classifiers are trained to different traffic sounds such as vehicle crashes, brake and tire squeals. When applied to the sound stream they are able to classify events. The second methodology is unsupervised learning that detects anomalous sounds and automatically isolates them from the recordings. Finally, a user interface was developed. It allows a human operator to browse through the recordings and find intersections that are the least secure and need more attention. This is a perfect example of how sound analysis can be successfully used for surveillance and safety purposes. 14 2.5.4. Traffic congestion estimation In 2011 Nikhil Bhave and Preeti Rao from Indian Institute of Bombay conducted the research on vehicle engine sound analysis and applied it to the estimation of traffic congestion for intelligent transport systems [12]. Such systems are able to inform drivers about congested traffic and can suggest alternative routes. By extracting acoustic cues from traffic noise the system identifies vehicle type as well as the state of motion. Such cues are expected to be prominent in the shorttime magnitude frequency spectrum extracted from the vehicle sound [12]. Traffic noise database was created from 300 recorded samples, which had predominant sound of the vehicle in traffic. These samples were pre-processed using spectral subtraction in order to enhance spectral and temporal features against the background noise. The pre-processed traffic audio signal is windowed with 180 ms duration Hamming window10. A feature vector is extracted for each windowed segment. Formant and Mel-frequency cepstral coefficient (MFCC) feature vectors are then evaluated. The extracted formants forms the feature vector while MFCC method labels the sounds to one of the three classes: two-wheeler, three-wheeler and heavy vehicles. Using these two methods the system was able to accurately detect up to 96% two-wheeler vehicles, 80% three-wheeler vehicles and 68% of heavy vehicles. The reason why the detection of heavy vehicles was significantly less accurate is because of the similar spectrographic structure to two-wheeler vehicles. This suggests that vehicle sound detection and classification can be a challenging task as well because of the similar acoustic cues. 2.5. Possible solutions for a warning system In this section, possible solutions for a warning system will be explored. Most audio systems consist of two parts: input, where the sounds come in and output, where modified sounds come out. The future prototype might also consist of two most important parts. First part is audio input, which would receive all incoming sounds in real-time and filter out what is unnecessary, leaving will only traffic sounds in our case (e.g. car engine, horn, tire slip). Audio input could be achieved by using microphones. To isolate relevant traffic sounds, audio analysis techniques e.g. filtering could be utilized. The second part of the warning system is an output, or in our case a feedback, which is needed to provide cyclists with a warning. Possible methods for feedback will be evaluated so that the most fitting method for the project could be selected. 10 https://ccrma.stanford.edu/~jos/sasp/Hamming_Window.html 15 2.5.1. System Input For acquiring traffic sounds this project will use microphones. Nowadays, various microphones can be found on the market. Each of the different microphones is used for a specific purpose and therefore it is important to know what type of microphone might be used in this project. All the microphones have something in common. They convert a sound wave into electric signal [13]. On the other hand, they differ a lot as well. This section will describe different types of microphones available on the market in terms of sound recording quality, directionality, size and price. This will let us choose the most suitable microphone for this project. In order to keep the system’s price reasonably low, microphone has to be quite cheap, while still providing enough quality for capturing the traffic sound. The size of the microphone also has a significance. Smaller microphone would be preferred as it is more practical and easier to mount on a body. On rural roads there are not so many cars coming from the sides. This suggests that the project should focus on warning cyclists about incoming traffic from the back. Therefore a microphone that can capture sounds from one direction mostly would be preferred. Firstly, let’s compare microphones by their size. There are two basic types of microphones by their size: handheld and wearable microphones. Handheld Microphones These microphones are well known for their versatility and durability. Handheld microphones are used in variety of settings, from musical performances to television interviews. However, they are not the best choice for reproducing high and low frequencies and require significant energy [14]. Their price starts from around 10 dollars. Figure 2.8 Shure SM58 Dynamic Microphone Wearable Microphones These are tiny little microphones, also referred to as lapel or lavalier microphones that clip to one’s clothes and are usually used for television, theatre or interviews, in order to allow hand-free operation. They are good for capturing consistent audio level, as those microphones do not move that much as handheld microphone [14]. Their price is similar to dynamic microphones, but Figure 2.9 Sennheiser ME 2 omni-directional lavalier microphone. 16 wearable microphones are much smaller and usually provide worse quality than bigger microphones. Directional Microphones One more very important feature of a microphone is its directionality. The directionality of a microphone means the direction from which a microphone is able to pick up incoming sounds [15]. Sounds can come to a microphone from all different directions, from front, rear or only sides, and a microphone will record sounds based on its directionality [15]. There are four basic types of microphones based on their directionality: omnidirectional, unidirectional, bidirectional and cardioid microphones [15]. Omnidirectional microphones These microphones can detect sounds from all directions equally well with good sensitivity [15] (see figure below). Figure 2.10 The directionality of omnidirectional microphone. It can pick up sounds equally well from 0 to 360 degrees all around it [15]. Unidirectional microphones These microphones can pick up sounds with good gain only from a particular side, which means that a user has to speak at the correct side in order to get the best recording sound (see figure below). That is why such microphones are mostly used indoors. Unidirectional microphones could be useful outdoors only if pointed directly towards the sound source. 17 Figure 2.11 Unidirectional Microphone Polar Response [15]. Bidirectional microphones Another type is bidirectional microphones, that can pick up sounds equally well from two opposing faces, that correspond to the front and rear of the microphone while rejecting sounds from the sides. These microphones are perfect for face to face interviews or capturing vocal or instrumental duet. However, mostly they are rather expensive and huge microphones. Figure 2.12 Bidirectional Microphone Polar Plot [15]. Cardioid microphones This type of microphones can pick sounds from front and sides but poorly from back. They are named so because the sensitivity pattern is roughly heart-shaped in nature. 18 Figure 2.13 Cardioid Microphone Polar Plot [15]. Noise interference with a microphone When using microphones there is always a possibility that some unwanted noise will be captured through it and interfere with the system. This can make the process of detection of incoming traffic sounds more complicated. Since the target environment of the project is the rural roads, natural ambient sounds such as wind might be encountered and heavily interfere with the microphone. Denmark is a windy country due to its flat geographical properties. Cycling at speed might also cause air to collide with the microphone. Therefore wind noise could become a problem for the warning system. One approach for achieving a clean recording is by applying sound processing filters which cut out unwanted frequencies. Another approach could be adding windresistant screens on a microphone. Wind noise reduction for microphones is an often discussed subject. Outdoor recordings require wind noise to be dampened or eliminated. The current types of wind screen solutions for microphones are going to be described below. Foam cover is a common solution to reduce wind noise for microphones. Made of disposable foam, the covers are inexpensive and easily replaceable. The cover consists of small open cells, used to dampen vibration caused by the moving air. However these cells might also catch dirt and dust Figure 2.14 Foam windscreen. very easily, which would worsen the detection of the traffic sound and therefore unpractical to be used outdoors by itself. 19 Blimps (also known as Zeppelins) are large, hollow windscreens that surround microphones and are used for outdoor recording. Blimps are effective for eliminating wind noise as they can reduce it by up to 25 dB [16]. The blimp has a hollow cage structure outer frame. Acoustically transparent materials are attached onto the outer frame. The principle behind the design of blimps is to create a space of still air around the microphone. By isolating the microphone from noise sources like this, friction and vibrations noises caused by the wind hitting the cage are much less likely to be transmitted. Though being effective in wind noise reduction, the blimp is not suitable for Figure 2.15 Blimp windscreen. the project due to its large size. The type of windscreens called windjammer, also known as “furry”, is made of synthetic fur material with long and soft hairs. The windscreen is acoustically transparent, allowing good throughput for the sound of interest. The hairs act as shock absorbers to any wind turbulence hitting the windscreen. Windjammers can be slipped on top of other types of windscreens such as foam covers. This way, not only are the open Figure 2.16 Windjammer windscreen. cells in foam covers protected from dirt and dust, the synthetic fur cover can reduce wind noise by an additional 10 dB [16] and is worth considering for the project. A performance comparison shows the effectiveness of a few types of windscreens in dampening wind noises: Figure 2.17 The orange colored areas indicate how much wind noise is being captured by the microphone. The higher the x-axis and color intensity, the wider frequency range and volume of the wind noise are recorded [17]. 20 According to the test the windjammers are most effective in damping the wind noise compared to other covers and could be the best option for the project. 2.5.2. System Output After receiving traffic sounds and detecting only car sounds, the warning system should inform cyclists about the incoming traffic with a feedback. Therefore, it is important to explore possible ways for providing the feedback. When choosing feedback, there are several possibilities. It could be audio, visual or vibrotactile feedback. In this section, description of these types of feedbacks will be given. When talking about feedback it is necessary to remember that feedback is providing a clear visibility for user interaction by sending back information about undertaken or accomplished actions. Another important detail when providing the feedback is to keep cyclist’s attention on the road without distracting it too much. As was mentioned before in section 2.3 Music listening and traffic awareness, cyclists attention is divided already when they hear music and have to pay attention on the incoming traffic. Therefore the feedback should not distract any more attention. Three different feedback approaches: visual, audio and vibrotactile are going to be analyzed below. Visual feedback As seen in the section 2.5 State of the Art, visual feedback is widely used in car warning systems. However, in most cases it helps drivers to pay attention on the road. Since there is already a lot of visual information for cyclists in traffic environment, use of visual feedback would add extra workload to vision, which might distract their attention from traffic. Audio feedback Audio feedback is particularly useful when there is no possibility to use a video screen or non-visual dialogue with the user [18]. As an example to this is a GPS, providing verbal directions to drivers. Audio feedback provides personalization to the users engages them and creates emotional distraction. In some systems, audio feedback is produced by earcons, brief, distinctive sounds. Nevertheless, special care should be taken when dealing with such sounds, as use of obtrusive sounds can become annoying very soon [19]. On the other hand, if well designed, earcon sounds could distract cyclist’s attention from music and produce clear feedback. Since the target group listens to music through headphones, another 21 option for audio feedback could be an alternation of music, which cyclists listen to. It means that when such feedback would be triggered, the music would be alternated by some audio effects, filters, etc. This gives plenty of room for designing pleasant and interesting feedback. Furthermore sounds are used in many human-computer interfaces. Sounds can create affective and emotional reactions. Emotions are creating a dynamic process, which consists of “several cognitive, neurophysiological, and motor components” [20]. That is why sound feedback can be used on the same level as other feedbacks. Emotional reactions could be influenced by the sounds [20]. The quality of the sound has an influence on the users´ emotions and actions as well [20]. For example an alarm sound could provide a very emotional feedback without distracting their visual attention from the road. This signal could make them move in a safer zone of the bike lane. It is the design of the sound that could make interaction successful and warning efficient. Vibrotactile feedback The human body has tactile receptors that respond to stimulation such as pressure or vibration [21]. These receptors are sensitive to changes in pressure over time and that is called tactile vibration [21]. “Vibrotactile feedback stimulates human subcutaneous tissue. It’s been employed in mobile phones, video console gamepads, and certain touch panels” [22]. Mobile phones commonly use vibrotactile as feedback to a number of events, for instance, call and messenger alert. As well as in the gaming industry, vibrotactile is broadly adopted to a lot of game console devices, such as Wii U GamePad11, Xbox 360 controller12 or PlayStation Move motion controller13. A study has shown using vibrotactile feedback on one’s body can sometimes be irritating and annoying, especially when the frequencies are as high as 260Hz [23]. Vibrotactile feedback can be a good choice in many cases. It has been successfully implemented in many applications. However, it is not very practical for warning system as a sensor, which causes vibration, requires external platform in order to be connected to a computer. Furthermore, special care has to be taken when using vibrotactile sensor, as it can be irritating and cause negative user reaction. http://www.nintendo.com/wiiu/buynow/ http://www.microsoft.com/hardware/en-us/p/xbox-360-controller-forwindows#details 13 http://ca.playstation.com/ps3/accessories/playstation-move-motion-controller-ps3.html 11 12 22 Choice of Feedback In the preliminary analysis several topics that are relevant for solving our problem statement were looked upon. Before defining the target group research had been made on cyclist’s accidents and traffic conditions. Only then the target group and the target environment had been defined, which is young adults from 18 to 34 years old on the rural roads. To complete the research about the target group causes of accidents among cyclists were explored. This led the research to a conclusion that a combination of those senses as attention, vision and hearing is crucial in traffic conditions. State of the art reviewed many products that improves safety in traffic. Some creative ideas and ways of thinking should be kept in mind as they could be useful later in the project, when the ways of creating warning system for cyclists will be analyzed. It was decided that the warning system is supposed to have an input through a microphone that would suite the project the most as well as give output in a form of feedback. After studying several possibilities and types of feedback it has been decided to use an audio feedback, as it is the most relevant to the project, when taking budget, testing conditions and time frame into consideration. By finishing preliminary analysis, the project has come to a final problem statement: 2.6. Conclusion In the preliminary analysis we came upon several topics that are relevant for solving our problem statement. Before defining our target group we have made a research on cyclist’s accidents and traffic conditions has been done. Only then we were able to define our target group and target environment, which is young adults from 18 to 34 years old on the rural roads. To complete the research about the target group causes of accidents among cyclists were explored. This led the research to a conclusion that a combination of those senses as attention, vision and hearing is crucial in traffic conditions. State of the art reviewed many products that improves safety in traffic. Some creative ideas and ways of thinking should be kept in mind as they could be useful later in the project, when we analyze the ways of creating warning system for cyclists. 23 It was decided that the warning system is supposed to have an input through a microphone that would suite the project the most as well as give output in a form of feedback. After studying several possibilities of types of feedback decision has been taken to use a sound feedback as it is the most relevant to the project, when taking budget, testing conditions and time frame into consideration. By finishing the preliminary analysis, the project has come to a final problem statement: 2.7. Final Problem Statement To what extent could a sound warning system increase cyclists’ awareness of incoming traffic while listening to music through headphones? 24 3.0. Analysis Based on the areas described in pre-analysis this chapter is going to analyze different possibilities that can solve the final problem statement. In order to build the warning system, several steps have to be undergone. Firstly, the system has to acquire the sound through a microphone and send it to the software that is capable of processing and analyzing the sound. Secondly, after processing the sound, the system has to detect approaching car and determine when to warn a cyclist. Finally, this warning signal has to be sent through the headphones. The following sections are going to research these steps in more details and determine the requirements for a design of the warning system. 3.1. Sound properties from road traffic In order to know when to detect a car and trigger the warning signal, it is needed to find out what traffic sounds are derived from. The traffic sound analysis will use two important sound features - frequency and amplitude as a starting point of the research. Predominant sound sources in traffic are the engine and road–tire friction sounds. At low speed for passenger cars the engine sound is dominant (<50 km/h) while at higher speeds the road–tire sound is dominating [24]. At very high speeds, aerodynamic noise generated around a car may become the main factor of sound source [25]. However this kind of aerodynamic noises usually occur only on highways. However this project will focus only on researching the frequencies and amplitude of engine and road-tire friction sounds. 3.1.1. Frequency domain The engine noise frequency rises with engine revs per minute (RPM) - the higher the RPM is, the higher the pitch is. Due to different types of cars being manufactured for different purposes, engine RPM varies from one type to another. Sports car engines have a higher frequency than engines in passenger cars and heavy vehicles. However, the most common vehicles in traffic are passenger cars, trucks and busses. That is why the project will primarily focus on these types of cars. The engines for these types of vehicles produce noise in a relatively low frequency range, mainly between 50 to 100 Hz [26]. 25 The tire-road friction noise is produced by interaction between the rolling tires and the road. The vibrations and impacts as a result of this tire-road interaction generate sounds at a frequency of around 1,000 Hz [26]. 3.1.2. Amplitude domain Primary the amplitude of the traffic sounds dependent on the frequencies produced by different types of vehicles and on their travelling speed. Heavy vehicles generate more noise than lighter vehicles. Driving at a higher speed increases sound amplitude compared to when driving at low speeds regardless vehicle type [24]. As shown on Figure 3.1, at frequencies of 50-100 Hz, which correspond to engine noise, the amplitude is around 65-78 dB. The amplitude is 60-65 dB for tire friction noise at a frequency of around 1,000 Hz, when the sound source is 10 meters away [26]. However, when the sound source is 1 km away, the amplitude of engine sound just slightly drops to 45-58 dB while the tire noise drops to 28-38 dB [26]. Figure 3.3 shows the correlation between distance, amplitude and frequency. As the distance of sound source increases, the sound volume level (dB) of high frequencies decreases more than at low frequencies. Simply put, low frequencies are able to travel longer distances without weakening too much. The reason for this phenomenon is due to the high frequency being absorbed by the atmosphere, and the mid frequency being absorbed by the ground. As this project needs to detect traffic sound before the vehicles arrives close to cyclists, it is there necessary to find the amplitude levels at the certain frequencies ranges. Figure 3.1 Typical spectrum of road traffic noise calculated at different distances from the source. 26 3.1.3. Target frequency and amplitude detection range In the whole frequency spectrum, the only range interest for this project is frequency at around 50-100 Hz and tire friction at around 1000 Hz. Therefore those frequency ranges should be subtracted out of the whole spectrum while the rest frequencies should be filtered out. As Table 1 shows both engine and tire friction sounds. They are at a certain frequency range with corresponding amplitudes. This would be very helpful to allow the warning system to trigger the feedback accurately by controlling the sound amplitude, as there might be some other sounds in the traffic, which have the similar frequency ranges as engine and tire friction have. The warning system only triggers the feedback when the amplitudes corresponding to frequency range of interest reach certain levels found as in the Table 1. Apart from frequency control, the amplitude control prevents noises in the similar frequency ranges as engine and tire friction sounds from the interfering with the warning system. Frequency Amplitude Engine noise 50-100Hz <=78 dB and >= 45dB Tire friction noise Around 1000Hz. <=68 dB and >=28dB Table 1 Target traffic sound properties However in real life these frequencies could differ significantly depending on the weather, car engine, tires, speed, road, wind, etc. What is more, the system is supposed to warn cyclists some time before a car will approach. It means that when the system is supposed to detect a car, frequency and amplitude levels are going to be different. This issue will be taken into consideration during the implementation process, where different samples are going to be analyzed and specific sound features for this project are going to be found. 27 3.2. Sound computing techniques This section is going to have a look at different sound computing and sound analysis techniques that would help to implement the warning system and make use of the frequencies and amplitude ranges defined in the previous section. The audio warning feedback should probably be with a certain timber rather than a pure tone. That’s where additive synthesis can be handy. In order to filter the traffic frequency range, leaving only the frequencies of interest, subtractive synthesis might be useful. Finally, description of different types of filters should help to choose the most appropriate filter for this project. 3.2.1. Additive synthesis According to Fourier synthesis, one can create any periodic vibration by adding sinusoids together with frequencies that are integer multiples of a fundamental frequency at particular amplitude and phases [27]. That is why Fourier synthesis is also known as additive synthesis where waveforms are added together. There are several rules that must be followed when applying additive synthesis: - Only sinusoids can be combined Frequency of all sinusoids must be harmonically related [27]. Figure 3.2 Additive synthesis. Additive synthesis can be implemented by using several instruments or bits adding them together. However it is the same as to add different frequencies together, as described before. 3.2.2. Subtractive synthesis Subtractive synthesis is a sound synthesis method, in which partials of an audio signal are attenuated by a filter. These techniques are in contrast to additive synthesis where sinusoids are added together. 28 When applying subtractive synthesis frequencies are cut off from the sound. For instance highpass filter cut low frequencies off while lowpass filter cut high frequencies off. Frequencies could be removed in the beginning, middle or end of the spectrum, which is a complete cycle of the constantly repeating vibration. Sound waves could be different as well: sine, saw tooth, triangle, pulse or square. 3.2.3. General knowledge about filters Basically filters are devices that boost or attenuate regions of a sound spectrum [28]. That means that we can make any operation on a signal in order to achieve needed result. We can summate or subtract the signals, depending on whether we want perform an additive or subtractive synthesis. After the operation a new waveform is created with a different spectrum. The main characteristics of the filters are amplitude and frequency. Each filter has its own characteristic frequency response curve. These curves for four basic types of filters are shown in Figure 3.4: lowpass, highpass, bandpass, bandreject or notch [28]. Figure 3.3 Frequency response curves for different filters. The subtractive synthesis could be employed for the traffic sound detection to attenuate the noise while only passing the vehicle sound on the certain frequencies. Then the signal that was passed through the filter offers possibilities to manipulate it further. In the next section, amplitude control will be looked upon. 29 3.3. Envelope follower When the filtered signal of the traffic sound is available, an envelope follower could be used to gain control values from amplitude in a dynamical way. Basically, the idea of envelope follower is to understand amplitude of audio signal in a dynamical way over time, but not on a one by one sample level. The follower could generate an overall shape of a sound in ongoing time (see Figure 3.5 below), and the overall shape could be used to trigger some events depending on one’s expectation. For instance the follower could be used to control the filtered traffic sound in order to trigger the audio feedback. Figure 3.5 Envelope following technique. 3.4. MAX MSP Max is a visual programming language and software for making interactive sound and video applications. It has been widely used by various composers, performers, software engineers and artists. MSP provides a set of audio processing tools that can be used in Max software while Jitter provides tools for video and 3-D graphics14. It has got a range of features to help people create media-rich interactive programs in a very structural and graphical way [29]. First of all, Max allows people to build sound programs in a fluid and visual process. Max programs are created by organizing and connecting “boxes” together. These programs are called patches, and boxes are objects. Each of these objects is a selfcontained program having inlets and outlets, which can receive input and generate output. People can hear or see the results in real time once they made some changes in a patch [29]. Secondly, Max offers a lot of sound computing functions allowing people expand their own ideas to create sound and music. These functions include “samplemanipulation, synthesis tools, high-quality filters, spectral processing, real-time 14 http://store.cycling74.com/s.nl/sc.2/category.2/.f 30 recording and playback.” As well as it allows users to “process, resample, slice, and modulate everything in any combination” [29]. 3.5. Audio feedback After consideration on different possible feedback types in pre-analysis section 2.5.2, audio feedback was chosen as the most appropriate for this project. Such feedback could be realized through the cyclist’s headphones and it would not require usage of sensors and microcontroller as well as would not distract cyclist’s visual attention from the road. A driving alert system in motorized vehicles typically warns the driver by playing a warning message, a series of high urgency tones similar to those from an alarm clock, or both. The effectiveness of auditory warning signals and messages in an in-vehicle information system is dependent on the level of both perceived urgency and annoyance for the driver. The perceived urgency could affect how quickly a driver reacts to the warnings, and annoyance might have an effect on whether or not the user ignores the warning or simply turns off the system [30]. The relation between perceived urgency and annoyance – as well as the factors affecting them when used in auditory alerts and warning messages will be explored to find inspirations for designing warning system audio feedback. Audio feedback is also commonly used in other areas such as computer software interfaces, featuring a variety of sounds, which are played in response to user inputs. These sounds are called earcons. Even though these sounds are not used to indicate dangerous situations in real life, the concept behind their designs could prove to be useful for the project. Auditory alerts Vehicle warning system alerts Alert used in vehicle warning systems is a pulse of sounds that users interpret as warnings to a dangerous situation. The sonic urgency is a perceived urgency of an auditory alert. It should correspond to a degree of danger. This principle is called urgency mapping and is important in auditory alert design. The parameters that affect sonic urgency include inter-pulse speed (IPS). IPS is the time interval between the end of a sound pulse and the beginning of the next pulse. IPS is related to density. Density is the percentage of sound presence during each 31 burst and speed is the time interval from the beginning of one burst to the next one. Figure 3.6 The effect of sound parameters on perceived annoyance and urgency [30]. Figure 3.6 demonstrates the effect of both density and speed on perceived urgency and annoyance. The higher the density and speed are, the more urgency and annoyance is perceived by users, as those parameters correlate. However, density affects more than speed. While the incline in urgency and annoyance are almost identical to density, speed seems to increase urgency, though it does not annoy users too much. These results suggest that auditory parameters do not affect urgency and annoyance equally. Furthermore by manipulation of sound parameters such as density and speed, it is possible to create a balance for urgency and annoyance levels. Besides this, a relatively high-perceived urgency alert might be created, which could also minimize annoyance. The context in which the alert is played also affects the users´ annoyance level. Test participants tended to rate highly urgent sounds as more appropriate when they were presented in the descriptive context of highly urgent driving scenarios. Earcons Earcons are featured auditory alerts in the computer operating system or software. They function as audio cues for the users. Earcons can be a simple beep or more often, a customizable alert with pitch and rhythmic changes. They can also contain some other audio effects, heard in newer Microsoft Windows operating systems when emptying the trash bin, for example, or plugging in a USB device. Due to earcons’ ability to have a complex structure, they could be more distinguishable for the users than a simple beep, making it easier to associate the sound with a certain event. If using earcons for providing audio feedback in the sound warning system, cyclists will immediately understand that a car is approaching. 32 Some of the parameters that could be used to make the earcons more extinguishable are for example timbre, pitch, rhythm, tempo and intensity. The use of timbre with multiple harmonics instead of having pure sine waves in a warning tone can help increase perception and reduce masking of the alert sound. Changes in pitch are effective to increase alert sound perception when combined with use of other parameters, but it is less effective if used by itself. Rhythm, duration and tempo changes are more noticeable to the user even if no other parameters are used. The intensity of the tones should not be consistent throughout playback of an earcon either. The guideline suggests that an earcon should be created with the first note being slightly louder and the last note should be quieter but slightly longer. 33 4.0. Design 4.1. Requirements The requirements for the microphone, the overall functionality of the warning system and the audio feedback for design of the warning system will be established and described in this section. 4.1.1. Functional requirements Functional requirements show what the system can do. The warning system should be able to detect incoming traffic and give a feedback before any vehicle approaches the cyclists. 4.1.2. Non-functional requirements Non-functional requirements represent the constraints in the process of both usage and development of the prototype. Requirements for the microphone: The microphone should be able to detect sounds only from one direction (unidirectional). Frequency response of the microphone should be at least in the range of 50Hz – 1000Hz. The microphone should belong to Lavalier microphones type. The microphone should have a windjammer cover in order to eliminate wind noise. Feedback Requirements: The total length of the audio feedback should not exceed 3 seconds. Even though the cyclist needs to be warned of incoming traffic, they will prefer continuing listening to music as soon as possible. Therefore the audio feedback should be short and efficient. The audio feedback should make use of alerts/earcons, due to of their ability to attract attention. Verbal messages should not be used, since the functionality of the warning system is simple and verbal information are unnecessary for the user. 34 The audio feedback should make use of alerts/earcons, due to of their ability to attract attention. The audio feedback should be able to get user’s attention immediately. The perceived urgency of the audio feedback has to match the situation urgency level, so that the warning urgency level is contextually appropriate in order to reduce annoyance. The music’s volume level should be turned down 30% when the alert sound is triggered to increase sound/noise ratio in favor of the audio feedback. 4.1.3. The prototype is designed for the countryside roads without bicycle lines. 4.1.4. Environment requirements Technical requirements Use of MAX MSP to compute incoming traffic sounds. 4.2. Short description of the system The design of the warning system is represented on Figure 4.1. A microphone is placed on the back of a bicycle. It acquires the surrounding sounds and send them to Max MSP. When the frequencies and amplitude of an incoming car are detected, an audio feedback is triggered through user’s headphones. Figure 4.1 Short description of the system. 35 4.2. Overall structure of the system In this chapter an overall structure of the system including hardware and software is going to be described. As the starting point, the system is supposed to acquire traffic sounds through the microphone. Secondly, the traffic sounds are going to be sent to Max MSP software where low pass filter will be applied on the whole frequency spectrum and leave only the range of vehicle frequencies. Thirdly, envelope follower is going to be used to ensure that these frequencies have specific amplitude which is typical to vehicles in the rural areas. Finally, when frequencies and amplitude meet the target range and threshold respectively, the system triggers the feedback. An overall structure of the system is shown in the figure below. Figure 4.2 Structure of the system. 4.2.1. Input Requirements for the microphone that would suit this project’s purpose were established in the section 4.1 Requirements. However for the testing purpose it was decided to simulate the traffic sounds through the soundscape made in Max MSP. This soundscape would consists of different vehicle samples as well as background noises. For this reason, specific microphone will not be chosen neither tested. 4.2.2. Filtering and envelope follower As discussed in section 3.2 Sound computing techniques, filter could be used to eliminate the noises while only allowing the certain range frequencies (see 3.1.3) of vehicle sound to pass, so that the signal entering envelope follower is mostly traffic sound. The envelope follower is needed to blur the small details of the original sound wave, and turn it into a smooth overall shape over time. By controlling the smooth shape of the traffic sound signal, the feedback could be triggered on an early stage of the envelope. 36 4.3. Sound feedback The context in which the audio warning is played is when a car approaches a cyclist. Since the objective of the warning system is only to make users aware that the car is approaching, and since the approaching car is presumably still distant when the warning is triggered, the urgency of the situation is very low. Therefore the design of the auditory alert aims to produce a warning that gives users low to medium perceived urgency, and a low annoyance level. According to the analysis 3.5, when the alert sound is consisted of a single tone that is repeatedly played, the alert from the sound warning system should have a relatively low density, and medium to high speed – as the increase of annoyance is higher with increase of density than speed. However repetitiveness itself is a source of annoyance, therefore the alert sound for the warning system will be a multiplepitch alert that will only be played once. The difference in pitch of the tones should be relatively small, because large changes of pitch in a short time period might create annoyance. As also mentioned in analysis section 3.5, a higher S/N ratio is needed to make the audio alert stand out in comparison to the music that is also playing through the headphones. Instead of playing the alert sound so loud that it eventually it can be clearly heard over the music, the volume of the music will be gradually lowered to 30% and gradually increased back to its original level when the playback is finished. This could be achieved by manipulating the music through MAX software. In order to make the alert sound more distinguishable for the user and also more pleasant, the design will follow the findings from analysis about guidelines for making earcons 3.5, and focus on manipulating the following parameters of the alert sound: timbre, pitch, rhythm/tempo and intensity. By adding harmonics to a fundamental frequency, the timbre of the sound will change, making the alert sound more three dimensional. A frequency pass filter should be used in order for harmonics to get different amplitude. Pitch alteration could be accomplished by creating different fundamental frequencies and thus different tones, or adding audio effects such as tremolo to the alert sound. Finally, the first tone of the alert sound should be slightly louder than the others, while the last tone should be prolonged and gradually fading out. 37 5. Implementation This chapter describes how to realize the design using sound computing technology. As discussed in the Design section 4.2.2, Max MSP will be used to filter the frequency range of vehicle sounds out of the whole spectrum, smooth this sound signal by using envelope follower, and trigger sound feedback based on a certain level of amplitude. During the implementation, sound analysis in frequency and time domains is conducted first. Thereafter additive and subtractive synthesis, and enveloping sound computing techniques are employed to accomplish the design decisions. 5.1. Sound source delimitation According to the design of the warning system, the traffic sounds are supposed to be transmitted from an omnidirectional microphone to Max. However, the initial focus of the sound warning system prototype is to detect car sounds without too much noise interference. In order to keep the external programming environment stable and consistent, the sound sources are delimitated from live microphone recording to three different samples of commonly heard vehicles’ sounds in rural area (passenger car, bus and truck 3.1.1). This benefits the project by keeping the focus on sound computing techniques and removing some issues, which might affect the usage of the prototype in real world. Such issues could be wind noise, different modules of other vehicles, and some sounds from nature remain fluctuated energies in the filtered frequency rang across the time domain, which might be extremely hard to be taken into consideration while programing in this project. Therefore the prototype will use the soundscape made in Max MSP as a sound source. 38 5.2. Development Figure 5.1 below describes the flowchart of the implementation. Traffic sounds are simulated through the soundscape in Max. Then the low pass filter is applied, which passes through only the vehicle sound frequencies. Envelope follower analyzes the amplitude of these frequencies and if the amplitude is increasing and reaches a certain level it sends a bang, which triggers the feedback. Figure 5.1 System flowchart. 5.2.1. Sound file play The first step to do in the Max patch is to play traffic sound by using sfplay~ object (see Figure 5.1). The left inlet of sfplay~ connects to an “open” message with the name of the sound file which is going to be played, and a Toggle object with an output of 1 or 0 to start or stop playing. The right inlet of sfplay~ is used for controlling the speed of sound playing18. Every sound sample in the soundscape is triggered using this method. Figure 5.2 Sfplay~ object. 18 http://cycling74.com/docs/max5/refpages/msp-ref/sfplay~.html 39 5.3. Soundscape Soundscape is a set of sounds arising from a specific environment, which is the rural environment in the case of this project. In order to simulate the rural environment soundscape, three different vehicles sound samples were used: car, bus and truck19. Besides these samples, background noise sound samples such as bicycle bells, birds and general noise were used. Wind noise couldn’t be used as it aggravates the detection of the vehicles and sometimes causes false behavior of the system. Soundscape is implemented by using a line object to generate timed ramp, which is set to 2 minutes and 10 seconds (see figure on the left). Sounds are played by using if statement. When a certain period of time have passed, specific sound sample is triggered. For instance, when 5 seconds have passed, metro object gets a bang and begins playing background noises at random time intervals and order. Figure 5.3 Part of the soundscape implementation. A car sample is triggered after 20, 56 and 117 seconds. Usual personal computer’s sound card is not able to play music through more than one external device without using an external sound card whereas this project needs to play music and audio feedback through headphones, and soundscape through speakers. Therefore a solution is needed to divide these sounds to two output devices. For this reason, two computers containing the same soundscape are used. According to within subject testing, one participant has to test two conditions in this project (see 7.4). In order to avoid the participants having learned the pattern of the only one soundscape, another soundscape is needed to randomize the order and interval of the vehicles’ sounds. So that one participant would never experience the same soundscape twice. When one of two soundscapes is triggered, a respective message is sent to the second computer to play the same soundscape at the same time through the speakers (see Figure 5.4). This is done with a help of Maxhole object. Maxhole allows sending and receiving data wirelessly from other computers running on a local network. Everything that goes into Maxhole object is received by 19 Downloaded from http://www.freesound.org/. 40 the same object on the other computer. This is a very simple and efficient way to start two soundscapes at the same time. Figure 5.4 Implementation of wireless communication between two computers running on the local network. Sender on the left, receiver on the right. 5.3.1. Lowpass filter According to Figure 5.1, the traffic sound is filtered before going further to envelope follower. As discussed in Analysis and Design, the low frequencies of vehicle sound at the range around 100 Hz should be passed by using subtractive synthesis technique. Onepole~ object is a low pass filter, which has a slowly attenuated cutoff20. A couple of onepole~ objects are employed to get this frequency range. As Figure 5.5 shows the first object has a cutoff at 40 Hz, the original signal minus this low passed signal is actually the rest frequencies in the spectrum, which go to another onepole~ with a low pass cutoff at 150 Hz. After the calculation, the finally output is somewhere between 40 Hz and 150Hz. Figure 5.5 Onepole~ lowpass filter. 20 http://cycling74.com/docs/max5/refpages/msp-ref/onepole~.html 41 5.4.3. Envelope follower Before the signal that passed through the filter goes to the envelope, it firstly has to be converted to only positive numbers. Therefore the signal to the abs~ object, which takes any signals and outputs only rectified wave- the non-negative values21. It is because this envelop drawing needs positive numbers. Rampsmooth~ object is to smooth an input signal through the two arguments of this object defined as ramp-up and ramp-down samples. This object provides a hill-like curve with ramp-up on its left side and ramp-down on its right side. In order to assign those samples to rampsmooth~, another object mstosamps~ is used to convert milliseconds to samples. However, the number of samples converted completely depends on the setting of sampling rate of Max, which can be seen on Audio Status (See Figure 5.7). Those samples are visualized on a multislider as continuous little black hills (See Error! Reference source not found.). This project uses 1000 milliseconds that equals to 44100 samples, which means each time the input signal changes, a linear ramp bridges to this value with a mount of samples depending on the sampling rate. Moreover the snapshot~ is used to convert the signal values to numbers, as in order to draw the envelope in multislider, the input mush be numbers. 1ms interval is used for snapshot~, which means that this object converts one number from the input signal every millisecond. By doing this, the object provides a smooth curved hill, which is easy to recognize while the vehicles are approaching. 21 http://cycling74.com/docs/max5/refpages/msp-ref/abs~.html 42 Figure 5.6 Implementation of the envelope follower. Figure 5.7 Audio status. 43 5.4. Feedback The output number from envelope follower is a fluctuated number going up and down constantly. As discussed in Design, the feedback should be triggered when the vehicle just starts approaching to the cyclist. Apparently the triggering point is on the left foot of the hill. The triggering point should meet two circumstances in order to trigger the feedback. First of all, the amplitude should be going up while it is triggered. Secondly, the amplitude should be on a certain levelhill’s left foot. To get the advantages of the amplitude going up and down, the change object is employed. It outputs a message with number 0 if the amplitude is decreasing and a message with number 1 if it is increasing (see Error! Reference source not found. on the left). The right inlet of route object gets the 0 or 1 as its selector, which is going to compare with the route number. If the left input matches the selector, the left output gives a bang. The left input is determined by a switch object with three inputs referring to three amplitude thresholds. As seen in Figure 5.9 these thresholds are different depending on the vehicle’s type (passenger car, bus and truck). Setting different amplitude thresholds allows to trigger the feedback as soon as the vehicle’s sample begins. Figure 5.8 Change object. Figure 5.9 Switch object and different amplitude thresholds. The audio feedback is created using additive and subtractive synthesizes techniques, and envelopes. Six harmonic signals are put together, which afterwards goes 44 through a band pass filter and an envelope. The reason for using band pass filter is that the very low of the original signal give some clipping sound, but they could not be completely cancelled, since they contribute some timbres to the audio feedback. That is why the band pass filter is used to attenuate the clipping sound. After the system was implemented, it became possible to test the hypotheses that are defined in the following chapter and prove them wrong or right. 6. Methods In this chapter of the report methods are going to be described, which will be used for testing the final problem statement. DECIDE framework is a helpful to navigate through the test procedure. The first step in test according to DECIDE framework is to set up a main goal, which is to find out if the prototype could help cyclists be more aware of incoming traffic in case of hearing deprivation. Experimental design approach would help to find out the cause of the relation between two set of data, which is exactly what the test is supposed to do. To be able to accomplish the goal, the project has to set up hypothesizes according to our expectation. When setting up hypothesis independent and dependent variables have to be bared in mind. As well as one tail and two tails conditions have to take into consideration when establishing hypothesis, because this is crucial for the later calculation and interpretation of t test. In order to interpret the t test it is also necessary to compare the results through paired t test. Qualitative data will be collected as well through questionnaire after the testing of each participant. Its purpose is to collect subjects´ opinion on the feedback of the sound warning system. Subjects´ opinions about sound feedback could be applied in the future for any kind of improvements of the sound warning system. 45 6.1. T-test The T-test22 is a classical statistical method, which is commonly used for detecting difference between average scores of two groups. The first one-tailed T-test will help to answer how many cars the subjects could detect with and without the sound warning system, which will prove or disprove the hypothesis, if the prototype could help the subjects to detect more cars rather than those who don’t use the prototype. Controlled group will be expected to detect fewer cars than treatment group. That is why one-tailed T-test in the positive direction23 is going to be used for the experiment [31], as it is expected that the mean of the treatment group is greater than controlled group. This test will disregard the possibility of a relationship to the negative direction. Reason for this is that the system is expected to detect more cars than the subjects who don’t use the prototype. In the second two-tailed T-test it would be interesting to find out how the reaction time of the subjects differ in case of the sound warning system usage. The second test can show the reaction time difference of the prototype usage by the subjects. The independent variable is: feedback (if the test is with or without it). The dependent variable is: reaction time (of the subjects). Paired t-test Paired t-test is going to be conducted on the same subjects, which will save the time and help to compare the results in a more efficient way, as one population mean sample is going to be paired and compared to the other sample[32], as treatment and controlled groups are presented. It is not necessary that a humanly built system will give better results. Although in case of the project subjects´ hearing sense are deprived, which gives a perfect opportunity to build a system providing warnings quicker and be useful. Designing the experiment This research is going to be based on a true experiment and validate only one testable research hypothesis [31]. English chemist William Sealy Gosset developed student test, when he worked for the Guinness brewery in Dublin, Ireland, under the restriction in the contract that did not give him any possibility to publish his work. research. Therefore, the publication of his articles on the t-test Gosset did in 1908 in "Biometrics" under the pseudonym "Student". The simplicity of calculating Student t-test, as well as its presence in most statistical packages and programs has led to be widely used around the world. 22 23 http://www.psychstat.missouristate.edu/introbook/sbk25m.htm 46 Two groups will be involved in the experiment: a treatment group and a controlled group. The first group will be used to test a warning system. The second group will be involved to prove a significant difference of not using the sound warning system. As the test would like to exclude individual differences, a larger number of participants are needed. To escape a larger amount of the participants for T-test, it would be preferred to apply within – group design approach. However, within group has potential bias- learning effective. That is why two random soundscapes were created for two subject groups. Random sampling is going to be applied to the choice of testing subjects, as it is the most commonly used sampling method for true experiment. Another very important factor concerning random sampling is that it would be more practical to apply convenient sampling, as the test subjects are going to be chosen from the one’s availability [33]. And the last problem of within-group design is escaping fatigue of the test subjects [31], which could be done by shortening the testing time [31]. Participants Test subjects will be recruited from Aalborg University, M.C. Meyers 15. There will be 30 participants in a whole. Participants will be picked up randomly. However they have to be between 18-34 years old and hear the music through headphones while cycling in traffic. Before testing the participants will be informed about the procedure of testing, about goals and purpose and asked their permission to use the results in the further scientific research. This form can be found on the exam CD. 47 7. Testing 7.1. Test objectives The data that testing of the prototype product collects should be able to answer whether or not - and to what extent the prototype could increase cyclists’ awareness of incoming traffic while listening to music through headphones. However, the word “awareness” is too general to design a test and the project should specify the exact arguments that will be used to indicate an increase or decrease in awareness. Therefore it is determined, that the test will have two specific objectives: 1. Time needed to detect a vehicle. In many studies regarding traffic awareness for drivers, reaction time is used as the gauge for how aware the driver is about the surrounding environment. The reaction times measured in those studies are for example the time interval from an obstacle appearing on the road until the driver presses the brake pedal. For the cyclist warning system prototype, the test could be designed similarly to a driver reaction experiment. The “time needed to detect a vehicle” could be interval from the time playback start for a car sound sample in the Max soundscape, until the user recognizes the incoming car sound and takes action by pressing a button. If the reaction time is shortened by using the cyclist warning system, it could be argued that the system helped to increase awareness. 2. Incoming traffic identification ability measurement. 48 It is questionable whether cyclists are able to identify incoming cars in time and take actions such as steer towards the side of the road when necessary. Therefore another objective of the prototype product test is to examine the presumable increase in the number of cars that each participant is able to recognize and respond before the car is alongside. Since the test is not taking place in a real life situation, the theoretical point where the car is alongside the test participants would be the peak in amplitude of the car sound samples that are played to the participants. When not using the warning system, a participant presses the button when he/she hears the car samples; when using the system, the participant presses the button when given the audio feedback. If a test participant presses the button before the amplitude peaking in a sound sample, then it means that they have recognized the car early enough to be able to respond with an action. The ways of collecting and analyzing testing data are similar for the two test objectives. For time measurement, the results analysis compares calculated time interval between start of the car sample playback and button press from test participants when using and not using the warning system. For incoming traffic identification ability measurement, the time of amplitude peak in of the car sound samples are compared to the time of corresponding button presses; missed identification of car sample by test participants will be treated as recognizing too late. The calculated results will differ from each other, one set of the results will show changes in reaction time, and the other set of results will indicate changes in the ability to identify incoming cars and take action before the cars reaches alongside. The first set of results will be calculated in time/seconds, and the second set calculated in number of cars, but in the end, analysis of both sets of results will indicate whether the warning system could potentially increase traffic awareness for cyclists who are wearing headphones. According to traffic sound properties research in analysis 3.1.3, the sound amplitude of traffic 10 meters away is around 78 dB, which was further confirmed prior to testing by us using a sound level meter standing beside a rural road to measure amplitude of cars passing by. The measurements from the sound level meter read roughly between 72 dB to 84 dB. Both speakers are therefore adjusted to play the car sound samples to the test participants at about 78 dB to simulate traffic sound loudness in real life. 7.2. Environmental settings Controlled testing environment was set up in a sound laboratory in the Aalborg University Copenhagen. The test needed to be conducted in this laboratory to isolate 49 the test from potential outside noises. Sound laboratory has the best conditions as the walls contain sound isolation materials to avoid sound wave reflections from the walls as well as to avoid unnecessary noises. Figure 7.1 Testing environment. Equipment for testing - Training gym bicycle; IPhone chronometer app; Circumaural headphones without total sound isolation; 2 Computers (1 for plying the traffic sounds through speakers, 1 for playing the music and the feedback); 2 Loudspeakers; Since the testing took place in a sound lab and the purpose of testing was to figure out when participants reacted to the feedback it has been decided that the equipment used for the testing should be slightly different from what has actual been designed for the prototype. Instead of a real bicycle, training bicycle from a gym was used. In order to know when the participant reacted to the audio feedback an IPhone app was used, where the participants had to respond by pressing a button. Each time the participants pressed the button on the app, the time was registered. The IPhone was placed closed to the cycle’s handlebar so it would be easier and quicker for the participant to press the button. 50 Besides, two computers were used: one was used to play the soundscape through speakers and the other to play the music and produce feedback through headphones. In order to play the traffic sounds two loudspeakers were used where one was placed in front of the bike and the other was placed behind the bike. Sony-MDR xB500 headphones26 were chosen for this test. They belong to circumaural, closed back headphones type. They are capable of producing high quality sounds in extremely wide frequency range: 4 – 24,000 Hz as well as very loud sound output reaching up to 104dB. Even through, as described in section 2.3 circumaural headphones usually block the most of the incoming external sounds, practical experience using these headphones outside in the traffic revealed that traffic sounds are still audible depending on the music volume. Instructions Test subjects were tested two times, according to the design of the experiment. 1. Controlled group The first group listened to the music through the headphones while the traffic sounds were played through the loudspeakers. This group did not get any feedback from the sound system. Instead the participants had to detect the potential danger from the approaching traffic by simply trying to hear the surrounding sounds. As soon as they heard the sound of an incoming vehicle they were asked to press the button (measure application on the IPhone), registering the time. 2. Treatment group The second group was using the same equipment: headphones with music playing and the traffic sounds playing through loudspeakers. However, in this test participants got the warning signal when a vehicle was detected through the headphones, which they had to react to. Their reaction time was registered the same way as the first group, by pressing a button. The only difference was that they had to respond as soon as they heard the warning signal instead of the vehicle sound from surroundings. 26 http://www.sony.co.uk/product/extra-bass-dj-headphones/mdrxb500l.ae 51 7.3. Pilot testing A pilot test is conducted before the actual test, in order to avoid problems later on when conducting the T-test. Often problems might occur when collecting the data, giving instructions to the participants, setting the environment, choosing methods or designing the test itself. There were 10 subjects that were chosen randomly for the pilot test. Each subject was asked to give a feedback about the sound warning system, testing procedure and audio feedback signal. Several technical errors occurred during the testing procedure. Those errors were corrected straight after they were detected. General improvements of the sound warning system was made after the pilot test as well. By comparing two types of most commonly used headphones- earbuds and Supra aural headphones, we found out that due to the difference of participants´ pinna, sometimes the earbuds fell off the ear while subjects were cycling. As well as people using Supra aural could still hear outside sound depending on music loudness and it was more comfortable for the participants (see 7.2). Therefore it was decided to use supra aural headphones. Furthermore it would contribute the T- testing with more reliable results. During the pilot test the sound volume level had been tested as well. The reason for this was to find out an optimal sound volume that was the same as the subject used to hear on. 10 subjects listen to music through headphone while giving a gesture to increase or decrease the volume. The average volume was 20% among these subjects. Because MacBook Pro was used in the testing the audio output was adjusted in the sound preferences settings. Down below it will be referred to as the percentage of the audio output of MacBook Pro. The participants were irritated about constant interruption of the music when the feedback was provided. It had been decided that the sound volume level of the music should not be lowered to 0% but to the level of 30% in Max sound level instead. Above all the participants were unsatisfied with the given instructions as they were unclear. That is why it was decided to leave one person in the room together with the participants and changed the instructions for the T-test. Beside this person would be giving a signal to a participant when the test would be finished. If 52 participants have pressed the button unintentionally, he was asked to raise a hand so that the tester would be able to take the note of the mistake. As the instructions about feedback and its sound were confusing it was decided to let the participants hear feedback first before starting the actual test. Furthermore several technical errors appeared during testing. Software didn’t work optimal throughout the test, as MAX crashed several times that is why the particular computer that was playing files from Max was changed to another one, which did not have problems with this program. 7.4. Hypotheses test For the final hypotheses test 30 participants have been picked randomly. There was 21 male and 9 females with an average age of 21 years old. Null and alternative hypotheses for two tests are defined below. First t-test (one tail) Null hypothesis: The average number of vehicles detected in time without prototype is greater or equal to the average number with prototype. Alternative hypothesis: The average number of vehicles detected in time without prototype is less than the average number in with prototype. Second t-test (two tails) Null hypothesis: There is no significant difference in the time when the vehicles are detected between with and without the prototype conditions. Alternative hypothesis: There is significant difference in the time when the vehicles are detected on time between with and without the prototype conditions. Data preparation The actual data collection provides a set of primitive data, which maintains some errors and inconsistencies. So that the primitive data should be deciphered, formatted and organized before it could be used for statistical calculation. As described in the test design, a chronometer is used for the participants to trigger a feedback as soon as they get either an audio feedback through headphone or the vehicle approaching sound through the speakers. So that the data is in an ordered list in a multi-time format looking like “hh:mm:ss:msms”. However, in order to test the first null hypothesis, the only interesting element of the data is how many times the participants pressed the chronometer before the vehicles passed by, which is considered valid button press. By analyzing each of the 53 vehicle sound waveforms, the time correspond to the waveform’s peaks are recognized, which enables us to eliminate the invalid button press. For the without prototype condition, a number of invalid button presses are recognized, but none form with prototype condition. In order to utilize the data to do the second t-test, which has to do with the time difference of vehicles approach between with and without prototype conditions, the original data need to convert to a single time unite- second. In form of single time unite the data could be put directly into some statistical analysis programs to generate the results. Number of participants 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 Mean Ratio Standard deviation 1st t test With feedback 9 9 9 8 9 9 9 8 9 8 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 8.9 98% 0.3051 Without feedback 2 2 3 2 0 4 3 3 3 2 2 1 7 2 3 7 5 6 3 4 4 3 6 9 3 4 2 2 0 4 3.36 37% 2.0424 2nd t test With feedback 1.57 1.29 1.54 2.4 1.62 1 1.63 1.85 1.31 1.42 1.39 0.89 1.02 0.96 0.51 0.7 0.37 0.75 1.34 1.19 0.94 1.24 0.61 0.28 1.06 0.69 1.47 1.33 0.94 0.65 1.13 Without feedback 5.18 4.18 3.95 4.05 5.21 4.3 4.8 4.28 4.82 5.11 4.62 5.34 3.45 4.43 4.35 3.56 3.7 3.76 4.51 4.07 3.88 7.82 3.92 3.07 4.42 4 5.04 4.34 5.58 3.76 4.45 0.4664 0.8776 Figure 7.2 prepared first hypothesis test data 54 7.5. First hypothesis data analysis Distribution of data corresponding to the first hypothesis. Figure 7.3 Distributions of with prototype (left) and without prototype (right) for the 1st hypothesis test Figure 7.4 Quantile-quantile plot of with (left) and without (right) prototype Apparently, Figure 7.3 and Figure 7.4 show that the without prototype condition almost fits normal distribution, but with prototype condition is not normal distributed. Because number 9 on X axis is much more prominent than the rest numbers on X axis. Actually, except number 8, 9 is the only column among the whole graph. This could be caused by the small sampling size, which are only 30 participants. However, as the inferential statistics always assume that the data fits normal distribution, so we can use this data to do t-test. 55 7.5.1. Calculation By using the Matlab27, some code is entered to conduct the t test to exam the null hypothesis. The following is a piece of pseudo code for analyzing t-test: [h,p] = ttest (without feedback, mean of with feedback, significant level, tail left) 𝑡= 𝜇1−𝜇2 2 2 √𝜎1 +𝜎2 = 15.06 ; 𝑛−1 h =1 The part of the test report generated by GraphPad28 shows as following: Paired t test P value Significantly different? (P < 0.05) One- or two-tailed P value? t, df Number of pairs < 0.0001 Yes One-tailed t=15.06 df=29 30 7.5.2. Interpretation The returned values of h = 1 and p< 0.0001 determine the t test to reject the null hypothesis at the 5% significant confidence. The rejection further means that the probability of obtaining the sample means completely by chance is too small that there is a backup for the alternative hypothesis. Another way to reject or accept null hypothesis is to check the t table, in order to see if the t value calculated from this data falls into the rejection area or not. In this data, the degree of freedom is 29, significant level is 0.05, and one tails, so that the corresponding t critical value associated with that is 1.699, which is much less than the t value 15.6, so the t falls into the rejection area. Thus the null hypothesis should be rejected on the significant level of 0.05. According to the considerably big t value, the mean of with prototype is significant 27 28 http://www.mathworks.se/index.html http://www.graphpad.com/guides/prism/6/statistics/index.htm?stat_checklist_pairedt.htm 56 higher than the mean of without prototype, based on which we can infer that with prototype, the cyclists can detect 5 (8.9-3.36) more vehicles out of 9 than those without prototype on the statistical significance of 5%. 7.6. Second hypothesis data analysis The average time difference in with prototype condition is 1.13 seconds, and this difference is 4.46 second in without prototype condition. Distribution of data corresponding to the second hypothesis Figure 7.5 Distributions of with prototype (left) and without prototype (right) for the 2nd hypothesis test Figure 7.6 Quantile-quantile plot of with (left) and without (right) prototype for the 2nd hypothesis test 57 7.6.1. Calculation 𝑡= 𝜇2−𝜇1 √𝜎1 2 +𝜎22 = 21.44; 𝑛−1 h =1 The part of the test report generated by GraphPad shows as following. Paired t test P value Significantly different? (P < 0.05) One- or two-tailed P value? t, df Number of pairs < 0.0001 Yes Two-tailed t=21.44 df=29 30 7.6.2. Interpretation The calculated values of h = 1 and p< 0.0001 provide a very clear evidence for the t test to reject the null hypothesis at the 5% significant confidence. The alternative way to accept or reject null hypothesis is to check if the t value falls into the rejection area. According to the t table when the degree of freedom is 29, significant level is 0.05, and two tails, the t critical value associated with this data is 2.045, which is far less than21.44. Obviously, the t falls into the rejection area, the null hypothesis should be rejected. The considerably big t value indicates a prominent difference of means between with and without prototype. The rejection of null hypothesis also hints that to obtain the sample means by chance is very seldom, so that the alternative hypothesis is accepted. Such a big significant difference means that the incoming traffic can be detected 3.32 (4.45- 1.13=3.32) seconds earlier with prototype than without prototype. 58 7.7. Presentation of hypothesis tests The first t test proves evidence that the prototype could help cyclists increase awareness about incoming traffic to the extent of how many more vehicles the cyclists could detect with help of the prototype rather than those who don’t use the prototype. The second t test evidences that the prototype could help cyclists increase awareness about incoming traffic to the extent of how much earlier the prototype could help cyclists detect vehicles than those who don’t use the prototype. Therefore, based on the two hypothesis tests, the conclusion is that the prototype could help cyclists increase awareness about incoming traffic rather than those who do not use it. 59 8. Discussion Test results of both time needed to detect the vehicles and traffic identification ability measurement indicated that the sound warning system could potentially increase the traffic awareness of cyclists. However, further product testing with ideal settings in a realistic environment is needed to confirm the effectiveness of the warning system, as several factors could have contributed to the test results. As mentioned in methods, all test participants are students from Aalborg University Copenhagen, but it is possible that some of the participants do not have the same experience and ability to identify incoming traffic from a far distance as other participants who bike regularly on rural roads. In the city, cyclists have fewer reasons to be on constant alert of approaching cars due to bicycle lanes on most of the roads. Slower speed limits in the city also means that the cars could sound slightly differently than the samples played to the participants during the test – as written in the analysis that at low speeds, tire and air friction sounds are less dominant compared to the engine sounds than at higher speeds. Therefore the participants who ride on rural roads regularly might have had an advantage when identifying cars that are approaching from far away. For example, the test conductors learned that participant number 24 bikes regularly on rural roads and also listens to music while cycling, and the results showed that he could detect all cars before they arrived without using the feedback. On the other hand, the warning system was able to shorten the participant’s reaction time due to the fact that MAX MSP program would detect cars much earlier than the participant could, therefore traffic awareness was still increased for the participant when using the warning system. What can also be discussed is whether there is a difference in testing the prototype in a lab or testing in a rural road. The sound from the MAX soundscapes, which composition consisted of pre-recorded audio samples made it difficult to recreate a very realistic sound environment. Playback of samples through speakers and headphones mean that there is an unavoidable loss of sound quality. Besides audio quality, there could be more types of ambient noises in the real world than the samples used in the MAX soundscape. More importantly, the sound absorbing foam material in the auditory lab would have changed how sounds are reflected back to the test participant, and therefore sound acoustics properties would not be the same. The lab’s foam absorbers, for example, would result in absence of reverb and echo, in contrast to what happens in reality. A possible solution to improve the realistic aspect of sound playback in an audio laboratory would be to have very high 60 quality sound samples, as well as multiple speakers that could simulate the direction of sounds, echo and reverb effects etc. The test results could also have been different if it have been tested in the real environment than in a laboratory, since on the rural road, the test participants would have more distractions to divide their attention such as changing surroundings and the attention need to riding on a bicycle. There are no distractions in the laboratory so the participant only had to concentrate on traffic sounds or audio feedback; therefore reaction times could be higher compared to the test results. The participants that were fast enough to press the button before the cars passed by when using the warning system might not have been able to identify as many cars in time rather than if the test had been conducted on a rural road. Limitations during implementation meant that the prototype in many ways did not resemble how the final product should detect traffic sounds. An obvious issue with the prototype was that it was only analysing pre-recorded sound samples, instead of actual sounds captured through a microphone in real time, as originally intended in product design. Even though in theory, the prototype would be still efficacious if the microphone would work as intended, further testing would be needed to ensure that the extra process and hardware would not affect the performance of the product. The sound warning system prototype detected cars from the samples by first filtering out most of the frequencies and only allowing sound waves within a narrow range of low frequency to pass. This limited frequency range is where the car engine sounds would be, and once the amplitude of the filtered audio reaches a certain threshold, the audio feedback is triggered. This way of detecting incoming cars has an obvious drawback, as if there are other noises within the car engine frequency range, they are likely to trigger the audio feedback and create false warnings. Therefore it is necessary for the study to go back to the drawing board and explore alternative ways to detect cars, which should reduce the potentiality of false warnings being triggered. The frequency filter of the warning system prototype was able to filter out noises such as bicycle bell rings and bird tweets. But as an example to the less than ideal car detection method mentioned above, wind noise was found to trigger the system unintentionally, as some of its frequencies overlap with the car frequency range where the prototype was set to trigger audio warnings. Another matter that should be investigated for further improvement of the sound warning system design is whether the low frequency band pass filter would be able to detect all types of cars. The filter currently detects sounds emitted from internal combustion engines. But due to evolving technology, many cars on the road in the future will be powered by electric motors, which would not trigger the warning system as its noise is outside of the filter’s current frequency range. Even if the filter is changed to match the electric motor frequencies, the sound volume emitted from 61 the electric motor might still be too quiet to be detected. Therefore the warning system should also consider using other sounds emitted by cars to trigger the audio warnings, such as friction noises from the tires and cars travelling through air at higher speeds. The design and implementation of the audio feedback warning is another topic of discussion, as the implementation of audio feedback did not entirely comply with its design. For example, the prototype was originally implemented to shut off the music completely when the audio feedback is triggered. But during the pilot test, it was understood that the interruption of the music was an annoyance factor and therefore the warning system was then adjusted to lower the music volume to 30% instead of simply stopping the music every time the sound feedback played. According to results of the questionnaires (see 11.2) answered by the test participants after testing, 8 of 30 participants found the audio feedback annoying, even though one of the requirements for the audio feedback design was not to annoy its users. Similar to the music interruption issue, the actual implementation of the audio feedback did not follow all aspects of the design. For example, it was stated in design that the feedback tone should not be repetitive, and that the rhythm of the sound should vary. But the implemented audio feedback was a single harmonized tone that had a typical vibrato effect, which means that it was somehow repetitive and the rhythm did not vary enough either. Even though the majority of the test participant either found the audio feedback to be neutral or pleasant, making the audio feedback more compliant to the design and the research findings in analysis might further reduce its annoyance level to the user. 62 9. Conclusion The project is based on the problem that when cyclists are listening to music through headphones, traffic sounds are masked to an extent by the headphones and the music. The preliminary analysis contains general research about the risks of cycling in the rural roads while listening to music through headphones and ear buds. It was found that the risks of cycling in rural areas are higher than urban areas, and the target group research showed that cyclists in the age group from 18-34 year-olds has a high risk of being involved in a traffic accident. Therefore cyclists within this age group were chosen as the target group for the project; and the project was further narrowed down to rural roads as the target environment. In state of the art, a current solution to help cyclists becoming more aware of incoming traffic by reducing wind noises heard by the cyclist using ear covers, and amplifying other ambient sounds through microphones gave inspiration to the product design. Microphones were also found as an affordable solution for inputting relevant data to the warning system. A drawback of using microphone is that it is easily interfered by wind noises; however the study found that windjammer such as foam covers and “furies” could effectively eliminate wind noise interference. Different options to deliver feedback to inform the cyclists about incoming traffic were explored. Within the options of audio, visual and vibrotactile feedback, it was determined that audio feedback would be the most appropriate choice for the project due to the fact that it could be utilized without high annoyance to the user, and at the same time allow the user to keep the visual attention on the road. In the analysis chapter, it was discovered that typical vehicles in traffic generate noise around 70-80 dB in the frequency range from 50-100 Hz. This data would later be used for implementing the sound warning system. Guidelines for audio feedback design helped the project in making an effective audio warning without the feedback being annoying to the user. An audio warning sound similar to an earcon was made in MAX by manipulating parameters such as pitch, harmonics and duration. The sound warning system itself utilizes sound processing techniques including envelope following and frequency pass filtering. A working prototype was tested on AAU students in Copenhagen. The test was conducted to see if the prototype could answer the final problem statement. Results measured from both vehicle detection time test and vehicle identification ability test indicated that the prototype succeeded in increasing traffic awareness of the test participants. In vehicle detection time testing, all participants were able to detect incoming traffic earlier using the sound warning system than without. In vehicle identification testing, the majority of the test participants were able to detect more vehicles before the vehicle passes by using the warning system as well, with an exception of one participant who was able to detect all cars without 63 using the system. However several factors were found to have possibly induced the results being favorable towards the product prototype. Therefore an improved product, test settings and testing procedure are required to confirm the results of the conducted prototype testing. 10. Future perspectives The sound warning system prototype is not the final product, therefore it has not been fully developed and many things could be improved, which will be described in this chapter. Regarding the audio feedback, some of its aspects also have to be improved. One of the requirements was not to annoy the users, as seen from the questionnaire results in section 11.2, almost 27% of the participants found the feedback annoying. If the audio feedback avoided repetition and having consistent rhythm, it might have reduced annoyance. In order to detect the cars in the real environment, the microphone has to be implemented into the warning system. The microphone will be placed on the back of the bicycle, as cyclists are mostly not aware of cars that are approaching from behind. Once the microphone is implemented, the sound detecting software could be converted onto mobile phones as an application, making the sound warning system light and portable. The software is not yet fully developed and improvements are needed in terms of function and performance, such as how effective the sound warning system is to detect incoming traffic. The way the sound warning system in MAX MSP detects the cars is currently dependent on whether engine sounds are above a certain threshold at a specific frequency. This means that noise within the same frequency will trigger the feedback as well. In the real environment, a lot of noise might occur and therefore it is important to find a solution so that the warning system is able to only detect cars. Another problem is the variance in types of cars. Different types of cars have their own frequencies and amplitude levels, and some cars almost emit no audible sounds such as for instance electric cars, making it very difficult to program such a patch in Max MSP. Finally, there is a possibility that several cars are approaching at the same time although it is made for rural roads, therefore the ability to detect several cars have to be implemented as well. 64 11. Appendix 11.1. Methods When calculating the results several terms has to be taken into consideration such as one-tailed, null hypothesis, alternate hypothesis, p- and t- values, type 1 error, type 2 error, rejection region, critical value and t-table. T-value T-value represents the difference between the mean or average scores of two groups, while taking into account any variation of the results. It is the value that is going to be calculated and compared to the critical value in order to reject or accept the hypothesis. One-tailed If a certain outcome is expected like for instance that the treatment group is expected to give better/worse results than controlled group, a one-tailed t-test would be the approach29. This is due to a significant difference of the mean results is expected before hand. This test will disregard the possibility of a relationship to the negative or positive direction depending on what is expected19. Two-tailed A two-tailed test is used when the null hypothesis has the possibility of going both ways20. This means that a value will be on both sides of the normal distribution. An example of a two-tailed null hypothesis could be “there is no difference in terms of gender” while a one-tailed would be “men is worse than women”. This said the onetailed probability value would be half the two-tailed probability value19. Rejection region When having a one-tailed t-test it means that the rejection region will only be one sided when doing normal distribution; either negative or positive30. In this case it is expected to be on the positive side of the normal distribution as dealing with only positive numbers. If the t-value calculated is going to appear in the rejected region, then the null hypothesis will be rejected. Else if the t-value appears not to be in the rejection 29 30 http://statpac.com/manual/index.htm?turl=onetailedandtwotailedtests.htm http://www.mathsrevision.net/alevel/pages.php?page=64 65 region then it will be in the so called acceptance region, which means that the null hypothesis might be true31. The rejection region differs, therefore to be able to find out what the rejection region is according to the data that is collected then it is necessary to take a look at the critical value and the t-table. T-table The t-table or t distribution table is a table that given the degree of boredom and pvalue shows the critical value in percentages32. The critical value is the value that will determine whether to reject the null hypothesis or not. If the t-value calculated is smaller than the critical value from the t-table, the null hypothesis cannot be rejected but if the value is bigger, the null hypothesis is rejected. Null hypothesis A null hypothesis is the one assuming that there is not a difference between the measured variables. It is the hypothesis, which results are by chance33. The reason for making such a hypothesis is because it is easier to disprove and hereby support the alternative hypothesis, for instance “Using our system is worse than without the system.” By rejecting this hypothesis, it is likely that the results are influenced by a non-random cause and hereby automatically supports the alternative hypothesis to be true34. Alternative hypothesis Alternative hypothesis is the hypothesis that states that there is a relation between the measured variables. It is the hypothesis which results are influenced by nonrandom causes, meaning that it is not a coincidence21. The alternative hypothesis is what is thought as being true, although one can never prove but only assume that it is true22. Sometimes it happens that errors occur, that is when assuming something although the results show something else. This is the kind of errors called type 1 error and type 2 error or known as respectively alpha (p-value) and beta as these describes the probability of that specific kind of type error to occur21. Type 1 error Type 1 error is the error that occurs when the null hypothesis is true, but is being rejected21. Type 2 error http://stattrek.com/hypothesis-test/region-of-acceptance.aspx http://www.itl.nist.gov/div898/handbook/eda/section3/eda3672.htm 33 http://stattrek.com/hypothesis-test/hypothesis-testing.aspx 34 http://www.csulb.edu/~msaintg/ppa696/696stsig.htm 31 32 66 Type 2 error is the error that occurs when the null hypothesis is false but is being accepted as being true21. As the type errors are the opposite of each other it means that when reducing one error, the other will increase. p-value p-value represents statistical significance or the probability of error associated with the spread of the observed results to the entire population. For example, p-level = .05 shows that there is a 5% chance that the sample found in the relationship between the variables. This could the only feature of this random sample. By looking at the t-table a comparison of the real value from the table and the value gotten from the testing can conclude how reliable the test is and how significant the results are. 11.2. Questionnaire Apart from the testing itself, a questionnaire was made after the testing in order to know if the feedback caught the test subjects’ attention, to what degree the feedback was found easy/difficult to recognize and to what degree they found the feedback pleasant. A total of 30 test subjects applied their answers of the questionnaire. Making such a questionnaire will help knowing whether the feedback meets the requirements established in the design chapter. Did the sound feedback catch your attention? 0% Yes No 100% For the first question it was asked if the test subjects thought the feedback caught their attention. 30 out of 30 test subjects (100 percent) answered yes. 67 How annoying / pleasant was the sound feedback? 6% 7% Easy Neutral Difficult 87% For the second question, it was asked to what degree they could recognize the feedback. For this question they got three options: 1. Easy, 2. Neutral and 3. Difficult. Out of the 30 test subjects, 2 test subjects (7 percent) thought it was difficult to recognize. Another 2 test subjects (7 percent) found it neutral and 26 test subjects (86 percent) found it easy to recognize. How annoying / pleasant was the sound feedback? 27% 36% Pleasant Neutral Annoying 37% For the third and last question three options were given to the test subjects as well. For this question it was asked to what degree the test subjects found the feedback pleasant. According to the results, 11 test subjects (36/37 percent) found it pleasant while 11 test subjects (36/37 percent) found it neutral and 8 test subjects (27 percent) found it annoying. 68 Bibliography [1] Goldenbeld, C., Houtenbos, M., Ehlers, E., & Waard, D. D. (2012). The use and risk of portable electronic devices while cycling among different age groups. Journal of Safety Research , p. 1. [2] de Waard, D., Schepers, P., Ormel, W., & Brookhuis, K. (2010). Mobile phone use while cycling: Incidence and effects on behaviour and safety. Ergonomics, 53(1) , 30– 42. [3] What Is Considered a Normal Attention Span? (2013, 04 24). Retrieved from Wisegeek: http://www.wisegeek.org/what-is-considered-a-normal-attentionspan.htm [4] Knowles, J., Adams, S., Cuerden, R., Savill, T., Reid, S., & Tight, M. (2009). Collisions involving pedal cyclists on Britain's roads: establishing the causes. Transport Research Laboratory. [5] Jacobsen, P. L. (2003). Safety in numbers: more walkers and bicyclists, safer walking and bicycling. Injury Prevention. [6] Cognition Textbook: Chapter 3 - Attention. (2007, January 17). Stanford, CA, USA. [7] Goldstein, E. B. (2010). Sensation and Perception, Eighth Edition. Belmont , CA: Cengage Learning. [8] Lichenstein, R. (2011). Headphone use and pedestrian injury and death in the United States , 287-290. [9] Laker, L. (2012, March 26). Bike headphones that don't distract from cycling safely. Retrieved from Guardian: http://www.guardian.co.uk/environment/bikeblog/2012/mar/26/bike-headphones-music-cycling [10] Timothy J. Landis, L. C. (1996). United States Patent No. 5,691,515. [11] Mitsubishi Electric Research Laboratories. (2013, 05 07). Sound-Based Traffic Incident Detection. Retrieved from Merl.com: http://www.merl.com/areas/traffic/ [12] Bhave, N., & Rao, P. (2011). Vehicle engine sound analysis applied to traffic congestion estimation. Mumbai: Department of Electrical Engineering, Indian Institute of Bombay. [13] AudioLinks. (2013, May 7). Microphones. Retrieved from AudioLinks: https://microphones.audiolinks.com/microphones.shtml [14] AudioLinks. (2013, 04 14). Types of Microphones. Retrieved from Audiolinks: https://microphones.audiolinks.com/microphones.shtml#Dynamic [15] What is a Directional Microphone? (2013, April 14). Retrieved from LearningAboutElectronics: [16] The DV Shop. (2012, May 9). dvshop.ca. Retrieved from Useful Audio Slang: http://www.dvshop.ca/dvcafe/windsock.html [17] Crewofone.com. (2011, 08 11). Retrieved from Shotgun Microphone Windscreen Test: http://crewofone.com/2011/shotgun-microphone-windscreentest/ 69 [18] Kaushansky, K. (2012, 04 18). Designing With Audio: What Is Sound Good For? Retrieved from Smashingmagazine.com: http://uxdesign.smashingmagazine.com/2012/04/18/designing-with-audio-whatis-sound-good-for/ [19] Kortum, P. (n.d.). Design for Haptic, Speech, Olfactory, and Other Nontraditional Interfaces. Burlington: Denise E. M. Penrose. [20] Stefanics, G., Hangya, B., Hernadi, I., Winkler, I., Lakatos, P., & Istvan, U. (2010). Phase Entrainment of Human Delta Oscillations Can Mediate the Effects of Expectation on Reaction Speed. The Journal of Neuroscience. [21] Wolfe, J. M., Kluender, K. R., & Levi, D. M. (2012). Sensation and Perception, third edition. Massachusetts U.S.A: Sinauer Associates, Inc. Publishers Sunderland. [22] Immersion. The Value of Haptics . San Jose, California : Immersion. [23] Binseel, K. M. (2007). http://www.dtic.mil/cgibin/GetTRDoc?AD=ADA468389 . Retrieved april 24, 2013, from http://www.dtic.mil/cgi-bin/GetTRDoc?AD=ADA468389%20. [24] Bendtsen, H. (1999). The Nordic prediction method for road traffic noise. The Science of the Total Environment, 331-338. [25] Cerrato, G. (2009, April). Automotive Sound Quality – Powertrain, Road and Wind Noise. Sound & Vibration. [26] ICT . Gjestland, T. (2008). Background noise levels in Europe . Trondheim : SINTEF [27] Loy, G. (2007). Musimathics. Volume 2 : The Mathematical Foundations of Music. Cambridge, MA, USA: MIT Press. [28] Roads, C. (1996). The computer musical tutorial. Cambridge, Massachusetts: MIT Press. [29] cycling74. (n.d.). Retrieved 05 06, 2013, from cycling74: http://cycling74.com/products/max/physical-computing/ [30] Marshall, D., Lee, J. D., & Austria, P. A. (2001). Annoyance and urgency of auditory alerts for in/vehicle information systems. Proceedings of the Human Factors and Ergonomics Society Annual Meeting (pp. 1627-1631). Sage Publications. [31] Lazard, J., Feng, J. H., & Hochheiser, H. (2010). Research Methods in Human – Computer Interaction. John Wiley and Sons Ltd. [32] Shier, R. (2004). Statistics: Paired t-tests. Mathematics learning support centre. [33] Preece, R. S. (2011). Interaction Design Beyond Human - Computer Interaction, 3rd edition. A John Wiley and Sons, Ltd, Publication;. 70