The Voice of an Intersection

The intersection is a complex environment whose interface uses colors and symbols to tell people when to go and when to stop; confusing these commands could be life-threatening. Adding a voice component would improve the intersection’s interface.

The intersection and its interface have two main users: drivers and pedestrians. It is vital that both understand the interface and the commands it gives. Humans are easily distracted, which is dangerous when the interface depends entirely on visual cues.

The improved interface would add an app that is built into new cars and available for smartphones. The app would be linked to the traffic light system in real time, allowing it to tell the driver about the intersection verbally. It would announce the current command of the traffic lights: “go” or “prepare to stop.”

We observed that many of the people walking had headphones in while crossing the intersection. This could be very dangerous because hearing the different sounds of the intersection is important to safety. For example, we heard car engines idling, brakes, and engines revving. With headphones in, all of this information is lost. So, the new interface app would interrupt the music and say either “Safe.” or “Not safe to cross.”
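As a rough illustration of the idea, the mapping from signal state to spoken phrase could be sketched as follows. The phrases come from the description above; everything else (the `Light` states, the function names, and the choice to treat yellow the same as red for drivers) is an assumption made for this sketch, not part of any real traffic-signal system.

```python
from enum import Enum


class Light(Enum):
    """Hypothetical states of the light facing the driver."""
    GREEN = "green"
    YELLOW = "yellow"
    RED = "red"


def driver_message(light: Light) -> str:
    """Return the phrase the in-car app would speak to the driver.

    Assumption: only green means "go"; yellow and red both map to
    "prepare to stop", since the proposal names just two phrases.
    """
    return "go" if light is Light.GREEN else "prepare to stop"


def pedestrian_message(walk_signal_on: bool) -> str:
    """Return the phrase that would interrupt a pedestrian's music."""
    return "Safe." if walk_signal_on else "Not safe to cross."


if __name__ == "__main__":
    print(driver_message(Light.YELLOW))
    print(pedestrian_message(False))
```

Keeping each role to two short phrases mirrors the proposal: the listener only needs to know whether to proceed.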

The voice itself would match the emotion and mood of the traffic lights: a very calm, steady tone that is neutral and signals caution at a low intensity. It would be a feminine synthetic voice, much like Siri. The interface would have these traits so that it is perceived as friendly, intelligent, and unobtrusive.


Audible Crosswalk

One of the most frequent activities I witnessed was people trying to cross the street to get to their destination. Although the main task on reaching the street is to cross it, most people were multitasking, with a phone in hand or headphones on their ears. For these multitasking pedestrians, a speech-directed interface would make crossing the street less dangerous.

I observed a woman walking with a friend while texting on her phone as they approached the crosswalk. The crosswalk signal showed a red hand, telling pedestrians not to walk, but the woman did not see the warning and kept walking until her friend stopped her. A speech-directed interface could have warned the woman that it wasn’t safe to cross without her having to look up from her phone, and her friend wouldn’t have had to intervene. A few minutes later, a man who seemed to be in a rush completely ignored the traffic signal and hurriedly crossed the street. He safely reached the other side, but this was still a dangerous situation. A speech-directed interface would discourage people from ignoring traffic signals and make crossing the street safer.

“Red Hand”

The “voice” of the interface should be low-pitched and firm, saying “You should not cross” or “It is safe to cross” depending on the state of the signal. The voice should be firm in order to get the attention of people who might not be paying attention to the visual signals. Also, people wouldn’t have to look up from what they’re doing to know when it is safe to cross, although it’s always safer to look before you cross. I visited an area where the crosswalk signals have a sound-directed interface, but as someone who had never seen or heard of one, I didn’t know which sounds meant it was safe to cross and which meant it wasn’t. To me, it sounded like a random series of beeps, and I couldn’t decode the message. Therefore, I think a speech-directed interface is the best option for making crossing the street as safe as possible.

Battling Distracted Crossing

At Georgia Tech, one of the busiest places on campus is Technology Square. Even at nine in the morning, this area is bustling with people, all of whom are on different missions. There are people walking, driving, eating, studying, exercising, talking, and even listening to music. However, where the roads meet at 5th and Spring, all individuals, whether by car or on foot, come together to safely complete one common task: crossing the street. While widely overlooked, this task is crucial to the proper functioning of the intersection.

As the name suggests, technology is important in this space. Certain safety features, such as the stoplights and the crosswalk lights, rely on it to visually communicate information that helps people complete the task. However, with the increasing dependence on cell phones and other devices that demand visual attention, these safety features are becoming less effective. Fortunately, this can be combated by adding safety features that rely on vocal cues.

Pedestrians crossing the street at Tech Square.

For distracted pedestrians and for pedestrians who are blind or have other disabilities, the addition of a voice interface could prevent many accidents. Sensors would detect the presence of a person near the intersection, and the voice would tell that person when it is safe to cross. This would reduce not only the danger in the intersection but also the anxiety of both pedestrians and drivers. For drivers, a voice interface similar to that of a GPS would provide an additional safety measure when approaching an intersection. The system would use the driver’s position, information from the upcoming stoplights, and sensors near the crosswalk to alert the driver about changing lights and nearby pedestrians. This information would be delivered vocally through the car radio, allowing the driver to keep their eyes on the road ahead.
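To make the driver-side logic more concrete, here is a minimal sketch of how the three inputs described above (driver position, stoplight information, and crosswalk sensors) might be combined into spoken alerts. The function name, the thresholds, and the alert wording are all hypothetical choices for illustration; a real system would need its own tuning and data sources.

```python
from typing import List


def driver_alerts(
    distance_to_light_m: float,
    seconds_until_change: float,
    pedestrian_detected: bool,
    alert_radius_m: float = 150.0,   # assumed: only speak when this close
    change_warning_s: float = 5.0,   # assumed: warn this long before a change
) -> List[str]:
    """Return the phrases to speak through the car radio, in order."""
    # Stay silent until the driver is actually approaching the intersection.
    if distance_to_light_m > alert_radius_m:
        return []

    alerts = []
    if seconds_until_change <= change_warning_s:
        alerts.append("Light changing soon.")
    if pedestrian_detected:
        alerts.append("Pedestrian near the crosswalk.")
    return alerts


if __name__ == "__main__":
    for phrase in driver_alerts(80.0, 3.0, True):
        print(phrase)
```

Returning an empty list when the driver is far away keeps the voice quiet unless it has something relevant to say, in line with the goal of providing only the necessary information.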

When choosing a voice for both interfaces, similarity attraction is important, because “people like voices that manifest personalities that are similar to their own” (Nass 41). At the intersection, pedestrians and drivers have a purpose and are determined to reach their destination; thus, it would make sense for the voice to be professional and straightforward, providing only the necessary information. It should be a male voice with a relatively high volume, a slightly deep pitch, a medium pitch range, and an average speech rate. These characteristics would ensure that the voice comes across as knowledgeable and trustworthy but not intimidating. This is important because it keeps users from feeling that they are being ordered around; the voice would establish a sense of equality between the user and the interface, much like the stereotypical copilot mentioned in Wired for Speech. It differs from that copilot, however, in that its slower speech rate and higher volume would ensure the instructions are clearly understood, which is crucial in a chaotic intersection.

In the future, the integration of a voice interface that connects drivers to pedestrians and provides the user with a comfortable experience would increase the safety and efficiency of the intersection at Tech Square.
