source("./device-setup.R")
library(tidyverse)

1 Data

The data may be downloaded from the project’s page: http://ns.inria.fr/loki/WordSuggestions/.

1.1 Runs

This table contains three rows per participant, one for each of the three parts of the experiment. Each part, called a “run”, is dedicated to the use of one particular device. The columns are described below:

participant

Participant Identifier.

accuracy

The word suggestion accuracy condition.

device

The device used for the run.

device_order

The order of the device in the experiment.

run_start_date

The start date and time of the run in ISO 8601 format.

run_end_date

The end date and time of the run in ISO 8601 format (only provided for the desktop condition).

run_duration

The duration of the run in seconds.

expe_start_date

The start date and time of the experiment in ISO 8601 format.

expe_end_date

The end date and time of the experiment in ISO 8601 format.

expe_duration

The duration of the experiment in seconds.

age

The self-reported age of the participant.

gender

The self-reported gender of the participant.

swipe_typing_use

Answer for “In the last 7 days, in total, how long have you swipe typed?” This was provided once per experiment.

suggestions_use_frequency

Answer for “In the last 24 hours, how many word suggestions have you used when typing on [device]?” This was asked for laptop, tablet and phone at the beginning of the experiment.

device_use

Answer for “In the last 7 days, in total, how long have you used [device]?” This was asked for laptop, tablet and phone at the beginning of the experiment.

typing_use

Answer for “In the last 7 days, in total, how long have you typed on [device] (using any typing method)?” This was asked for laptop, tablet and phone at the beginning of the experiment.

typing_use_one_hand

Answer for “In the last 7 days, in total, how long have you typed on a phone with only the thumb of one hand?” This is only available for the phone condition.

controls_satisfactory

Agreement for “The controls (keyboard and word suggestions) are satisfactory for the completion of the task.” This was provided for each condition.

suggestion_accuracy

Agreement for “The word suggestions are accurate.” This was provided for each condition.

keyboard_use_efficiency

Agreement for “The use of the keyboard is efficient in this task.” This was provided for each condition.

suggestion_distraction

Agreement for “The word suggestions are distracting.” This was provided for each condition.

mental_demand

NASA-TLX mental demand scale. This was provided for each condition.

physical_demand

NASA-TLX physical demand scale. This was provided for each condition.

temporal_demand

NASA-TLX temporal demand scale. This was provided for each condition.

performance

NASA-TLX performance scale. This was provided for each condition.

effort

NASA-TLX effort scale. This was provided for each condition.

frustration

NASA-TLX frustration scale. This was provided for each condition.

avg_cps

The average typing speed of the participant (without suggestions) in characters per second.

avg_wpm

The average typing speed of the participant (without suggestions) in words per minute (avg_cps * 60 / 5).

min_suggestions_delay

The minimum delay before updating suggestions. This is always 150 ms.

total_suggestions

The number of suggestions shown to the participant. This is always 3.

total_blurred_trials

The number of trials during which the page was unfocused, which is an indication that the participant may have been distracted.

total_long_delay_trials

The number of trials during which at least one suggestion took more than 300 ms to be updated.

is_experiment_completed

TRUE if the experiment was completed.

is_run_measured

TRUE if the run was included in our analysis. Some runs were excluded; the column run_rejection_reason provides the reason for the exclusion.

run_rejection_reason

The reason for excluding the run from the analysis. This is only provided for runs that were excluded.
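
The relation between avg_cps and avg_wpm given above (avg_cps * 60 / 5) can be checked with a short sketch. The helper name cps_to_wpm and the sample values are illustrative only; they are not part of the project's code or data.

```r
# Convert characters per second to words per minute, using the
# five-characters-per-word convention implied by avg_cps * 60 / 5.
# cps_to_wpm is a hypothetical helper introduced for this example.
cps_to_wpm <- function(cps) {
  cps * 60 / 5
}

# Made-up typing speeds in characters per second.
example_cps <- c(2.5, 4, 6.2)
example_wpm <- cps_to_wpm(example_cps)
example_wpm
```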

read_device_runs(measured_only = FALSE)
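
Because measured_only = FALSE is passed, the returned table presumably also contains the excluded runs. The sketch below shows one way the measured runs could be separated from the excluded ones using is_run_measured and run_rejection_reason. It runs on a small invented table (toy_runs) rather than on the real data, and the rejection reason shown is a placeholder, not an actual value from the data set.

```r
library(dplyr)

# Toy stand-in for the runs table: values are invented, and only a few
# of the documented columns are included.
toy_runs <- tibble(
  participant = c("p1", "p1", "p1", "p2", "p2", "p2"),
  device = c("laptop", "tablet", "phone", "laptop", "tablet", "phone"),
  is_run_measured = c(TRUE, TRUE, FALSE, TRUE, TRUE, TRUE),
  run_rejection_reason = c(NA, NA, "hypothetical reason", NA, NA, NA)
)

# Runs kept for the analysis.
measured_runs <- toy_runs %>%
  filter(is_run_measured)

# Excluded runs, counted by rejection reason.
rejection_counts <- toy_runs %>%
  filter(!is_run_measured) %>%
  count(run_rejection_reason)
```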

1.2 Trials

This table contains data logs from all participant trials.

participant

Participant Identifier.

device

The device used for the run.

accuracy

The word suggestion accuracy condition.

trial_number

The number of the trial.

trial_id

The identifier of the trial.

phrase

The phrase the participant had to type during the trial.

is_practice

TRUE if the trial was part of a practice block.

total_chars

The number of characters in the phrase to type.

theoretical_sks

The maximum number of keystrokes that could be saved using word suggestions during the trial.

start_date

The start date and time of the trial in ISO 8601 format.

end_date

The end date and time of the trial in ISO 8601 format.

duration

The duration of the trial in seconds.

total_key_strokes

The number of keys that were pressed to complete the trial.

actual_sks

The number of keystrokes that were saved using word suggestions. This may be negative if incorrect suggestions were used.

total_suggestion_used

The number of suggestions that were used during the trial.

total_suggestion_errors

The number of incorrect suggestions that were used during the trial.

total_removed_manual_chars

The number of characters that were manually entered then deleted by the participant during the trial.

total_removed_suggestion_chars

The number of characters that were inserted from a suggestion then deleted by the participant during the trial.

total_final_manual_chars

The number of characters from the final participant input that were manually inserted.

total_added_chars

The number of characters that were inserted during the trial either from suggestion or manual input (some may have been removed later).

does_trial_have_errors

TRUE if incorrect characters were inserted at some point during the trial (they had to be removed later to validate the trial).

phrase_completion_start

The date and time at which the first character was inserted during the trial (ISO 8601 format).

phrase_completion_end

The date and time at which the phrase was fully completed and correct for the first time (ISO 8601 format). Note that this is before the trial was validated.

phrase_completion_duration

The time it took for the participant to complete the phrase in seconds.

avg_suggestion_delay

The average time it took for suggestions to be updated during the trial.

sd_suggestion_delay

The standard deviation of the time it took for suggestions to be updated during the trial.

total_blur_events

The number of times the experiment web page was unfocused during the trial.

was_trial_blurred

TRUE if total_blur_events > 0

theoretical_key_saving

The maximum keystroke saving of the trial. This should be close to accuracy.

actual_key_saving_no_editing

The keystroke saving excluding errors.

actual_key_saving

The keystroke saving.

cps

Entry speed in characters per second (i.e. (total_chars - 1) / trial_duration).

has_long_suggestion_delay

Whether at least one set of suggestions took more than 300 ms to update.

is_run_measured

TRUE if the run including this trial was included in our statistical analysis.

is_trial_measured

TRUE if this trial was included in our statistical analysis. See below for reasons to exclude a trial from our analysis.

trial_rejection_reason

The reason for excluding the trial from the analysis. This is only provided for trials that were excluded.

trials <- read_device_trials(measured_only = FALSE)
trials
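
Two of the columns described above are derived from others: cps from total_chars and the trial duration, and was_trial_blurred from total_blur_events. The sketch below recomputes both on an invented table (toy_trials). It assumes that trial_duration in the cps formula corresponds to the duration column, which the descriptions above do not state explicitly.

```r
library(dplyr)

# Toy stand-in for the trials table: values are invented.
toy_trials <- tibble(
  total_chars = c(21, 31, 26),
  duration = c(10, 12, 5),        # seconds
  total_blur_events = c(0, 2, 0)
)

toy_trials <- toy_trials %>%
  mutate(
    # Entry speed: (total_chars - 1) / trial duration.
    # Assumption: "trial_duration" is the duration column.
    cps = (total_chars - 1) / duration,
    # A trial is blurred if the page lost focus at least once.
    was_trial_blurred = total_blur_events > 0
  )

toy_trials
```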

1.3 Events

This large table contains all event logs recorded during our experiment. It includes the following columns:

participant

The participant identifier.

accuracy

The accuracy condition.

device

The device condition.

trial_id

The trial identifier.

event_number

The number of the event in the trial.

type

The type of the event. INIT: trial initialization, UPDATE_SUGGESTIONS: update of the suggestions, INPUT_CHAR: manual insertion of a character by the participant at the end of their input, DELETE_CHAR: removal of the last input character by the participant, INPUT_SUGGESTION: insertion of a suggestion.

input

The content of the participant’s input after the event.

is_input_correct

Whether there are no errors in the input after the event.

added_input

The characters added to the input as a result of the event.

removed_input

The characters removed from the input as a result of the event.

total_added_chars

The number of characters added as a result of the event (may be negative).

remaining_key_strokes

The number of characters left to complete the phrase.

is_target_completed

Whether the phrase is completed.

request_time

For events resulting from a web request (in particular updates of word suggestions), the moment the request was sent to the server (ISO 8601 format).

response_time

For events resulting from a web request (in particular updates of word suggestions), the moment the response from the server was received (ISO 8601 format).

time

The date and time of the event (ISO 8601 format).

diff_time

Duration between this event and the previous one.

is_run_measured

TRUE if the run including this event was included in our statistical analysis.

is_trial_measured

TRUE if the trial including this event was included in our statistical analysis.

is_event_measured

Whether this event was included in our statistical analysis.

event_rejection_reason

The reason for excluding the event from the analysis. This is only provided for events that were excluded.

target_word

The next word to type or the word currently being typed.

target_word_number

The number of the target word in the phrase.
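
As an illustration of the diff_time column described above, the following sketch computes the time elapsed since the previous event within each trial. It uses an invented table (toy_events), and it assumes that diff_time is expressed in seconds and is missing for the first event of a trial; neither point is stated in the column description.

```r
library(dplyr)

# Toy stand-in for the events table: values are invented.
base_time <- as.POSIXct("2020-01-01 10:00:00", tz = "UTC")
toy_events <- tibble(
  trial_id = c("t1", "t1", "t1", "t2", "t2"),
  event_number = c(1, 2, 3, 1, 2),
  time = base_time + c(0, 0.5, 1.25, 60, 62)
)

toy_events <- toy_events %>%
  group_by(trial_id) %>%
  arrange(event_number, .by_group = TRUE) %>%
  # Time since the previous event of the same trial, in seconds.
  # The first event of each trial has no predecessor, hence NA.
  mutate(diff_time = as.numeric(difftime(time, lag(time), units = "secs"))) %>%
  ungroup()

toy_events
```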

2 Monitoring

2.1 Total trials

trials %>% filter(is_run_measured, !is_practice) %>% count(is_trial_measured)

2.2 Measured Trials with errors

trials %>% filter(is_trial_measured) %>%
  count(does_trial_have_errors)
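
The count above can also be expressed as a proportion per device. The sketch below is one possible way to do so, shown on an invented table (toy_trials) so that it runs on its own; the same pipeline could presumably be applied to trials.

```r
library(dplyr)

# Toy stand-in for the trials table: values are invented.
toy_trials <- tibble(
  device = c("laptop", "laptop", "phone", "phone", "phone", "tablet"),
  is_trial_measured = c(TRUE, TRUE, TRUE, TRUE, FALSE, TRUE),
  does_trial_have_errors = c(FALSE, TRUE, TRUE, TRUE, FALSE, FALSE)
)

# Share of measured trials containing at least one error, per device.
error_rates <- toy_trials %>%
  filter(is_trial_measured) %>%
  group_by(device) %>%
  summarise(
    n_trials = n(),
    error_rate = mean(does_trial_have_errors),
    .groups = "drop"
  )

error_rates
```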

2.3 Blurred Trials

The following trials were removed because the participant was interrupted (the page was unfocused).

trials %>% filter(is_run_measured, !is_practice) %>%
  count(was_trial_blurred)

2.4 Trials with long suggestion update times

trials %>% filter(is_run_measured, !is_practice) %>%
  count(has_long_suggestion_delay)
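
The sketch below illustrates how per-trial delay summaries such as avg_suggestion_delay, sd_suggestion_delay and has_long_suggestion_delay could be derived from suggestion update events. It is not the project's actual computation: it assumes that the delay of an update is response_time minus request_time, and it uses an invented table (toy_updates). Only the 300 ms threshold comes from the column descriptions.

```r
library(dplyr)

# Threshold from the column descriptions: 300 ms, expressed in seconds.
long_delay_threshold <- 0.3

# Toy stand-in for UPDATE_SUGGESTIONS events: values are invented.
base_time <- as.POSIXct("2020-01-01 10:00:00", tz = "UTC")
toy_updates <- tibble(
  trial_id = c("t1", "t1", "t2", "t2", "t2"),
  request_time = base_time + c(0, 1, 10, 11, 12),
  response_time = request_time + c(0.125, 0.25, 0.125, 0.5, 0.25)
)

delay_summary <- toy_updates %>%
  # Assumption: the update delay is response_time - request_time.
  mutate(delay = as.numeric(difftime(response_time, request_time, units = "secs"))) %>%
  group_by(trial_id) %>%
  summarise(
    avg_suggestion_delay = mean(delay),
    sd_suggestion_delay = sd(delay),
    has_long_suggestion_delay = any(delay > long_delay_threshold),
    .groups = "drop"
  )

delay_summary
```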
