?
And how we’re going to justify each claim.
Use PCA to do a point-wise comparison of paths, based on their forward-velocity profiles. This is one way of showing that real/cowl/vrwalk are alike and joystick/wip are much different.
Should we do all 6-choose-2 combinations, even if we only choose to look at a small subset of this data? Or would that be cumbersome? If i had to pick a sub-set, I’d do this:
These are mostly for Jeff.
These are mostly for someone besides Jeff
Ugh. The post-question data is uglier than i anticipated. many of the form elements were filled out incorrectly or not at all.
The relative-ranking questions (1–8) in part II may not be very useful.
It’s the part III responses (9,11,13) that i think will be most helpful. This is where people rated the 3 VR interfaces on an absolute scale, from 1 to 7. These are in columns AD-AF, AG-AI, and AJ-AL of the spreadsheet. We’ll want to look for a correlation between how people rated the 3 VR conditions and how they actually performed (i.e. exposure, stopping-point consistency, and all the other computed measures).
Since they’re rating these on an absolute scale (and the range of values that people choose will vary from person to person) some sort of normalization will be needed in order to put all the responses on comparable scales. I don’t know how survey designers deal with that problem, but i can try to take a stab at it:
For each question (general performance; move to where you wanna go; stop where you wanna stop) there are 3 items (VRWalk, Joystick, WIP). So we have a set of 3 values on a scale of 1 to 7. We can take the mean of the 3 values and subtract that out, which will give a common origin to the scales.
Then there remains some amount of variation in the magnitude of the responses (for instance, some people used the full spectrum, 1 to 7, and other people used just a small portion of it (2,3,4)). It seems to me that we don’t want to eliminate this kind of variation, since it gives some indication of whether people felt that there was a huge difference or a minor difference between the conditions.
Does this sound right? The only thing we’d do is re-center the 3 scales around the mean for each subject for each question?
So, for example, if the responses to “move where you wished to go” were {1,5,6} for {vrwalk,joy,wip} for a particular subject, we’d subtract out the mean (4), and get {-3,+1,+2}.
For the pre-questionnaire, the gender data is straight-forward. The video-game hours data is slightly less so. We ask the following:
So there’s the counfounding factors of gaming experience and amount of joystick usage within the gaming experience. Are we interested in separating the gamers from the rest or separating the joystick-gamers from the rest? This will make a big difference. Some people played 40+ hours but have 0% joystick; others play less than 5 hours but have 100% joystick.
It seems like a reasonable way to estimate joystick experience specifically would be to multiply the number of hours by the joystick percentage. We may want to do our correllations with two different measures: one of game-hours times joystick%, another of just game-hours alone.
Also, I didn’t realize that we were asking about MAX video game hours per week. This data is somewhat useful, but I think next time we should ask for average hours instead. What we’re really after is the total amount of video game experience, of which the average is a better indicator than the max.
Chris,
Mary and I are working on generating as much information as we can, but it would be helpful if we had your suggestions on exactly which bits of information will be most relevent to the paper. It’s also helpful to know which bits of information are NOT worth looking in to. For instance, it’s probably not helpful to look for a correlation between the gaming-hours-with-joystick measure and the exposure values for VRWalk (though, of course, we do want to compare gaming-hours-with-joystick to the exposure values for Joystick).
So while you’re writing the outline, as you think of questions that you’d like to “ask” of the dataset, please pass them along to us or make a list at the bottom of the wiki page.
https://wwwx.cs.unc.edu/~feasel/wiki/VR06Paper
Doing this will really help to focus the analysis.
—Jeff
—I’ve got the macros and the input and output all working properly.
—I need to visit Chris Wiesen one more time on Monday to verify that I have written
the contrast and estimate statements correctly for the V4 data.
## WHAT I NEED TO KNOW:
##* A. which dependent variables (dv) are we going to do analysis on?
##* B. what pairwise comparisons of conditions do we want to do?
## WHAT I’LL NEED FROM JEFF/MATLAB:
##* per path data for the DVs we want to analyze including (as we worked on Friday)
the trial number and the barrier number for each data record.
WHEN we have A. above, there will need to be lines added to the code to call the main macro once for each dv and line that will add the data for those calls to the macro to the final output data files.
This will likely take about an hour, then we can look at the data and try to figure out whether we can collapse trials/barriers to simplify our analysis.
It would be great to do this Monday afternoon.
I need to get Chris W. to write me a macro that will subset the input data into data to input to the icc. (There is such a program for V3 (icc_V3Setup) in the V3_2005/Program folder)
The .doc file shows the input data file format: three columns
subject
In the data in the .doc file, they are peakdecel_acc for conditions 2 and 4
For V3, the ICC runs on data for one corner or wall and one dv. Since we haven’t shown yet that we can combine data from more than one wall/corner, that’s the way it has to be so far.
So, for V4, it will be one trial/barrier until we show we can combine subsets based on the outcome of the ANOVAs. With the level of granularity of the current program, there will be a zillion of them
(24 subjects x 3 trials x 4 barriers) = 288, then X N for the number of pairs we want to compare.
I NEED TO KNOW:
—what things do we want to run correlations on?
I NEED: file of data (excel or cvs) from the forms for any items we want to run comparisons on.
The icc’s will explode into an unmanageable task if we don’t think carefully about what we want to test before we start.