4. FACTOR ANALYSIS

4.1 Definition

Factor analysis is a statistical procedure used to uncover relationships among many variables. This allows numerous intercorrelated variables to be condensed into fewer dimensions, called factors. In the context of this research, the variables are the degree of agreement with various specific attitude statements (from parts B and C of the survey), and the factors are the general underlying attitudes. The new factors are used as explanatory variables during choice modeling (see section 5). The factor analysis for this research was conducted using the statistical package SPSS/PC+, version 4.0.

The common factor analysis procedure is briefly described below. For a thorough description of factor analysis see Rummel (1970). In the procedure, each variable is assumed to be a linear combination of some number of common factors and a unique factor:

Zj = [SIGMA]k(ajkSk) + ajuSju

where
Z - standardized variable,
a - factor loading,
S - common factor or factor score,
j - index for variables,
k - index for factors, and
u - denotes the unique portion.
Assuming that the factor scores are standardized by factor and that the unique portion of a variable is not correlated with the common portion, vector algebra can be used to solve iteratively for the factor loadings in the following equation:

Rmxm - Imxm + H^2mxm = Fmxp F'pxm

where
Rmxm - correlation matrix,
Imxm - identity matrix,
H^2mxm - diagonal matrix of communalities,
Fmxp - factor loadings (a) matrix,
F'pxm - transpose of F,
m - number of variables, and
p - number of factors.
Since Zj is a standardized variable, its variance is equal to one. The variance of Zj can be split into two parts, the communality, h^2j, and the uniqueness, u^2j, which are calculated as follows:

In matrix form, this can be represented as Imxm = H^2mxm + U^2mxm, where I is the identity matrix and H^2 and U^2 are diagonal matrices of the communalities and uniquenesses, respectively. This equation allows one to find the unique portion for each variable. Using all this information, the factor scores, which are the underlying dimensions, can be computed (Rummel 1970). The factor scores, thus, have the common information contained in each separate variable and can be used in place of the numerous original variables.

4.2 Methods

There are a number of practical issues involved in conducting factor analysis. The question of whether factor structures should be theory-based or data-based depends on whether the analysis is confirmatory or exploratory. Small sample sizes may negatively affect the outcome of the factor analysis procedure. Before performing the factor analysis, one must decide how to handle missing data and the "not applicable" responses. While performing the factor analysis, there are three decisions to be made: the method of factor extraction, the type of factor rotation, and the number of factors to be used. For each of the options, trial factor analyses were run using different methods. The methods used for the final solution were chosen primarily on the interpretability of the resulting factors. Each of these decisions is discussed further below.

A fundamental problem concerns the formation of the factor structure. While the variables to be factor analyzed must be correlated in some way, researchers are divided over how the variables should be related to each other. Mulaik (1993) suggests that the factor structures based on theory should be set up prior to conducting a factor analysis which would prove or disprove that theory. On the other hand, Velicer and Jackson (1990) state that both confirmatory and exploratory analysis, the latter occurs when factor structures are based on data, have important roles to fill. In the factor analyses presented in this study, survey questions were designed to incorporate the drives and constraints from telecommuting decision theory (section 2.3). However, the nature of the research itself is exploratory, so the data ultimately suggests the factor structure.

Researchers have also given guidelines for the minimum sample size needed to conduct factor analysis. Some have suggested the ratio of sample size to number of variables as a criteria: the recommendations range from 2:1 through 20:1. Others have suggested a minimum sample size of 100 to 200 observations (Guadagnoli and Velicer 1988). Guadagnoli and Velicer (1988) found that absolute sample size was more important than functions of sample size in determining stable solutions. Small sample sizes may affect the factor analysis by making the solution unstable: the addition of more data may cause the variables to switch from one factor to another (Guadagnoli and Velicer 1988). For the set of questions from part B, the ratio of sample size to the number of variables is 8.8:1 for the solution shown in section 4.3. However, the sample size of 97 lies just below the suggested minimum sample size which indicates that the small sample size may be a problem. For the part C questions, the ratio is 9.7:1 and the sample size is 291 due to the rearrangement of the data (see section 4.4), so sample size effects are not likely to be present.

Although there is little missing data (0.5% and 0.8% of the responses are missing for the job satisfaction and work attitudes questions, respectively), the handling of the missing information may affect the factor analysis procedure. Most of the missing data in parts B and C was caused by respondents incorrectly interpreting the instructions and failing to answer each question for all three workplaces. SPSS/PC+ provides two options for handling missing data: mean substitution and pairwise deletion. For mean substitution, the missing value is replaced by the average value for the sample. In pairwise deletion, the correlation coefficients are calculated using only the valid data, thereby retaining much of the information that would have been lost by deleting each case that had missing data. Although a few of the solutions calculated using pairwise deletion looked promising, the computer package did not allow the saving of factor scores when using this procedure. As a result, mean substitution was used to replace the missing data.

In each group of questions, there are two questions that provide a "not applicable" response. For the job satisfaction group, the questions refer to clients (B3F) and supervising of others (B3M), and for the work attitudes, the questions ask about dependent care (C21) and household activities (C23). Originally, responses are coded 1 through 5 from "strongly disagree" to "strongly agree" with the not applicable response coded as a 6. If the value of 6 were retained, those selecting not applicable would be seen as agreeing with the statement more strongly than those marking "strongly agree". The three possible methods to deal with this problem are to recode the not applicable response as neutral (which has the value of 3), to substitute with the mean, or to exclude the questions from the factor analysis. Recoding as neutral has some theoretical validity because if the question does not apply to the respondent, s/he would likely have neither a positive nor a negative opinion about it. However, this will reduce the variation in responses to these questions since more responses would be grouped near the middle of the scale. As a result, the significance of the strong opinions of those who did express an attitude may be reduced. Using means substitution would treat the not applicable responses as missing data. Unfortunately, this approach also reduces the variation in response and has the additional effect of supplying the "not applicable" respondents with an non-neutral opinion. Removing the not applicable questions from the factor analysis altogether may lead to less interpretable factor structures and/or diminish the completeness of the common factor space. For the job satisfaction questions, excluding the not applicable questions provided the most interpretable factor solution. However, for the work attitudes questions, the factor solutions with the not applicable response recoded as neutral provided more interpretable factors (and appeared logical), so the recoding method was chosen instead.

Two common methods of factor extraction are principal component analysis and principal axis factoring. In the first method, the communality (h^2) is assumed to equal one, and, therefore, the uniqueness is equal to zero (see above). This is simply a linear transformation of the variables that assumes the factors will explain all of the variance in each variable. The second method, on the other hand, relaxes this assumption and allows the unique portion to be non-zero. As a result, the factor loadings are higher which leads to greater interpretability since the unique portion does not have to be taken into account by the factors. However, this method also is affected by factor indeterminacy, which may cause substantially different factor interpretations to be obtained from the same original data. This weakness is particularly a problem in exploratory analysis where the factor structures are determined by the data set (Velicer and Jackson 1990). In this study, the solutions found through principal axis factoring are similar in form to those found by principal component analysis, thus factor indeterminacy is not likely to be important. Since principal axis factoring generally provides more interpretable solutions, it was chosen as the extraction method.

The two methods of factor rotation investigated in this study are orthogonal and oblique rotation. Rotation is used to reorient the factor loadings so that the factors are more interpretable. Orthogonal rotation assumes that the factors are at right angles to each other; in other words, the factors are not correlated. For example, if the factor loadings from a set of variables are plotted on a two dimensional set of axes, the variables that load on one factor would lie along one axis, and the variables that load on the other factor would lie along the other. The VARIMAX rotation option in SPSS/PC+, which tries to minimize the number of variables that load highly on a factor, uses the orthogonal assumption. Oblique rotation relaxes the assumption that the factors must be orthogonal. In this method, one set of variables may lie along an axis, while the other set may lie along a 45 degree angle to the axes. Allowing for correlations between the factors often simplifies the factor solution since many attitudinal dimensions are, in fact, likely to be correlated. SPSS/PC+'s OBLIMIN procedure uses oblique rotation (Norusis 1990b). (Note: Unlike orthogonal rotation, the pattern matrix and the structure matrix are not equal after oblique rotation. However, only the pattern matrix need be examined since it allows for the easiest interpretation of factors (Rummel 1970)). The pattern matrices found using oblique rotation are more interpretable than the orthogonal rotation solutions, with fewer variables loading significantly on more than one factor. Thus, the oblique rotation method is used in the final factor analyses.

The final decision to be made when conducting factor analysis is to determine the number of factors. One rule of thumb is to use an eigenvalue of one as the cut-off value. That is, all factors in a particular solution must have eigenvalues greater than one. Also, one can look at the scree on a plot of eigenvalues against the number of factors. The point at which the eigenvalues begin to level off can also be used as a cut-off point (Velicer and Jackson 1990). However, perhaps the best method in an exploratory approach is to use the eigenvalue and scree cut-off points as general guides to the dimensionality of the factor space and let the interpretability of the factors indicate the exact number of factors to retain. For example, if a solution has a factor with an eigenvalue close to one but one or more factors do not have an obvious meaning, then the solution may be reasonably discarded. In the factor analyses conducted in this research, the eigenvalue cut-off of one was used in choosing four factors for the job satisfaction variables. But, for the work characteristics variables, a solution in which one of the six factors had an eigenvalue slightly greater than one was excluded since it had a uninterpretable factor (only one variable had a significant factor loading). Instead, the five factor solution was chosen.

4.3 Part B Factors: Job Satisfaction

Job satisfaction can be defined as "a pleasurable or positive emotional state resulting from the appraisal of one's job or job experiences" (Locke 1976). Since the work environment often influences a worker's emotional state, job satisfaction is a likely determinant of the preference to telecommute. The set of questions focusing on job satisfaction are located in part B, question 3 of the survey (see Appendix A). In all, there are fourteen questions dealing with the respondent's job. Of the nine job satisfaction components described in Locke (1976), five are used as the basis of questions in this section: work, recognition, promotion, supervision, and co-workers. Of the other four components, pay and benefits are assumed to be the same for all work locations, and working conditions and management are taken into account in the part C questions.

The factor analysis procedure produced four factors that explain 51.0% of the variation in the job satisfaction questions. Table 4-1 shows for each variable the factor loadings that are greater than 0.25. The negative loadings are caused by questions that are negatively oriented to the factor, thus ineffective supervisor communication (NSUPCOMM) has a negative loading on the supervisor factor. Both positively and negatively oriented questions were used in the survey to minimize an automatic response bias by the respondents. The correlation between the supervisor and frustration factor is 0.45 and between the satisfaction and frustration factor is 0.34. All other factor correlations are less than 0.3. The presence of frustration factor in the high correlations is not surprising since this factor has the lowest loadings making it less well-defined compared to the other factors.

Table 4-1. Part B Factor Analysis Pattern Matrix
 
Supervisor Satisfaction Interaction Frustration
B3E: WORKWSUP
B3A: NSUPCOMM
B3I: NSUPAPP
B3D: WORKTEAM
B3B: PROMOTE
1.00
-0.67
-0.41
0.38
0.38


-0.29
 
 
B3G: SATISFY
B3H: ACCOMPL
 
0.96
0.88
 
 
B3J: COWRKRS
B3K: CONFIDEN
 
 
-0.85
0.50
 
B3L: NEWJOB
B3N: TEDIOUS
B3C: RESOURCE
 
-0.26
 
0.56
0.46
0.43

The first factor deals with the respondent's interaction with the supervisor. The self-rating of how well the respondent works with the supervisor has the highest loading. Next comes poor supervisor communications which is loaded negatively. This means that good communication (a negative attribute rating) would contribute to a positive score on this factor indicating a good relationship with the supervisor. The other important variables (not being appreciated by the supervisor, having an effective work team, and having the opportunity for promotion) also involve supervisor relations.

The second factor can be interpreted as the satisfaction with the work performed by the respondent. Here, the highest-loading variables are the overall satisfaction with work and having a sense of accomplishment. Lack of supervisor appreciation and likelihood to look for a new position have much lower loadings. An employee's satisfaction is likely to be related to the attention given by the supervisor. Finally, satisfaction with the current job is likely to be negatively associated with the desire to search for a new one.

Workplace interaction effects load on the third factor. The negative sign on the loading for not getting along with co-workers means that getting along well with co- workers (i.e., a strongly negative attribute rating) will contribute to a high positive score on this factor. The other variable, confidence in the ability to do the job, may also reflect interaction effects. Those who are confident in themselves often work well with others (for example, they may be less apt to blame others for their own faults, which would lead to interpersonal conflicts).

The final factor has been interpreted as the frustration the respondent feels toward the job. The most important loading is for the variable on the likelihood of looking for a new job, which can be associated with job frustration. Also loading on this factor, the belief that the job is tedious and boring reflects feelings of disappointment with the position. Finally, the lack of resources to complete assigned tasks can also be associated with frustration on the part of the respondent.

Just as the means were calculated for each separate question in part B (see section 3.4), the mean factor scores can also be computed (see Table 4-2). The mean scores on each factor across the sample will be zero because factor scores are standardized during the factor analysis process. Thus, the values presented in Table 4-2 can be interpreted as the number of standard deviations (where the standard deviation is calculated on the entire sample) from the overall sample mean. Overall, home-based telecommuters rate their supervisors more highly than the other groups. Prospective telecenter users are the most satisfied with their position, and home-based telecommuters are the next most satisfied. Social and professional interaction effects are highest for the non-telecommuters and lowest for the center users-to-be. Finally, non-telecommuters are more frustrated with their job than the other two groups. An analysis of variance was conducted on the means for each factor; however, no significant differences between study groups were found using a level of confidence of 0.1. The small sample size of the control groups may have affected the outcome of the analysis of variance.

Table 4-2. Mean and Standard Deviation for Part B Factor Scores
Factor Study Group
Center Home Non All
Supervisor -0.03 (0.98) 0.18 (0.82) -0.04 (0.99) 0.00 (0.95)
Satisfaction 0.11 (0.98) -0.13 (1.15) -0.24 (0.80) 0.00 (0.98)
Interaction -0.11 (0.87) 0.13 (1.00) 0.23 (0.95) 0.00 (0.91)
Frustration -0.06 (0.85) -0.04 (0.64) 0.22 (0.87) 0.00 (0.82)

4.4 Part C Factors: Work Attitudes

For part C, each of the thirty questions about a work characteristic was rated according to three work locations: the regular workplace, the telecommuting center, and home (see Appendix A). If these responses were factor-analyzed as 90 separate variables for each respondent, it is likely that the same question would load on different factors for each of the three workplaces, which would make interpretation difficult. To prevent this, the 90 responses for each individual were treated as three sets of 30 responses on parallel questions. It is as if the sample size were three times as large, with each respondent only answering the questions for one of the three workplaces. This procedure assumes that it is reasonable to treat the factor structure as identical across the three workplaces (i.e., respondents view all three workplaces in terms of the same set of dimensions). The responses for all three workplaces were thus combined in the factor analysis so that the variables composing each factor represented all work locations (see Table 4-3). After the factor analysis was complete, the factor scores were separated so that each respondent has a set of five factor scores for each of the three workplaces. There are higher correlations between factors for the work attitudes than for the job satisfaction questions. The correlation between the work ethic and supervisor concerns factors is 0.56 and between the personal benefits and family factors is 0.50. Other correlations are all below 0.3.

The first factor includes many of the direct benefits to the employee at the workplace. The most important variable here is the respondent's desire to dress how s/he likes. The next two variables with negative loadings indicate the benefits of a workplace free from distractions and with a hassle-free commute. Other important benefits to the respondent are protecting the environment and having control over the condition of the workplace. Also, cost savings and stress at the workplace are associated with this factor. The remaining significant loading variables, the workplace would have a professional appearance and opportunity for social interaction (both loading negatively), each load more strongly on other factors, and their association with the personal benefits factor, although of the expected sign, is weak.

Table 4-3. Part C Factor Analysis Pattern Matrix
 
Personal Benefits Work Ethic Responsibility Family Supervisor Concerns
C27: DRESS
C5: DISTRACT
C18: COMMHSSL
C12: ENVIRON
C16: CONTROL
C24: SAVMONEY
C17: COST2MCH
0.63
-0.58
-0.56
0.56
0.55
0.40
-0.37

-0.38




0.32


-0.41
0.38

0.39
-0.32
 
C4: PROFAPPR
C30: EFFECTIV
C1: MOTIVATE
C15: HMNWKSEP
C14: NOEQPMNT
C7: INDULGE
C2: STRSSFUL
C23: HHCNFLCT
-0.30




-0.43
0.65
0.65
0.63
-0.63
-0.53
-0.49
-0.49
-0.41
 
 
 
C22: RESULTS
C9: INDEPEND
C20: SCHEDULE
 
 
0.57
0.52
0.48


0.31
 
C21: DEPNDCAR
C6: NOFREETM
C28: BALWKNHH
C19: SCKDISBL
C10: ERRANDS
 

0.31




0.26
0.79
-0.49
0.45
0.43
0.28
 
C29: SUPCOMM
C13: NOTVISBL
C25: DISCIPLN
C11: NOSPACE
C26: NOPROFIN
C3: SUPUNCOM
C8: SOCINTER






-0.31






0.32
 
 
0.70
0.63
0.51
0.51
0.50
0.44
-0.36

The second factor can be interpreted as the respondent's work ethic, that is, the desire to work effectively and competently. The loadings on professional appearance, working effectively, and being motivated translate into a strong work ethic. The ability to keep home and work activities separate and minimize household conflicts are also significant to this factor. Both frustration over not having necessary equipment and fear of indulging in unhealthy activities also point to a strong belief in working hard. Not surprisingly, one who wishes to work hard wants to have a less stressful workplace and be free of distractions. For the second lowest loading, lack of free time is associated with a strong work ethic which may mean that those who work hard do not have extra time. And finally, having enough opportunity to interact socially with co-workers is related to this factor; however, those with a strong work ethic are likely to be satisfied in this regard since a choice to concentrate on work activities may also mean a choice to minimize social interaction.

Work responsibility, the third factor, has the fewest number of significantly loading variables. Responsibility is shown here by the desire to have work judged by the results rather than the presence of the employee at the work site. In addition, respondents include the need for independence in daily activities and the freedom to schedule work tasks as variables that describe this factor. Having control over the work environment also expresses responsibility for work activities. The final and weakest-loading variable, the ability to run errands, does relate to responsibility; however, here the issue is most likely taking care of household, rather than work, responsibilities.

The family drive is present in the fourth factor. The ability to handle dependent care has the highest loading. Having free time, the next highest loading, is an important component for family life. Also, balancing work and home activities and working while sick or disabled are important to the well-being of the household. The ability to run errands on the way to and from work can be important to the family: for example, the errand may be picking up a child from school. Other loadings include the desire to have an easy commute, which can prevent the transfer of work stress to the home, and benefitting the environment, so that family members are not adversely affected by the respondent's actions. Financial concerns, which also have lower loadings, can be especially important in household interactions.

The final factor, which involves concerns the respondent may have about the supervisor and vice versa, is similar to the job satisfaction factor about the supervisor but oriented in the opposite (negative) direction. The strongest-loading variable is difficulty with supervisor-employee communication. Next in importance is the concern of not being visible to the management, which can affect promotion opportunities. The need for self- discipline is likely included here because if the employee does not have self-discipline, then the supervisor is needed to give it. This factor also incorporates the need for professional and social interaction at work, both of which often involve the supervisor, and may reflect, at least in part, the supervisor's ability to provide a supportive work environment. Not surprisingly, supervisor discomfort with the work location is also included in this factor. Although the variable accounting for the need for enough space to work may appear to be out of place in this factor, its loading may reflect a concern on the part of the supervisor, or the perception of the respondent that there is such a concern.

Table 4-4. Mean and Standard Deviation for Part C Factor Scores
Factor Study Group Regular Workplace Telecommuting Center Home
Personal Benefits All -0.98 (0.64) 0.30 (0.62) 0.68 (0.54)
Center -0.98 (0.66) 0.58 (0.47) 0.55 (0.53)
Home -1.27 (0.59) -0.50 (0.46) 0.96 (0.35)
Non -0.75 (0.55) 0.05 (0.49) 0.87 (0.57)
Work Ethic All 0.19 (0.60) 0.41 (0.59) -0.60 (1.17)
Center 0.16 (0.66) 0.58 (0.53) -0.90 (1.15)
Home 0.07 (0.59) 0.34 (0.41) 0.39 (0.66)
Non 0.34 (0.41) 0.18 (0.49) -0.43 (1.11)
Work Responsibility All -0.39 (0.79) 0.04 (0.70) 0.36 (0.82)
Center -0.29 (0.80) 0.16 (0.74) 0.23 (0.88)
Home -0.34 (0.62) -0.30 (0.55) 0.67 (0.59)
Non -0.72 (0.79) -0.09 (0.60) 0.50 (0.73)
Family All -0.81 (0.68) 0.14 (0.72) 0.67 (0.60)
Center -0.89 (0.73) 0.46 (0.61) 0.66 (0.62)
Home -1.00 (0.58) -0.56 (0.60) 0.85 (0.69)
Non -0.44 (0.43) -0.27 (0.48) 0.57 (0.47)
Supervisor Concerns All -0.59 (0.67) 0.01 (0.66) 0.58 (0.96)
Center -0.53 (0.72) -0.05 (0.65) 0.73 (0.88)
Home -0.66 (0.54) 0.01 (0.72) -0.10 (0.85)
Non -0.72 (0.61) 0.19 (0.63) 0.62 (1.07)

Since the work attitudes factors were calculated from variables for all three workplaces, each work attitudes factor will have, in general, a non-zero mean for each workplace even though the overall mean for all workplaces is zero (see Table 4-4). Based on the attitudes of all respondents, the telecommuting center is seen as a midway point between the regular workplace and home for most of the factors. The personal benefits of the work arrangement are strongest from home, but still important from the telecenter. The center is seen as an effective workplace for completing work tasks, while home is perceived to be the least effective work environment. On average, working from home requires the most work responsibility, working from the office requires the least, and working from the telecenter requires only average responsibility. Although family responsibilities are best handled by working at home, working at the center also provides opportunities for handling family care. Similarly to the third factor, supervisor concerns are high from home, average from the center, and low for the regular workplace. As a result, telecommuting from a center seems to mitigate some of the impacts, both positive (personal benefits, family care) and negative (supervisor concerns), of telecommuting from home.

The factor means for each study group at each workplace are also reported in Table 4-4. Although the overall mean for personal benefits was highest at home, the prospective telecommuting center users rated the personal benefits at the center slightly higher than for home. For the work ethic factor, each study group rated its workplace location as the best place to get work done. The group means for the other three factors mirror the rankings of the means for all groups combined. Unlike the job satisfaction factors, the work attitudes factors have significantly different means among groups and workplaces as shown by the p-values from a two-way analysis of variance (see Table 4-5). All workplace main and interaction effects are significant (shown in boldface type) for the five factors at a level of confidence of 0.1. Only two factors are not significant for the study groups: work ethic and work responsibility. For these factors, the means are not significantly different for the center-based, home-based, and non-telecommuters.

Table 4-5. Part C Factors ANOVA Results (P-values)
Factor Study Group Effects Workplace Effects Interaction Effects
Personal Benefits .002 .000 .000
Work Ethic .217 .000 .000
Work Responsibility .469 .000 .008
Family .008 .000 .000
Supervisor Concerns .058 .000 .026



Previous
Next
Table of Contents

Return to the Abstracts of Non-Refereed Publications page.
Return to the Main page.