A multi-objective evolutionary algorithm-based soft computing model for educational data mining: A distance learning experience

Purpose – The purpose of this paper is to propose a soft computing model based on a multi-objective evolutionary algorithm (MOEA), namely, the modified micro genetic algorithm (MmGA) coupled with a decision tree (DT)-based classifier, for classifying and optimising students’ online interaction activities as classifiers of student achievement. Subsequently, the results are transformed into useful information that may help educators in designing better learning instructions geared towards higher student achievement. Design/methodology/approach – A soft computing model based on MOEA is proposed. It is tested on benchmark data pertaining to student activities and achievement obtained from the University of California at Irvine machine learning repository. Additionally, a real-world case study in a distance learning institution, namely, Wawasan Open University in Malaysia, has been conducted. The case study involves a total of 46 courses, with data collected over 24 consecutive weeks from students across all regions of Malaysia and worldwide. Findings – The proposed model obtains high classification accuracy rates with a reduced number of features. These results are transformed into useful information for the educational institution in our case study in an effort to improve student achievement. In both the benchmark and the real-world case study, the proposed model reduced the number of features used by at least 48 per cent while achieving higher classification accuracy. Originality/value – A soft computing model based on MOEA, namely, MmGA coupled with a DT-based classifier, is proposed for handling educational data.


Introduction
For decades, educators have been troubled by questions such as "Do study habits correlate with test score achievements?", "Why is educational attainment, especially in terms of higher education completion rate or student retention rate, so difficult to achieve?" (Atkinson and Geiser, 2009; Nandeshwar et al., 2011), and "What is needed to improve the quality of education?" (Langstrand et al., 2015; Willman et al., 2015). We argue that by inspecting the variation in observational data on predictors (independent variables) and outcomes (dependent variables), our understanding of the interplay among different factors, for instance, student-teacher interactions (Allen et al., 2011) and teacher quality (Kraft, 2015), can lead to intervention instruction design that improves academic outcomes. An educational system is a complex system which comprises the pedagogy, human assets (e.g. students and teachers), supportive tools (e.g. course materials and infrastructure), social and cultural influence, and government policies, among others. At the same time, the advent of technologies, in particular the World Wide Web (often referred to simply as the internet), has also given rise to new waves in the educational domain.
In this context, distance learning, spurred by the emergence of the internet, has become one of the key components in higher education. It offers adult learners the opportunity to pursue their educational dreams without the barrier of distance. Distance learning delivers course materials without requiring student and teacher to be in the same physical room. This convenience is usually made possible by computer-mediated learning or two-way interactive videos (Tan, Bong and Natarajan, 2015). As such, pedagogic strategies designed to match the new level of instructions with the level of distance learners are rapidly gaining attention among modern educators.
Additionally, the internet has also opened up new opportunities for open education in the form of open educational resources (OER). OER include books, videos, journals, articles, podcasts, lesson plans, open software, and so on, made available in the spirit of openness and sharing (Smith, 2009). Consequently, the increase in e-learning resources and student databases leads to massive repositories of data. Although there has been considerable research on the use of data mining (DM) techniques in discovering potentially useful information from large sets of data in the fields of healthcare (Wang et al., 2013), manufacturing (Durán et al., 2010), market analysis (Fiol-Roig and Miro-Julia, 2011), and many others, only recently have researchers begun to apply them to issues in the educational realm through a discipline known specifically as educational data mining (EDM). The motivation comes from the realisation that, in any given context, additional hidden information may be revealed if one invests in looking for it. EDM involves the development, research, and application of computerised methods in detecting potential patterns from huge pools of educational data (Romero et al., 2010).
With the aggressive push towards sophisticated technology and boundary-less education, more and more factors interact within the already complex educational systems. Unfortunately, as remarked in the journal Science (Smith, 2009), we still lack, to some extent, an understanding of the effectiveness of value-added components (e.g. OER) introduced to the educational arena. Many educational research works employ EDM techniques that rely on statistical methods (with a few exceptions). However, these traditional ad hoc combinations of data management tools and statistical methods are now far from adequate for analysing the vast pool of data. Therefore, it is time to seek a new computing approach that can be applied to highly non-linear, complex, and large-volume data environments. Moreover, the approach ought to be able to handle incompleteness and shortage of data. Fortunately, soft computing offers a solution. Soft computing is a collection of methodologies capable of handling imprecision and uncertainty with the aim of achieving tractable, robust, yet low-cost solutions. Such characteristics make soft computing models highly attractive for DM applications.
To give an immediate example, a soft computing method, in particular genetic programming, complements existing educational research by providing better insights into how student participation affects achievement (Xing et al., 2015). Furthermore, empirical research using DM techniques to classify early dropout from selective Mexican high schools has been carried out with success (Márquez-Vera et al., 2016). Yet another recent application of a DM technique (i.e. cluster analysis) in exploring the correlation between students' online interaction patterns and achievement is reported in Cerezo et al. (2016). While these approaches certainly have their strengths, their generic application is contested when handling multi-objective optimisation problems (MOPs). In principle, MOPs require simultaneous optimisation of conflicting objectives. In a model for classifying student achievement, for example, one would seek to improve the classification accuracy rate of the predictive results while using the least number of available features (e.g. demographic data, social data, past examination grades, and other school-related data) from a pool of data, since such features are often difficult to obtain and, even when available, may be incomplete. Therefore, a robust model should maintain a high accuracy rate in the presence of uncertainty or data incompleteness.
This paper presents a soft computing technique, which comprises an evolutionary algorithm (EA), i.e., the modified micro genetic algorithm (MmGA) (Tan, Lim and Cheah, 2013), coupled with a decision tree (DT)-based classifier, namely, C4.5, for the classification and optimisation of the system. The MmGA works well in the MOP context, as shown by a series of previous successes (Tan, Lim and Cheah, 2013; Tan et al., 2014; Tan, Lim, Cheah and Tan, 2013; Tan, Hanoun and Lim, 2015; Tan et al., 2017). In order to evaluate the proposed soft computing model, an empirical case study is conducted. In particular, we aim to answer the first research question raised at the beginning of the paper. That is, the case study attempts to uncover potentially informative patterns in students' online interaction activities in relation to test score achievement in an open distance learning institution in Malaysia. Student performance modelling is noted as the second most popular area in EDM research (Peña-Ayala, 2014). The implications of the present work can be translated into helpful information that may assist educators in designing instructions that can be customised to students' learning behaviours so as to improve achievement. Kremer et al. (2013) share a similar view that technology can be used to tailor learning to a student's level of knowledge.
The remainder of the paper is organised as follows: the second section covers the background review of EDM and EA, followed by a short overview of the C4.5 classifier. The third section is devoted to the description of our proposed methodology. The proposed model is first verified using a set of benchmark problems. Next, a case study demonstrating the efficacy of our model is described in the fourth section. Finally, the paper concludes with some promising avenues for future work in the fifth section.

Review on EDM and learning analytics (LA)
In recent years, the potential of DM and analytics has transformed field after field. DM is the process of analysing and gleaning useful but hidden information from huge data sets (Mukhopadhyay et al., 2014a, b). DM is popular due to its ability to discover data patterns, classify objects, cluster homogeneous objects, and unveil numerous kinds of new findings (Peña-Ayala, 2014). In the educational domain, a specific form of DM is known as EDM. EDM emerges as an approach to explore massive educational data in enhancing the educational sector. It leverages the data mined through DM to improve learning, cognition, and assessment (Sachin and Vijay, 2012).
On the other hand, LA refers to the use of learner data in reporting and analysing models for the purpose of predicting and advising learners (Ferguson, 2012; Hwang et al., 2017). Though LA is commonly identified with EDM, they differ in terms of goals and scope. Baker and Inventado (2014) contrast EDM and LA in a recent review: from the technical perspective, EDM deals with the development of methodologies for the analysis of learning data, while LA focusses on the interpretation of these data for optimising learning and its environment. EDM also emphasises the modelling of relationships among specific constructs, whereas LA relates the interplay among constructs from a holistic view of the system. Additionally, EDM research works concentrate mostly on the development of automated support for learners. LA, on the other hand, puts more effort into informing and empowering instructors about learners' performance progress.

AAOUJ 12,1
To date, EDM has taken on and extended many other related fields including text mining, machine learning, statistics, and psychometrics. Romero et al. (2013) propose a prediction model of student performance using DM techniques such as clustering and class association rules. The model has been claimed to be more representative of student groups (clusters) compared to the previous rule-based model. EDM is also popularly used for student modelling (Lemmerich et al., 2011; Nandeshwar et al., 2011), which aims at characterising students by emotion, achievements, skills, and learning preferences, and at fulfilling individual learning requirements through adaptation of teaching experiences. Another area of EDM application is student assessment and evaluation, which enables student proficiency to be distinguished at a fine-grained level (Lopez et al., 2012). EDM also facilitates student feedback and support (Leong et al., 2012). More generally, EDM can be applied to educational problems with regard to emotion in context, engagement, meta-cognition, and collaboration tasks (Baker, 2014).
From the lens of LA, students' engagement and learning outcomes can be improved with proper intervention by instructors. To be effective, the intervention should be provided at the right time. LA and DM techniques are helpful in this case. A commonly used approach for discovering sequential patterns among events is known as sequential pattern mining. Chen et al. (2017) adopt frequent sequence mining and lag sequential analysis (LSA) in order to study how learners collaborate in knowledge-building discourse. Similarly, LSA facilitates the exploration of learners' sequential patterns in other learning settings such as online interaction behaviour (Cheng et al., 2017) and problem-solving behaviour (Chiang, 2017; Hu et al., 2017). It is also not uncommon to adopt LA in analysing learners' behavioural patterns as a result of interaction with strategies or technological tools. For instance, Kizilcec et al. (2017) investigate various self-regulated learning strategies in the MOOC environment in the hope of uncovering the most effective strategies and how they manifest in online behaviour. Meanwhile, Van Leeuwen et al. (2014) examine how teacher supporting tools in the context of computer-supported collaborative learning affect teacher guidance behaviour.
In general, more and more educators are turning to both EDM and LA for improving educational outcomes. As shown by Xing et al. (2015), synthesising LA approaches and EDM techniques supplemented by genetic programming produces an effective student performance prediction model. The model has been claimed to possess a higher prediction rate and interpretability compared to traditional models. Whether EDM or LA, educators can continue to benefit from the various scientific and systematic analysis methodologies available.

Review on EAs and multi-objective optimisation
Natural evolution provides a promising collection of inspirations for computational algorithms. The group of computing methodologies which analogises the evolutionary process of a biological population in finding optimal solutions to optimisation problems is known as EA (Golberg, 1989). Generally, EA can be divided into four major classes: genetic algorithm (Holland, 1992), evolutionary programming (Fogel, 1966), evolution strategies (Schwefel, 1993), and genetic programming (Koza, 1992).
Unlike traditional methodologies, EAs are distinguished mainly by the use of a population of candidate solutions in the search space. Each member of the population corresponds to a potential solution. The quality of the solution is determined by a fitness value associated with each member. During each iteration step (generation), members with better fitness receive higher chances of survival or of becoming the parents of the next generation. Offspring, which are the new population members, are generated using variation operations such as mutation and/or crossover. The evolutionary process ends after some termination criteria are met. These synergetic combinations of population-based, fitness-based, and variation-driven search have achieved success in many complex optimisation problems (Tan, Lim and Cheah, 2013; Lim et al., 2015a, b, 2016; Tan et al., 2017). Meanwhile, the GA literature contains a long list of variants diverging from the original, yet maintaining the essential GA characteristics. Among the more popular ones are the micro-GA, monogamous GA (Lim et al., 2015a, 2016), island model GA, and cellular GA, to name but a few [1].
Many real-world problems are made up of performance measures (objectives) that are often conflicting in interest. They ought to be optimised simultaneously in order to achieve a trade-off. In this light, a special domain of EA that deals with MOPs is known as the multi-objective evolutionary algorithm (MOEA). In any MOP, it is not surprising that a set of optimal solutions (as opposed to a single optimum) is obtained. The optimal solution set usually consists of a number of solutions that are mutually non-dominated according to the Pareto dominance concept. As a result, comparing different optimal solution sets is a challenging task (Jiang et al., 2014). Various quantitative performance metrics exist in the MOP literature for defining the optimality of different solution sets. These include the generational distance (GD) metric (Durillo and Nebro, 2011), the generalised spread metric (Zhou et al., 2006), and the hypervolume metric (Zitzler and Thiele, 1999).
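To make the Pareto dominance concept concrete, the following minimal Python sketch filters a handful of candidate solutions down to their non-dominated set. It assumes the two objectives considered later in this paper (accuracy to be maximised, number of features to be minimised); the function names and the candidate values are illustrative only and are not taken from the experiments.

```python
from typing import List, Tuple

# Each solution is (accuracy, n_features): accuracy is maximised,
# n_features is minimised. The values below are made up for illustration.
Solution = Tuple[float, int]


def dominates(a: Solution, b: Solution) -> bool:
    """Return True if a Pareto-dominates b."""
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    strictly_better = a[0] > b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(solutions: List[Solution]) -> List[Solution]:
    """Keep only the solutions that no other solution dominates."""
    return [s for s in solutions
            if not any(dominates(other, s) for other in solutions)]


candidates = [(0.90, 12), (0.88, 8), (0.85, 10), (0.90, 15), (0.80, 5)]
front = pareto_front(candidates)
print(front)
```

Three candidates survive: none of them is better than the others on both objectives at once, which is why an MOP yields a set of trade-off solutions rather than a single optimum.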
In the meantime, MOEAs can be broadly classified into aggregation-based, indicator-based, and Pareto-based approaches. The aggregation-based approaches treat an MOP as a single-objective optimisation problem that can then be solved using conventional EAs after combining all its objective functions into a single weighted scalar value. However, the major shortcoming of this approach is that the scalar function and weights are critical in determining the efficiency of the algorithm, and finding suitable weights is an optimisation problem in itself. On the other hand, the indicator-based MOEAs typically adopt a selection mechanism with a specific performance metric (Zitzler and Künzli, 2004; Beume et al., 2007). They have the advantage of scaling well with the number of objectives, typically four or more. However, they are generally more computationally expensive, especially when using the hypervolume metric.
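The sensitivity of aggregation-based approaches to the choice of weights can be illustrated with a small Python sketch. The scalarising function below is one simple assumed form (a convex combination of accuracy and the fraction of features saved), and the candidate values and the total of 24 features are hypothetical.

```python
def weighted_sum(accuracy, n_features, total_features, w_acc):
    """Scalarise two objectives into one score; higher is better."""
    feature_saving = 1.0 - n_features / total_features
    return w_acc * accuracy + (1.0 - w_acc) * feature_saving


# Hypothetical (accuracy, n_features) trade-off solutions.
candidates = [(0.90, 12), (0.88, 8), (0.80, 5)]
TOTAL = 24


def best_for(w_acc):
    """Return the candidate preferred under a given accuracy weight."""
    return max(candidates,
               key=lambda s: weighted_sum(s[0], s[1], TOTAL, w_acc))


for w in (0.9, 0.7, 0.5):
    print(w, best_for(w))
```

Each of the three weights selects a different candidate, so the "optimum" reported by an aggregation-based method depends directly on weights that must themselves be tuned.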
Finally, a representative Pareto-based MOEA approach is the MmGA (Coello and Pulido, 2005). MmGA is also an extension of the micro-GA. It has been used with great success in handling various multi-objective benchmark problems (Tan, Lim and Cheah, 2013), job-shop scheduling problems (Tan, Hanoun and Lim, 2015; Tan et al., 2017), as well as classification problems (Pourpanah et al., 2017). Even though the MmGA uses only a small population relative to the other GA variants, it is able to achieve a good convergence rate (see the third section for more details). As such, this work employs the MmGA as an optimisation means. The MmGA uses GD as its performance metric.
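As a rough illustration of the GD metric, the sketch below uses one commonly cited form, in which GD is the square root of the summed squared distances from each obtained solution to its nearest point on a reference (true) Pareto front, divided by the number of obtained solutions. Whether the MmGA uses exactly this variant is an assumption, and the fronts shown are made up.

```python
import math


def generational_distance(obtained, reference):
    """GD = sqrt(sum of squared nearest distances) / number of obtained points."""
    dists = [min(math.dist(p, r) for r in reference) for p in obtained]
    return math.sqrt(sum(d * d for d in dists)) / len(dists)


# Hypothetical two-objective fronts (both objectives minimised).
reference_front = [(0.0, 1.0), (0.5, 0.5), (1.0, 0.0)]
on_front = [(0.0, 1.0), (1.0, 0.0)]
off_front = [(0.0, 1.0), (0.5, 0.8)]

print(generational_distance(on_front, reference_front))
print(generational_distance(off_front, reference_front))
```

A value of zero means every obtained solution lies on the reference front; larger values indicate poorer convergence.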

Review on C4.5 classifier
This section provides a quick overview of the C4.5 classifier, which is commonly used for generating a DT. First and foremost, a DT is a tree-like structure composed of decision rules. These rules recursively group the independent variables into homogeneous zones (Cho and Kurup, 2011). DT is commonly used in acquiring information for decision making. This follows from the observation that by constructing a DT, the outcome of a set of input variables can be predicted simply by finding the set of decision rules (Pradhan, 2013). In fact, DT has been ranked as the second most popular classification method in EDM in a recent survey conducted in Peña-Ayala (2014).

Even though there exists a plethora of DT model construction algorithms, for instance, the chi-square automatic interaction detector DT (Michael and Gordon, 1997) and the classification and regression tree (Breiman et al., 1984), this paper focusses on the use of the C4.5 classifier (Quinlan, 1986) for reasons of simplicity and wide applicability.
C4.5 is an extension of the ID3 algorithm (Quinlan, 1986), which is based upon Hunt's algorithm (Hunt and Kübler, 1984). It addresses many problems that were not accounted for by its predecessor, including continuous and categorical attributes, pruning, and rule derivation. In the C4.5 algorithm, a DT is built from a set of training data, $S = \{s_1, s_2, \ldots\}$. Each sample $s_i$ is made up of a $p$-dimensional vector $(x_{1,i}, x_{2,i}, \ldots, x_{p,i})$, where $x_{k,i}$ refers to the $k$th feature or attribute value of sample $s_i$. When encountering continuous attributes, the algorithm simply divides the attribute values into two partitions as specified by a given threshold. In order to remove the bias of information gain, especially when an attribute has many outcome values, the C4.5 algorithm relies on the gain ratio as its selection measure. Starting from the attribute with the highest gain ratio, the algorithm recurses on smaller sub-lists. In this way, the root node has the maximum gain ratio. The attribute with the higher gain ratio will be chosen for decision making (Quinlan, 1993).
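The role of the gain ratio can be sketched in a few lines of Python. The example below computes entropy, information gain, and gain ratio for a toy data set, and shows the two-way threshold split applied to a continuous attribute. It is a simplified illustration of the selection measure only, not a full C4.5 implementation; the attribute names, values, and threshold are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of discrete values."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def gain_ratio(values, labels):
    """Information gain of splitting on `values`, divided by split information."""
    total = len(labels)
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    split_info = entropy(values)
    return info_gain / split_info if split_info > 0 else 0.0


def split_at(values, threshold):
    """Turn a continuous attribute into two partitions around a threshold."""
    return ["<=" if v <= threshold else ">" for v in values]


labels = ["pass", "pass", "fail", "fail"]
logins = [9, 7, 2, 1]          # hypothetical continuous attribute
student_id = [1, 2, 3, 4]      # many-valued attribute with no real meaning

print(gain_ratio(split_at(logins, 5), labels))
print(gain_ratio(student_id, labels))
```

Both attributes separate the classes perfectly and therefore have the same information gain, but the many-valued identifier receives a lower gain ratio, which is the bias correction described above.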

Proposed model
In this work, a soft computing model to classify and optimise students' online behaviours in a distance learning environment is presented. Students' online behaviours, as characterised by a set of web data, form the input to our proposed model. The web data represent the frequency of students' interactions with courses within the distance learning environment. Our aim is to classify students' frequency of access to the learning repository against their examination achievement at the end of a semester. The proposed model is elaborated as follows.
Initially, a standard C4.5 classifier (Quinlan, 1993, 1996) is applied. It uses a divide-and-conquer approach to growing DTs from a set C of cases. If C fulfils a stopping criterion (for instance, it contains only cases of the same target class), the tree for C is a leaf associated with the most frequent target class in C. Otherwise, for a set X of cases, the proportion of cases belonging to the jth class is identified, and the uncertainty about the class of a case in X, together with the corresponding information gained by a test T with k outcomes, is computed.
Next, a specific MOEA, namely, the MmGA (Tan, Lim and Cheah, 2013), is deployed. The MmGA performs optimisation on two objective functions, i.e., maximising the classification accuracy rate (α) and minimising the number of features (β) of the classification process. Note that α describes the systematic errors and measures the statistical bias in handling predictors and outcomes of the C4.5 classifier processes. As articulated earlier (recall section "Review on EAs and multi-objective optimisation"), the MmGA is able to achieve a good convergence rate as indicated by the GD metric. The MmGA's search process terminates when the maximum number of objective function evaluations has been reached or when convergence, as measured against the true Pareto front, has been achieved. Details on the C4.5 classifier as well as the objective functions α and β in relation to the MmGA are presented in the Appendix.
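The pairing of a classifier with the two objectives can be sketched as follows. Here a candidate solution is assumed to be a binary mask over the features; β is the number of selected features and α is the accuracy of a classifier restricted to those features. Because the Appendix equations are not reproduced here, this encoding is an assumption, and a leave-one-out 1-nearest-neighbour rule is used purely as a lightweight stand-in for the cross-validated C4.5 classifier. The data are made up.

```python
def select(row, mask):
    """Keep only the feature values whose mask bit is 1."""
    return [v for v, keep in zip(row, mask) if keep]


def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out accuracy of a 1-NN rule (stand-in for C4.5, an assumption)."""
    if not any(mask):
        return 0.0
    Xs = [select(row, mask) for row in X]
    correct = 0
    for i, xi in enumerate(Xs):
        nearest = min((j for j in range(len(Xs)) if j != i),
                      key=lambda j: sum((a - b) ** 2 for a, b in zip(xi, Xs[j])))
        correct += y[nearest] == y[i]
    return correct / len(Xs)


def objectives(X, y, mask):
    """Return (alpha, beta): accuracy to maximise, feature count to minimise."""
    return loo_1nn_accuracy(X, y, mask), sum(mask)


# Toy data: feature 0 separates the classes, features 1 and 2 are noise.
X = [[1, 5, 9], [2, 1, 3], [1, 9, 1], [8, 5, 2], [9, 1, 8], [8, 9, 4]]
y = [0, 0, 0, 1, 1, 1]

print(objectives(X, y, [1, 0, 0]))
print(objectives(X, y, [0, 1, 1]))
```

An evolutionary search such as the MmGA would evaluate many such masks and retain those that are non-dominated in (α, β).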

Benchmark tests
The proposed model aims to yield a solution set which fulfils the objective functions $f_1$ and $f_2$ such that the classification accuracy rate is maximised while the number of features used during the classification stage is minimised. Prior to application on a real-world case study, we first examined the proposed model's performance on a set of benchmark data obtained from the University of California at Irvine (UCI) machine learning repository (University of California, 2017). The benchmark data set comprises students' achievements in mathematics and Portuguese in two Portuguese secondary schools. The data attributes include student grades, demographic, social, and school-related features. They were collected by using both school reports and questionnaires as published in Cortez and Silva (2008).
Note that only the mathematics achievements, modelled as a binary classification task, are adopted in this study; the original work also considers five-level classification and regression tasks. We adhere to the original performance evaluation of the Portuguese education system. That is, students are evaluated in three periods during the school year based on a 20-point grading scale (with values between 0 = lowest score and 20 = perfect score). Hence, the data set is split into three data sets according to period grade, i.e., first period grade (G1), second period grade (G2), and final grade (G3). As a result, each newly created data set originally has 30 features, which correspond to the variable x in Equation (A5), for each target class G1, G2, or G3, separately. To begin, the collected grades for each class were binarised prior to classification processing: student grades were re-categorised into two groups, namely, well performed (those with a score of 8 or above) and not well performed (those with a score below 8).
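The binarisation step described above is straightforward and can be sketched as below. The threshold of 8 on the 20-point scale is taken from the text; the function name and the sample scores are illustrative.

```python
PASS_THRESHOLD = 8  # from the text: a score of 8 or above is "well performed"


def binarise_grade(score: int) -> str:
    """Map a 0-20 period grade to one of the two target classes."""
    if not 0 <= score <= 20:
        raise ValueError(f"grade out of range: {score}")
    return "well performed" if score >= PASS_THRESHOLD else "not well performed"


grades_g3 = [0, 7, 8, 15, 20]  # made-up final grades
labels = [binarise_grade(g) for g in grades_g3]
print(labels)
```

The same mapping would be applied separately to G1, G2, and G3 to obtain the three binary target classes.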
For comparison purposes, the proposed model first uses only a standard C4.5 classifier (note: in the remainder of this paper, we refer to this model simply as the C4.5 classifier). Subsequently, an enhanced model which incorporates the MmGA coupled with the standard C4.5 classifier is deployed. It is coined the MmGA-based classifier. The MmGA analogises the evolutionary process of a biological population in finding optimal solutions for the MOP, in this context by maximising α (Equation (A3)) and minimising β (Equation (A4)). Each member of the population corresponds to a potential solution, which is created with the MmGA extended population formation. We also employ a ten-fold cross-validation method in producing the experimental results. All experiments involving both methods are repeated over 30 runs with randomised seeds.
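The evaluation protocol (ten-fold cross-validation repeated over 30 seeded runs) can be sketched as follows. This is a plain, unstratified split written for illustration; whether the original experiments stratified the folds is not stated, and the sample count used below is hypothetical.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


def repeated_runs(n_samples, n_runs=30, k=10):
    """Yield (seed, splits) for each independent run with its own seed."""
    for seed in range(n_runs):
        yield seed, list(k_fold_indices(n_samples, k, seed))


splits = list(k_fold_indices(100))   # 100 samples is a made-up size
runs = list(repeated_runs(100))
print(len(splits), len(runs))
```

In each run, a classifier would be trained on every `train_idx` and scored on the matching `test_idx`, and the per-run accuracies would then be averaged.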

Results and discussion
Figures 1 and 2 depict the performance of the proposed model as compared to the standard C4.5 when simultaneously optimising the objective functions $f_1(x)$ and $f_2(x)$. Apart from achieving a lower β, our proposed model reported a higher α relative to the standard C4.5 classifier. For completeness, the mean and standard deviation values obtained for each experiment are tabulated in Table I. Mean values marked in italics indicate the best results that are statistically significant at the 95% confidence level under the pairwise t-test (Hall and Holmes, 2003; Götz et al., 2008) comparison. These encouraging results indicate that our proposed model is effective in optimising the given data set, using fewer features while at the same time yielding a much higher classification accuracy rate. We attribute this to the strength of the MmGA in performing multi-objective optimisation. Consider a population of probable solutions (aka members) in our proposed model. Each member is represented as a variable x following Equation (A5) and is further associated with multi-objective-based fitness values, in this case α and β. The quality of the member is determined by its fitness values. Like all EAs, the MmGA favours members with better fitness: at each iteration step, members with better fitness receive higher chances of survival or of becoming the parents of the next generation under an elitism strategy. Offspring, or new population members, are generated using mutation, crossover, and selection operators. The evolutionary process ends with both objectives converging in the MmGA nominal and outlier evolution cycles, yielding p (Equation (A5)) in response to α and β.
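For readers who wish to reproduce the significance check, the paired t statistic can be computed as in the sketch below. The accuracy values are invented for illustration and cover only five runs; the critical values quoted in the comments are standard two-sided t-table entries at the 5 per cent level.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(a, b):
    """t statistic for paired samples: mean difference over its standard error."""
    diffs = [x - y for x, y in zip(a, b)]
    return mean(diffs) / (stdev(diffs) / math.sqrt(len(diffs)))


# Made-up per-run accuracies for two methods evaluated on the same runs.
proposed = [0.90, 0.91, 0.89, 0.92, 0.90]
baseline = [0.85, 0.86, 0.85, 0.86, 0.85]

t = paired_t_statistic(proposed, baseline)

# Two-sided critical values at the 5 per cent level:
# 2.776 for 4 degrees of freedom (5 runs), 2.045 for 29 (30 runs).
print(t, abs(t) > 2.776)
```

With 30 runs, as used in the experiments, the statistic would be compared against 2.045 rather than 2.776.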

MOEA-based soft computing model for EDM
A case study
Satisfied with the preliminary results, let us now consider applying the proposed model to a real-world case study encompassing a Malaysian private institution of higher education with more than a decade of history in open distance education. The institution offers tertiary education to working adults via the open distance learning mode. The learning environment caters for adult learners seeking to pursue tertiary qualifications for professional development and self-enrichment in a flexible manner. Furthermore, students and tutors come from regions across Malaysia and worldwide.
Rather unique in its kind, the open distance learning institution provides five face-to-face tutorial classes that are spread over a period of five months to its students every semester. It also offers learning-support services via an open source learning management system (LMS). The LMS is an important platform for collaborative learning involving massive teaching-learning activities among course instructors, tutors, and students. For example, apart from the face-to-face classes, students and tutors continue to interact via video conferencing tools supported by the learning platform. Students are also free to engage in online activities such as downloading course materials, posting discussions in forums, participating in online quizzes, submitting assignments, and many more, at any time and anywhere convenient to them. On the other hand, instructors and tutors often play the role of system administrator in the online platform by uploading course materials, initiating discussion groups, setting up quizzes, marking assignments, answering posts, and others. It should be noted that, throughout the semester, students are generally assessed using three instruments in three periods: assignment 1 (T1), assignment 2 (T2), and the final examination, in the second, fourth, and fifth months, respectively.

Experimental setup
Moving on, the proposed MmGA-based soft computing model is depicted in Figure 3. Initially, data extracted from the LMS go through a pre-processing stage. It involves gathering various students' interaction data from courses and converting their frequency into the required raw data in a tabular format. Noise is removed from the raw data, which are then transformed into a structured data format, i.e., an Extensible Markup Language file format, so that the C4.5 classifier may perform further processing. The processing stage involves employing the MmGA-based soft computing model. Lastly, the output of the processing stage is made available for interpretation. In most complex systems, the interpretation may involve end-users and the incorporation of other tacit knowledge to uncover, for instance, any possible relationship between the trends of students' online interaction activities with the e-learning platform and their examination performance. Students' daily online interaction activities for every course are captured in the LMS. In this study, a total of 46 courses offered in the said institution are examined. The data are collected throughout the entire semester for 24 consecutive weeks, including two weeks prior to the start of the semester and two weeks after the end of the semester. This contributed 24 features, with samples grouped into two target classes: well-performed and not well-performed student classes, corresponding to examination scores above or equal to 60 marks and below 60 marks, respectively. Note that the number of features is determined by a fixed interval of seven days. They form the input features for the classification and optimisation processes in the subsequent experimental studies carried out within the institution's computational-based server farm.
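The conversion of daily interaction records into 24 weekly features can be sketched as follows. The seven-day interval, the 24-week window, and the 60-mark threshold come from the text; the log format (a list of access dates per student and course), the start date, and the function names are assumptions made for illustration.

```python
from datetime import date, timedelta

N_WEEKS = 24     # from the text: 24 consecutive weeks
PASS_MARK = 60   # from the text: 60 marks or above is "well-performed"


def weekly_counts(access_dates, first_day):
    """Count interactions per seven-day interval, starting at first_day."""
    counts = [0] * N_WEEKS
    for d in access_dates:
        week = (d - first_day).days // 7
        if 0 <= week < N_WEEKS:   # ignore records outside the 24-week window
            counts[week] += 1
    return counts


def label(exam_score):
    """Map an examination score to one of the two target classes."""
    return "well-performed" if exam_score >= PASS_MARK else "not well-performed"


first_day = date(2016, 1, 4)  # hypothetical start of the collection window
accesses = [first_day,
            first_day + timedelta(days=3),
            first_day + timedelta(days=7),
            first_day + timedelta(days=167),
            first_day + timedelta(days=168)]  # falls outside the window

features = weekly_counts(accesses, first_day)
print(features, label(72))
```

Each student-course pair thus yields one 24-element feature vector and one class label, which together form a row of the classifier's input table.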

Results and discussion
As depicted in Figure 4, there were initially many chosen features (i.e. weeks) resulting from the application of the C4.5 classifier. After applying the MmGA, a significant reduction in the number of features is observed. It is worth remarking that this effect is achieved without any reduction in the accuracy rates, as shown in Figure 5. On average over 30 runs, there is approximately a 6 per cent improvement in the accuracy rate when employing our proposed model compared to the standard C4.5 classifier. In addition, Figures 4 and 5  i.e., a reduction of up to 57 per cent relative to the standard C4.5 classifier. On closer inspection, the box plot distribution also reflects that the proposed model is more consistent and stable as compared to the standard C4.5 classifier, since the former has a narrower box and shorter tails. The reader is referred to Table II for the numerical results comprising the mean, standard deviation, and p-value of the pairwise t-test comparisons. The best mean values marked in italics are statistically significant at the 95% confidence level.
To take a step further, let us examine the major determinants for the proposed model more closely. As illustrated in Figure 6, the ten most prominent features (shaded in black) have been identified by our proposed model after optimisation. They represent the most significant weeks of student interaction activities that are adopted by the proposed model in classifying students' achievement (recall Figure 4 and Table II). The captured interaction activities include, but are not limited to, students' post-reply inquiries on tutorials, technical hands-on, examination-related discussions, online quizzes, manipulation of teaching-learning materials, academia-related consultation, and clarification.
At second glance, the emergent patterns reveal several interesting events. First, students are actively involved in pre-semester activities before the start of a course (weeks 1 through 2). An obvious example of such activity is the exploration of course materials by students. The trend extends towards the second week just before the commencement of a new semester, which in turn corresponds to the closing date of course registration. This comes as little surprise, as students are naturally curious and eager to know about a course in which they have newly enrolled. Yet the obvious may not yet have caught the attention of educators. As evidenced here, educators eager to improve students' first perception of a course, and subsequently their motivation to continue the course (a student retention strategy), should at least invest more time in the preparation work. Early content availability and accessibility, for example, would promote a positive first impression and invite future interaction. A wide range of research works in cognition and social psychology attest to how initial impressions influence human interpretation of later information (Dougherty et al., 1994). Second, weeks 7 through 10 have been recognised as other significant indicators of student performance. Inherently, plenty of practical lab preparation and T1 discussions take place throughout the second month of the semester. To educators, this is likely the best time to engage more with students to ensure that they are coping well with the course. The notion arises from the assumption that increasing the two-way interactions between tutor and student will enhance both student motivation and achievement. To this end, Allen et al. (2011) find a strong correlation between high-quality tutor-student interactions and improved student achievement gains.
Meanwhile, the third week of the fourth month and the first week of the fifth month are the remaining two indicators identified for student performance. The former reflects the presence of discussion activities following T2, while the latter arises as a result of revision activities in conjunction with the final examination. The preceding makes it clear that revision is important in boosting student performance. Educators may plan revision activities on a gradual basis. Revision may be treated as a form of reinforcing previously learned knowledge through practice. Intuitively, educators should pay more attention to retrieval practice in consolidating learning (Karpicke and Roediger, 2008; Karpicke and Blunt, 2011) as part of the revision process. Finally, the common activities involved in the remaining two weeks (weeks 24 through 25) may include closing of posts and reporting of web statistics. They reflect the closing of the semester.

Conclusions
Much of the developing and potential EDM research on student achievement is confined to statistical methods (with a few exceptions) and lacks the means to handle multi-objective optimisation problems. To this end, this paper fills the gap by proposing a soft computing model that couples MmGA with a DT-based classifier, namely, C4.5, for classification and optimisation. Our model has shown promising results in student achievement classification on the UCI benchmark problem and the real-world distance learning case study in Malaysia by simultaneously optimising multiple objectives (performance factors), i.e., maximising the accuracy rate, α, and minimising the number of features, β. Whether on the benchmark or the real-world case study, the proposed model successfully reduced β by at least 48 per cent while achieving higher α.
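The trade-off between the two objectives can be made concrete with the usual Pareto dominance relation. The sketch below is a generic illustration, not the MmGA implementation: the function names are hypothetical, and each candidate is assumed to be summarised as a pair (α, β) of accuracy rate and number of features.

```python
def dominates(a, b):
    """True if candidate a = (alpha, beta) Pareto-dominates candidate b.

    a dominates b when it is no worse on both objectives (accuracy no lower,
    feature count no higher) and strictly better on at least one of them.
    """
    alpha_a, beta_a = a
    alpha_b, beta_b = b
    no_worse = alpha_a >= alpha_b and beta_a <= beta_b
    strictly_better = alpha_a > alpha_b or beta_a < beta_b
    return no_worse and strictly_better


def pareto_front(candidates):
    """Return the candidates that no other candidate dominates, in input order."""
    return [c for c in candidates if not any(dominates(o, c) for o in candidates)]
```

A multi-objective evolutionary algorithm typically retains such non-dominated candidates from one generation to the next, leaving the final choice between, say, a slightly more accurate classifier and a smaller feature subset to the decision maker.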
We believe that this work can expand access to knowledge and insight into student interaction activities and their achievement based on our empirical results. It may serve as a potential platform to inform educators seeking to reform educational policy by enhancing the provision of learning-support services and creating a better learning experience for students. To this end, the results presented could be easily translated into useful information, such as when and what should be done in order to achieve the target research goal. In the case study, early educator preparation work, improved tutor-student interactions, and investment in retrieval practice may well improve student motivation and achievement. This is only the beginning of a study that can lead to more elaborate outcomes for the educational arena. The results thus far may well be specific to the case study e-learning environment, but the proposed model is transferable to any optimisation and classification problem. We also plan to deploy other types of MOEAs and classifiers in other experiments in the near future. Finally, another promising direction is an investigation into the behaviour of the proposed model in response to nebulous data covering different domains of interest to educators.
We need more rigorous research in the educational arena, and soft computing models have opened up a new route. We believe that the future is bright and that the vacuum of empirical evidence will continue to be filled by enthusiastic research works in EDM and the like.
(a) G1; (b) G2; (c) G3
Figure 1. A comparison of the accuracy rate between the standard C4.5 classifier and the proposed model in the data set (a) G1; (b) G2; (c) G3
Figure 3. Proposed model for the EDM in an open distance learning institution in Malaysia
Figure 4. A comparison of the features number between the standard C4.5 classifier and the proposed model
Mean ± standard deviation for 30 runs experimental results and computed p-values. Mean values marked in italics indicate best statistical significance results at 95% confidence interval under the pairwise t-test comparisons

Mean ± standard deviation for 30 runs experimental results and computed p-values. Mean values marked in italics indicate statistical significance results at 95% confidence interval under the pairwise t-test comparison