Development of a Four-Tier Test with a Metacognitive Perspective Approach on the Topic of Newton's Laws

: This research aimed to develop a learning evaluation instrument on Newton's Law Materials using a four-tier test with a metacognitive perspective to identify students' thinking abilities from metacognition and students' misconceptions. The research employed the R&D (Research and Development) design with a 4-D development model (define, design, develop, and disseminate). The instrument was constructed by developing twelve questions. There were ten valid and reliable questions with good quality parameters (difficulty level and differentiating power). The researchers analyzed the four-tier test instrument with a metacognitive perspective approach using item difficulty level analysis, item discriminatory analysis, test reliability analysis, and expert validity testing. This research involved 30 participants in the small-scale test and 250 participants in the large-scale trial. The test results showed that the ten questions could identify students' thinking abilities from the perspective of metacognition and misconceptions on Newton's Law material. The findings of the large-scale test showed that students' thinking skills were mapped into high, medium, and low categories. Students' misconceptions were also mapped on each indicator of the questions compiled. Based on the mapping result, the highest students' misconceptions were found in the aspect of the metacognitive experience, with an average of 76 percent of students experiencing misconceptions.


INTRODUCTION
Physics is a discipline that systematically studies important concepts based on careful observations, measurements, and experiments (Ogundeji et al., 2020). Students can build their abilities from low to high levels, emphasizing understanding concepts in physics (Labra et al., 2012).
Human resources in the twenty-first century are emphasized mastering critical thinking and problem-solving skills (Z. Hidayat et al., 2019;Plessis, 2015;Rahman, 2019). Physics learning is intended as a vehicle to grow thinking skills. This thinking ability is important for problem-solving processes in everyday life (Permatasari et al., 2019;Saputro et al., 2019;Usmeldi, 2016). Problem-solving is needed in metacognitive abilities so that students can understand the problems being solved and the physics concepts learned to be more meaningful. Metacognition is involved in learning so that students emphasize awareness in their thinking processes (Jaleel & Premachandran, 2016).
Metacognition ability is thinking about how to think about oneself (thinking about thinking) or someone's knowledge about the process of thinking (Diandita et al., 2017). Metacognition is also explained as a process in which a person can think about his way of thinking to build strategies to solve problems (Livingston, 2003). This research maximizes students' metacognitive abilities because the instrument developed uses deep thinking and finds problem-solving to answer.
Metacognitive abilities and their relation to the learning process play an important part in the world of education (Zohar & Dori, 2009) as an effort to increase students' awareness of thinking in learning (R. Hidayat et al., 2018). To measure metacognitive ability, the fourtier test format can be used (Gurel et al., 2017). The four-tier test can identify students' misunderstandings and familiarize them to account for the answers even though they are incorrect (Kaniawati, 2017). Evaluation using the four-tier test is associated with solving students' problems so that they can find out the process of answering and solving them (Putranta & Supahar, 2019).
The evaluation instrument is developed to determine the metacognitive ability, which refers to a person's understanding of in-depth knowledge (Handel et al., 2013). The four-tier test developed so far only explores students' understanding of a concept. So far, the four-tier test has only been used to diagnose students' misunderstandings, even though the evaluation instrument developed can also be used to determine the students' thinking flow according to the questions asked. (Handel et al., 2013). Also, based on observations at several high schools in the ex-residence of Pati conducted by researchers by interviewing teachers, no instrument has been found that has been compiled to identify patterns of students' thinking abilities seen from the metacognitive perspective using the four-tier test.
Therefore, this research developed a four-level test instrument using a metacognitive perspective approach. The instrument developed, apart from measuring misconceptions, is also able to diagnose the level of students' thinking abilities from a metacognitive perspective.

METHOD
The research design used the Research and Development (R&D) method. The R&D method applied the 4-D development model (Thiagarajan et al., 1974) (Mappalotteng et al., 2015. The development flow chart using the 4-D model in this research can be seen in Figure 1. In Figure 1, there are four main stages carried out in this research, namely (1) define stage (definition), (2) design stage (design), (3) develop stage, and (4) disseminate stage.
Based on Figure 1, Phase 1 (define) aims to obtain information related to learning needs using an analysis of the objectives and limitations of the material used in the study. Stage 2 (instrument design) is used to design the four-tier test evaluation to determine the level of students' thinking skills seen from the metacognition perspective and their misconceptions. This stage consists of compiling indicators and question grids, making assessment rubrics, and compiling four-tier test evaluation instruments. Stage 3 (development) aims to produce an evaluation instrument that has been improved based on the validation results by experts and the results of the analysis of the small-scale trial stage. Stage 4 (disseminate) aims to implement a fourtier test evaluation instrument that has been tested on several state high schools in the ex-residence of Pati. In this fourth stage, the level of students' thinking abilities from a metacognitive perspective and their misconceptions are also mapped. In this research, students' thinking abilities from a metacognitive perspective were categorized into factual, conceptual, procedural, planning, monitoring, and evaluation processes. In this research, the product developed was an evaluation instrument, namely a four-tier test to determine the level of thinking ability with a metacognitive perspective and students' misconceptions.
The evaluation instrument developed consists of ten items with graded questions in each item and a validation sheet as an assessment sheet for the evaluation instrument.
The researchers analyzed the data based on the validity, reliability, and identification of the level of thinking ability with metacognitive perspectives and students' misconceptions of Newton's Law material. The small-scale trial stage in the development step was carried out by testing the content validity of the evaluation instrument. Two subject physics teachers and one physics lecturer carried out the expert validation test. The small-scale testing phase was implemented at a high school with 30 students as respondents. The participants of the large-scale testing came from three public high schools in the ex-residence of Pati. The developed evaluation instrument was distributed to the 250 participants (tenthgrade students). Each school provided two classes to be analyzed to determine students' thinking abilities with a metacognitive perspective and misconceptions about Newton's law concepts. The students' thinking levels were categorized into three levels, namely low, moderate, and high categories, as presented in Table 1. The percentage value (P) in Table 1 is obtained through equation 1 (Sudijono, 2012). At the same time, the analysis of misconceptions experienced by students is analyzed based on Table 2.

RESULT AND DISCUSSION Defining
The four-tier test evaluation instrument was developed to determine how students think in answering the questions asked, even though the students' answers are incorrect (Liu et al., 2011). The evaluation instrument is to determine the pattern of physics ability with the metacognitive perspective of students in terms of the combination of answers and student interviews. The results are then grouped into high, medium, and low metacognitive abilities. Students' metacognitive abilities are based on references to a deep understanding of knowledge as a problem by the student (Yusnaeni & Corebima, 2017).

Designing
The form of evaluation instruments is made to determine students' thinking abilities with metacognitive perspectives and students' misconceptions. At this stage, indicators and grids of evaluation instruments are produced (see Table 3). At this stage, as in the initial stage, 12 questions were made according to the designed grid and indicators.
The form of the evaluation instrument developed was also determined at this stage. In this research, the four-tier test format was selected, consisting of four levels of questions asked. The first level refers to items in the form of multiple choices with four distractors and one correct answer The fourth level is the students' belief in the reasons at the third level. Procedural Implement Newton's second law to determine the mass of an object according to the illustration of the event.
2 Choose a solution to illustrate the incident related to increasing or decreasing the friction force.

Metacognitive Experience
Planning Predicting the magnitude of the acceleration based on the diagram of the force acting according to the illustration of events in Newton's second law.

7
Predict the direction and magnitude of the force based on Newton's third law according to the illustration of the incident.

8
Monitoring Assess the magnitude of the force acting following Newton's second law principles.

9
Solve the action-reaction force relationship according to the illustration of events based on Newton's third law.

Evaluation Process
Comparing the magnitude of the force with the effect of the frictional force on an object on a flat plane.

10
Defend the principles of applying Newton's third law in the illustration of events.

Developing
At this stage, a small-scale trial was conducted. The small-scale trial began with validity testing by experts. In this research, expert validation was carried out by two senior physics teachers with more than seven years of teaching experience and one physics lecturer. The expert validation was carried out to assess each item's material, construction, and language aspects. The analysis results of the three expert validations are shown in Table 4.  After expert validation, the instrument reliability test was carried out. The reliability test was used to determine the level of consistency of the data based on the field and the data obtained. The reliability test was analyzed using the Kuider and Richardson (KR20) reliability test. The result of the instrument reliability test was 0.68 (r table 0.3061), with 30 respondents. These results indicate that the instrument was reliable because the value of rcount was higher than rtable. The low level of instrument reliability was due to sampling factors. The students were given limited time to complete the test, so many students' results were not optimal.
Furthermore, the quality parameters of the items are used, including the level of difficulty and the level of discriminating power of the questions (Quaigrain & Arhin, 2017). The results of the interpretation of the level of difficulty and differentiating power are shown in Table 5. The results show that 33.33% of the questions are categorized as easy, 25% are categorized as moderate, and 41.76% are categorized as difficult.
Problems with low discrepancy are not used because the questions cannot distinguish between students who experience misconceptions or not and cannot distinguish thinking abilities from a metacognitive perspective between students.

Disseminating
At this stage, a large-scale test was conducted with 250 students. The largescale test used ten questions. Based on Table 5, questions number 5 and 9 were not used. The evaluation instrument in the large-scale test was distributed to three public high schools in the Pati exresidence, specifically to the tenth-grade students. The large-scale test was intended to identify students' thinking abilities with a metacognitive perspective and analyze the misconceptions experienced by students in Newton's Law material. The results showed that students' thinking abilities from a metacognitive perspective were mapped into low, medium, and high levels in each category of thinking ability. The findings can be seen in Table 6.   1  2  3  4  6  7  8  10  11  12  Understand  31  28  15  36  40  14  14  6  17  6  Do not understand  28  8  7  12  11  11  13  16  14  12  Misconception  41  64  78  52  49  75  73  78  69  82 Based on Table 6, the percentage of students' thinking ability indicators with a metacognitive perspective was high compared to the other categories. The highest percentage in the students' metacognitive abilities category was found in metacognitive knowledge, namely the factual, conceptual, and procedural knowledge indicators. The metacognitive ability for metacognitive strategy indicators has lower results than metacognitive knowledge in all categories of students' metacognitive ability. These results indicated that students with high, medium and low categories tended to have factual knowledge at the level of their thinking ability (Vukić et al., 2020). Factual knowledge is the ability of concrete thinking and has the lowest level of abstraction compared to other indicators of metacognitive knowledge (Veenman et al., 2004). The lowest percentage was in the evaluation process indicators for the high, medium, and low categories. Students with high categories in their thinking process can evaluate their work correctly to maintain the concepts they understand (Craig et al., 2020).
Besides analyzing the achievement of thinking skills from a metacognitive perspective, the researchers succeeded in mapping students' misconceptions of Newton's law material at this stage. The findings showed that students' misconceptions about Newton's law were identified in each item. These results indicate that students experience misconceptions about each indicator (Table 3). Referring to Table 3, the misconceptions experienced by many students were in the indicator questions number 3, 7, 10, and 12. In this indicator, more than 75 % of students experienced misconceptions. The high level of student misconceptions in the aspect of the metacognitive experience was due to the instrument that referred to students' strategies to complete their work. Students had low problem-solving strategies, so it was difficult to answer questions on the aspects of metacognitive experience. Meanwhile, the indicators for questions number 1 and 6 were less than 50 %. The results showed that teachers or instructors could use the evaluation instrument based on the four-tier test as material to explore students' thinking skills from a metacognitive perspective and identify students' misconceptions of Newton's law material.

CONCLUSION
In this research, it can be concluded that of the 12 questions developed, ten questions are valid, reliable, and possess a good parameter quality (in terms of difficulty level and differentiating power). Based on the large-scale trial, the ten questions can identify students' thinking abilities from the perspective of metacognition and misconceptions on Newton's Law material. The findings of the large-scale trial showed that students' thinking skills were mapped into high, medium, and low categories. Also, students' misconceptions are mapped on each indicator of the questions compiled. The developed instrument can be used in learning, especially in physics education, for reference materials as a basis for improving the quality of learning physics at the high school level because learning physics requires a pattern of questions that require problem-solving and deep thinking.