Unit 5 Machine Learning

Part 1 Reading & Translating

Section A: Decision Tree in Machine Learning

Decision Tree in Machine Learning is used for supervised learning [classification and regression]. A Decision Tree exploits the correlation between features and the nonlinearity in the features.
Wondering what a Decision Tree would be? You might have come across the programmatic representation of a decision tree, which is a nested if-else. Let us consider the following pseudo logic, where we are trying to classify a given living thing into either human, bird or plant:

if (displacement is present) {
    if (wings are present AND feathers are present) {
        livingthing is bird
    } else if (hands are present) {
        livingthing is human
    }
} else if (displacement is absent) {
    livingthing is plant
}

In the above pseudo code, the output variable is the category of the living thing, whose value could be human, bird or plant. The input variable is the living thing. The features of the input data taken into consideration are displacement [whose values are present/absent], wings [whose values are present/absent], feathers [whose values are present/absent] and hands [whose values are present/absent]. So we have four features whose values are discrete.
In traditional programs, the above if-else-if code is hand written. The effort put in by a human being in identifying the rules and writing this piece of code, where there are four features and one input, is relatively small. But could you imagine the effort required if the number of features is in the hundreds or thousands? It becomes a tedious job with nearly impossible timelines. A Decision Tree could learn these rules from the training data. Unlike other classifiers such as the Naive Bayes Classifier or other linear classifiers, a Decision Tree could capture the nonlinearity of a feature or any relation between two or more features.
Regarding capturing the relation among features in the above example, the features (wings and feathers) are correlated. For the considered example (or data set), their values are related in a way such that their collective value decides the decision flow.
In machine learning, the input dataset for the Decision Tree algorithm would be the list of feature values with the corresponding categorical value. A sample of the dataset is as shown in Table 5-1.

Table 5-1 A sample of the dataset

Input          Output      Features
living being   category    wings      hands      feathers   displacement
Joe            human       absent     present    absent     present
Parrot         bird        present    absent     present    present
Jean           human       absent     present    absent     present
Hibiscus       plant       absent     absent     absent     absent
Eagle          bird        present    absent     present    present
Rose           plant       absent     absent     absent     absent

Each row in Table 5-1 represents an observation/experiment. In practical scenarios, the number of features could range from a single-digit number to thousands, and the data set could contain from a single-digit number to millions of entries/observations/experiments.
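To illustrate how such rules can be learned automatically rather than hand written, the following sketch (not part of the original text) trains scikit-learn's DecisionTreeClassifier on the Table 5-1 data. The present/absent encoding as 1/0, the feature order, and the variable names are assumptions made only for this example.

# A minimal sketch, assuming present = 1 and absent = 0 for every feature.
from sklearn.tree import DecisionTreeClassifier, export_text

features = ["wings", "hands", "feathers", "displacement"]
# Rows follow Table 5-1: Joe, Parrot, Jean, Hibiscus, Eagle, Rose
X = [
    [0, 1, 0, 1],  # Joe      -> human
    [1, 0, 1, 1],  # Parrot   -> bird
    [0, 1, 0, 1],  # Jean     -> human
    [0, 0, 0, 0],  # Hibiscus -> plant
    [1, 0, 1, 1],  # Eagle    -> bird
    [0, 0, 0, 0],  # Rose     -> plant
]
y = ["human", "bird", "human", "plant", "bird", "plant"]

tree = DecisionTreeClassifier(criterion="gini", random_state=0)
tree.fit(X, y)

# Print the learned if-else rules; redundant features are simply never used.
print(export_text(tree, feature_names=features))
print(tree.predict([[1, 0, 1, 1]]))  # -> ['bird']

Running this reproduces the nested if-else logic above without anyone writing the rules by hand, which is the point the text makes about scaling to hundreds or thousands of features.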
The common way to build a Decision Tree is to use a greedy approach. Consider that you are greedy about the number of Decision Nodes: the number of Decision Nodes should be minimal. By testing a feature value, the dataset is broken into sub-datasets, with the condition that the split gives maximum benefit to the classification, i.e., the feature value considered (among all the possible feature value combinations) is the best available to categorize the given data set into two subsets. In each sub-dataset, a new feature value combination is chosen, as in the former split, to divide it into smaller sub-datasets, with the same condition that the split gives maximum benefit to the classification. The process is repeated until a Decision Node is not required to further split the sub-dataset, and almost all of the samples in that sub-dataset belong to a single category.
The graphical representation of the Decision Tree for the dataset mentioned above would be as shown in Figure 5-1. From Figure 5-1, it is evident that the Decision Tree has made use of only two features [displacement, wings], as the other two features are redundant. Thus it was able to reduce the number of Decision Nodes.

Figure 5-1 Flowchart representation of Decision Tree

Words
regression [rɪˈɡreʃn] n. 回归
programmatic [ˌprəʊɡrəˈmætɪk] adj. 有计划的,按计划的
displacement [dɪsˈpleɪsmənt] n. 位移
discrete [dɪˈskriːt] adj. 离散的,不连续的
living being 有机体,生物
parrot [ˈpærət] n. 鹦鹉
hibiscus [hɪˈbɪskəs; haɪˈbɪskəs] n. 木槿,芙蓉花
entry [ˈentri] n. 条目

Phrases
such that 如此……以致

Exercises

I. Read the following statements carefully, and decide whether they are true (T) or false (F) according to the text.
1. The common way to build a Decision Tree is to use a SVM approach.
2. Regarding machine learning, the input dataset for the Decision Tree algorithm would be the list of feature values with the corresponding categorical value.
3. Decision Tree could capture the nonlinearity of a feature or any relation between two or more features.
4. In Figure 5-1, the Decision Tree has used only two features [displacement, feathers].
5. Decision Tree in Machine Learning is used for unsupervised learning.

II. Choose the best answer to each of the following questions according to the text.
1. Which of the following is right about the Decision Tree? ()
A. Decision Tree could capture the linearity of a feature or any relation between two or more features.
B. Decision Tree exploits correlation between features and nonlinearity in the features.
C. Decision Tree in Machine Learning is used for unsupervised learning.
D. The common way to build a Decision Tree is to use a SVM approach.
2. How many features are mentioned in Figure 5-1? ()
A. One    B. Two    C. Three    D. Four
3. Which two features has the Decision Tree used in Figure 5-1? ()
A. [displacement, feathers]    B. [displacement, hands]    C. [displacement, wings]    D. None of the above

III. Fill in the numbered spaces with the words or phrases chosen from the box. Change the forms where necessary.

variable    independent    can    define    call
many    dependent    estimate    what    common

Linear Regression
Linear regression is a basic and 1 used type of predictive analysis. The overall idea of regression is to examine two things: (1) does a set of predictor variables do a good job in predicting an outcome (dependent) variable? (2) Which 2 in particular are significant predictors of the outcome variable, and in 3 way do they—indicated by the magnitude and sign of the beta estimates—impact the outcome variable?
These regression 4 are used to explain the relationship between one dependent variable and one or more 5 variables. The simplest form of the regression equation with one dependent and one independent variable is 6 by the formula y = c + b*x, where y = estimated 7 variable score, c = constant, b = regression coefficient, and x = score on the independent variable.
There are 8 names for a regression's dependent variable. It may be 9 an outcome variable, criterion variable, endogenous variable, or regressand. The independent variables 10 be called exogenous variables, predictor variables, or regressors.

IV. Translate the following passage into Chinese.

Support Vector Machine (SVM)
A support vector machine is a supervised learning algorithm that sorts data into two categories. It is trained with a series of data already classified into two categories, building the model as it is initially trained. The task of an SVM algorithm is to determine which category a new data point belongs in. This makes SVM a kind of non-binary linear classifier.
An SVM algorithm should not only place objects into categories, but have the margins between them on a graph as wide as possible.

Section B: K-means Clustering Algorithm and Example

"I'm clueless," you say, looking at an ocean of unlabeled data waving in front of you. It is true that the lack of labels can sometimes freak us out, leaving us wondering how to group the data together. But luckily, the k-means clustering algorithm is here to the rescue, one of the simplest algorithms for unsupervised clustering (dealing with data without defined categories). Assigning data points into k clusters based on the minimum distance, k-means clustering is simple, helpful, and effective for finding the latent structure in the data.
Here we provide some basic knowledge about the k-means clustering algorithm and an illustrative example to help you clearly understand what it is.
The k-means clustering algorithm is an unsupervised machine learning algorithm for determining which group a certain object really belongs to. What it means by "being unsupervised" is that there are no prescribed labels in the data denoting its structure. The main idea is to assign each observation into the cluster with the nearest mean (centroid [1]), serving as a prototype of the cluster.
Here are five simple steps for the k-means clustering algorithm and an example for illustration:
Step 1: Visualize n data points and decide the number of clusters (k). Choose k random points on the graph as the centroids of each cluster. For this example, we would like to divide the data into 4 clusters, so we pick 4 random centroids (Figure 5-2).

Figure 5-2 Visualize the data and pick the random centroids (which is 4 in this example)

Step 2: Calculate the Euclidean distance between each data point and the chosen clusters' centroids. A point is considered to be in a particular cluster if it is closer to that cluster's centroid than to any other centroid (Figure 5-3).

Figure 5-3 Assign each point into the cluster with the nearest centroid

Step 3: After assigning all observations to the clusters, calculate the clustering score by summing up all the Euclidean distances between each data point and the corresponding centroid.
Total distance $= \sum_{j=1}^{k} \sum_{i=1}^{n} \lVert x_i^{(j)} - c_j \rVert^2$
Where:
k: the number of clusters
n: the number of points belonging to cluster j
c_j: the centroid of cluster j
Step 4: Define the new centroid of each cluster by calculating the mean of all points assigned to that cluster. Here's the formula (n is the number of points assigned to that cluster):
$c_j = \frac{1}{n} \sum_{i=1}^{n} x_i$
Step 5: Repeat from Step 2 until the positions of the centroids no longer move (Figure 5-4) and the assignments stay the same (Figure 5-5).

Figure 5-4 Final iteration: distances are minimized and centroids no longer move

Figure 5-5 Flow chart of k-means clustering algorithm

There you go: the data points are now grouped into 4 different clusters. Using the simple idea of minimizing distances between data points to group them together, the k-means clustering algorithm is extremely helpful for understanding the structure of the data, how observations are classified, and interpreting the story behind them. K-means clustering has been widely used in data analysis, especially in the life sciences, in analyzing thousands to millions of data points in single-cell RNA-seq and bulk RNA-seq experiments.
Note that the Euclidean metric measures the distance based on the vector connecting two points, and will cause some biases for data with different scales. For example, in RNA-seq data, gene expression values can range from as little as 0.001 to a thousand, stretching the data points along an axis. That is, the variable with the smaller scale will easily be dominated and play little role in the convergence, as clusters will scatter along one axis only. For this reason, it is necessary to make sure that the variables are at the same scale before using k-means clustering.
Note that before determining the number of clusters to assign the data into (the variable k), you should have an overview of the data and of the basis on which you want to group them. You can even apply hierarchical clustering on the data first to briefly understand its structure before choosing k by hand.
A well-known method to validate the number of clusters is the Elbow method [2], that is, to run k-means clustering several times for a range of values of k (usually from 2 to 10) and pick out the value of k that causes a sudden drop in the sum of squared distances. More specifically, for each value of k, we calculate the sum of squared distances (between each point and the corresponding centroid) and graph the results on a line chart. Choose the value where the sum of squares drops, giving an angle in the graph (a.k.a. an elbow)—that is the optimal value of k (Figure 5-6).
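To make Steps 1-5 concrete, here is a minimal NumPy sketch (not part of the original text). The function name, the choice of initial centroids from the data points, and the stopping rule are illustrative assumptions, and empty clusters are not handled.

# A minimal sketch of the five k-means steps described above.
import numpy as np

def kmeans(points, k, n_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 1: pick k of the data points at random as the initial centroids.
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iters):
        # Step 2: assign each point to the cluster with the nearest centroid
        # (Euclidean distance).
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 3: clustering score = sum of squared distances to the assigned centroids.
        score = (dists[np.arange(len(points)), labels] ** 2).sum()
        # Step 4: new centroid = mean of the points assigned to each cluster.
        new_centroids = np.array([points[labels == j].mean(axis=0) for j in range(k)])
        # Step 5: stop when the centroids no longer move.
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids, score

# Example usage with random 2-D data; running this for k = 2..10 and plotting
# the scores would give the "elbow" curve discussed below.
data = np.random.default_rng(1).normal(size=(200, 2))
labels, centroids, score = kmeans(data, k=4)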
Figure 56Elbow point example Words clustering[klstri] n.聚类 cluster[klst(r)] n.群集,簇,集群 latent[leitnt] adj.潜在的,潜伏的 prescribe[priskraib] v.规定 mean[min] n.平均数,平均值 centroid[sentrid] n.质心,形心 Euclidean[juklidin] adj.欧几里得几何学的,欧几里得的 observation[bzvein] n.数据点 RNAseq转录组测序技术(RNA sequencing) dominate[dmineit] v.支配,控制 convergence[knvdns] n.趋同,融合,一体化 metric[metrik] n.度量标准 gene[din] n.基因 axis[ksis] n.轴,轴线 Phrases an ocean of极多的,无穷无尽的 freak out崩溃,使处于极度兴奋中 serve as用作,充当 sum up计算……的总数 squared distances距离平方 Abbreviations a.k.a.亦称,又名(also known as) Notes [1] kmeans是一种数据聚类算法,质心(centroid)是指各个类别的中心位置,质心的维数等同于单条数据的维数。比如说,你有1000条数据,每条数据100维。如果使用kmeans算法将这1000条数据聚为10个类别,就会得到10个质心。每个类别的质心是该类别所有数据点的均值。比如第一次确定了10个质心,同时也将元数据分别归类到这10个质心,那么接下来可继续调整质心以致最后达到最优: (1) 将各个示例sample分配到距离最近的质心; (2) 对于各个类别,计算其所包含的sample的平均值,作为该类别新的质心。 [2] 肘部法则(Elbow method),此种方法适用于K(簇的数量) 值相对较小的情况,当选择的k值小于真正的K时,k每增加1,cost值就会大幅地减小; 当选择的k值大于真正的K时,k每增加1,cost值的变化就不会那么明显。这样,正确的k值就会在这个转折点,类似elbow的地方。 Exercises I. Read the following statements carefully,and decide whether they are true (T) or false (F) according to the text. 1. Kmeans clustering algorithm is a supervised machine learning algorithm. 2. The main concept of kmeans is to assign each observation into the cluster with the nearest mean (centroid),serving as a prototype of the cluster. 3. To find the latent structure in the data kmeans clustering is a simple way to assign data points into k clusters based on the minimum distance. 4. “Being unsupervised“ is that there are some prescribed labels in the data denoting its structure. 5. Elbow method is a wellknown method which validates the number of clusters. II.Choose the best answer to each of the following questions according to the text. 1. Which of the following is not mentioned in the text?() A. ID3 B. Centroid C. Euclidean D. Kmeans 2. How many steps are mentioned for the kmeans clustering algorithm and an example for illustration?() A. Two B. Three C. Four D. Five 3. Which of the following is right?() A. The main concept of kmeans is to assign each observation into the cluster with the nearest mean (centroid),serving as a prototype of the cluster. B. To find the latent structure in the data kmeans clustering is a simple way to assign data points into k clusters based on the minimum distance. C. Elbow method is a wellknown method which validates the number of clusters. D. All of the above III. Fill in the numbered spaces with the words or phrases chosen from the box.Change the forms where necessary. 
understand    labor    deal    advantage    like
base    method    as    reflect    use

Clustering Algorithms
Clustering algorithms can automatically recognize the pattern inside the data so 1 to analyze the collected data without their labels. Using this advantage, three clustering-based fault diagnosis methods are presented to 2 with some diagnosis cases of rotating machinery in which the labeled data are limited. In the first method, the compensation distance evaluation technique and the weighted K nearest neighbor are 3 to recognize the mechanical faults, harnessing the merits that the computation of feature weights is simpler and the weights are easier to 4 . The second method is presented 5 on weighted fuzzy c-means, which is robust to the local structure of the data and 6 the level of uncertainty over the most appropriate assignment. Finally, a hybrid clustering algorithm-based fault diagnosis 7 is introduced, considering the problems 8 the sample influence for clustering and the automatic setting of the cluster number. The results of the diagnosis cases verify that these diagnosis methods take full 9 of unlabeled data and reduce the human 10 in fault diagnosis.

IV. Translate the following passage into Chinese.

Ensemble Learning
Many ensemble learning tools can be trained to produce various results. Individual algorithms may be stacked on top of each other, or rely on a "bucket of models" method of evaluating multiple methods for one system. In some cases, multiple data sets are aggregated and combined. For example, a geographic research program may use multiple methods to assess the prevalence of items in a geographic space. One of the issues with this type of research involves making sure that the various models are independent, and that the combination of data is practical and works in a particular scenario.
Ensemble learning methods are included in different types of statistical software packages. Some experts describe ensemble learning as "crowdsourcing" of data aggregation.

Part 2 Simulated Writing: Developing Reports and Proposals (I)

报告和提案是在工作中最常写的长文档。这两者都回答了某个主题或项目的问题,或者针对某个问题提供解决方案。读者将会研究作者的报告,并且运用其中的结论和分析来帮助他们进行决策。除了商业企业之外,非盈利机构和政府机构也会撰写报告来总结或者分析研究状况。有时,组织会雇佣专业的撰稿人撰写提案以赢得合同,或获得销售机会。学会写作这些重要的文档是一项很有价值的专业技能。

1. 了解报告和提案
报告是一种针对特定主题交流信息而设计的书面文档。虽然有些报告可以包含分析或建议,但撰写的报告往往很客观。提案与报告很相似,但其目的在于说服和通知。提案提供了有关产品、服务或者想法的信息,并且试图说服读者接纳所建议的解决方案。报告与提案的一个关键区别在于它们被写作的时间。提案通常在制定决策过程的早期进行,此时它能够影响决策。报告通常在已经采取一些行动之后撰写。当一项活动或项目发生的时候,一些报告可以记录它们的状态。当活动或项目完结时,可以撰写其他的报告。报告和提案的类型参见图5-7。

图5-7 报告和提案的类型

在开始撰写报告或提案前,请回答下面的问题:
撰写的目的是什么?
撰写报告的第一步是明确地定义目的。首先分析想要达到的目标,目标是通知、更新、分析,还是说服?目标将帮助决定应该使用的形式。
读者是谁?
与其他类型的文档相同,撰写报告或提案时,要考虑读者。为了更好地满足读者的需求,要辨别他们理解报告或提案主旨的程度。他们想要通过阅读报告或提案了解什么?他们有可能怎样阅读?应该怎样撰写才能使信息清晰,并且使读者易懂?一定要考虑主要读者和次要读者,以及那些可能会阅读该文档的任何人。
应该撰写报告还是提案?
撰写报告是为了与他人分享信息。撰写提案是为了说服读者采纳想法、产品或者解决方案。这两者与分析报告很类似,但区别是这里只呈现一个建议。表5-2给出了何时应该撰写报告或提案的建议。

表5-2 何时撰写报告或提案

场景 | 报告 | 提案 | 其他
参加一场贸易展示会,希望通告本公司的竞争对手的产品 | √ | |
需要为公司流程撰写文档 | √ | |
分析是购买新的计算机设备还是升级现有设备 | √ | |
提议购买新的计算机设备 | | √ |
为规划职员资源提议一种新方案 | | √ |
为个人或组织提供公司的服务 | | √ |
在所参加的一场会议上为之后的查阅总结所做的笔记 | | | 非正式笔记或大纲
为一般的受众推销公司的服务 | | | 广告
为潜在的顾客描述公司产品,并且提供样品 | | | 展示

报告中会展示信息还是分析话题?
报告可以是下述两种类型中的一个。信息报告以清晰、客观的形式展示信息。当想为读者书面总结针对某个主题的信息时,使用信息报告比较合适。意见和建议不应写在一个信息报告之中。分析报告一般会呈现数据、分析和结论。分析报告通常会提供不同的选择,鉴别优劣以得到替代方案,以及包含具体的建议。
提案是为内部还是外部的读者而撰写?
提案也有两种类型。内部提案建议如何在一个组织内解决问题,例如,通过改变一个程序或者使用商家的不同产品或服务。外部提案被设计来销售产品或服务于客户,并且通常为响应请求而撰写。
回答这些问题有助于决定报告应该有多长,包含什么样的信息,以及适当的形式。

2. 规划报告或提案
有条理地组织业务报告和提案,以便使信息容易阅读和理解。在写第一句话之前,就应该有针对如何组织报告或提案的好的思路。将一般的思路组合在一起,并遵循逻辑顺序。该顺序能够满足目的,并有助于读者明白所写的内容。有逻辑地组织信息的方式应依时间、重要性以及类别,例如位置或产品来决定。撰写正式或非正式的大纲可以有助于规划有效的报告。表5-3总结了撰写大纲的注意事项。

表5-3 撰写大纲的注意事项

要素: 主要思路
适合提到:
· 以头脑风暴开始,列出想要包含的所有思路
· 选择一个作为主要的思路
· 写在大纲开头
尽量避免:
· 保留所有的思路,而不是只保留那些服务于报告或者提案的目的和那些服务于读者的思路
· 表述主要思路超过两句

要素: 大标题和章节
适合提到:
· 选择议题并写出相应的标题
· 使用标准标题,例如介绍(Introduction)和结论(Conclusion)
· 按逻辑顺序列出标题
· 在正式大纲中,使用罗马数字标注大标题
尽量避免:
· 偏离如下的标准模式: (1)介绍(Introduction); (2)事实和发现(Facts or findings); (3)结论(Conclusion)
· 包含没有足够细节和证据的议题

要素: 子标题
适合提到:
· 用子标题将大议题分解为子议题
· 以逻辑顺序列出子议题,例如,时间、重要性,或者类别
· 在正式大纲中,第一级子标题使用大写字母,下一级使用数字,最后一级使用小写字母
尽量避免:
· 以任意顺序列出子议题
· 使用难以解释的子标题

1) 首先确定主要的思路
开始撰写大纲可以通过在页面顶端用一两个句子描述主要思路来开始。如果主要的思路太长,可以精简所写的内容。在页面的上方说明主要的思路,有助于在制定大纲的其余部分时专注于自己的目标。许多报告和提案的主要思路是要描述一个解决问题的办法。
2) 为重要的思路使用标题
复查报告的思路和主题,并选择最重要的部分。这些都应作为大纲的主要标题。这些标题要按照逻辑顺序列出,比如从最重要的到最不重要的,或按时间顺序(如果报告强调了时间)。这些标题将成为报告的主要部分。图5-8展示了正式和非正式大纲中的标题,包括使用罗马数字、大写字母、数字和小写字母的规范。

图5-8 正式和非正式的大纲

3) 为子议题创建子标题
可以将每一个主要议题分为几个思路,以便详细地讨论它们。在大纲中列出这些思路,将其作为子标题。可以为每个大标题提供两个或两个以上的子标题,如图5-8所示。如果正在写一个很长的或者很复杂的报告,可以将子议题分解为更小的部分。
4) 将合适的章节添加进来
大多数报告和提案包括标准章节,如介绍、背景、现状、事实、提出的解决方案、总结、结论、建议、利弊、参考清单和附录。选择能够服务于报告或提案目的的章节。
5) 复查大纲
复查大纲的完整草案以便回答以下问题: 思路是否按照逻辑顺序安排?如果大声读大纲给自己听,听起来是否有意义?标题和子标题是否具有逻辑性和平衡性?它们的重要性是否差不多?如果有必要则重新排列顺序。议题是否已经有了足够的细节或证据来支持主要的思路?如果不是,那就应该将它们添加到大纲中或者重组大纲。

Part 3 Listening & Speaking

在线音频

Dialogue: Machine Learning
(Before the first lesson of Machine Learning, Mark met with Henry and Sophie in front of their classroom)
Mark: Excuse me, Henry and Sophie. Could you help me? [1]
Henry: Sure. What's the problem?
[1] Replace with:
1. Can you give me a hand?
2. Could you please do me a favor?
3. Could you do me a favor?
Mark: I'm a little bit confused about machine learning. Exactly [2] what is machine learning?
[2] Replace with:
1. Accurately
2. Correctly
3. Definitely
4. Truly
5. Precisely
Henry: Well, machine learning is the scientific study of algorithms and statistical models that computer systems use to effectively perform a specific task without using explicit instructions, relying on models and inference instead. It is seen as a subset of artificial intelligence.
Sophie: To my knowledge, machine learning algorithms build a mathematical model of sample data, known as "training data", in order to make predictions or decisions without being explicitly programmed to perform the task. Machine learning algorithms are used in the applications of email filtering, detection of network intruders, and computer vision, where it is infeasible to develop an algorithm of specific instructions for performing the task.
Mark: Does machine learning have some relationships with other areas?
Henry: Of course. Machine learning is closely related to computational statistics, which focuses on making predictions using computers. The study of mathematical optimization delivers methods, theory and application domains to the field of machine learning. Data mining is a field of study within machine learning, and focuses on exploratory data analysis through unsupervised learning. In its application across business problems, machine learning is also referred to as predictive analytics.
Mark: And are there any classifications for machine learning?
Sophie: Absolutely. Machine learning tasks are classified into several broad categories. In supervised learning, the algorithm builds a mathematical model of a set of data that contains both the inputs and the desired outputs.
For example, if the task were determining whether an image contained a certain object, the training data for a supervised learning algorithm would include images with and without that object (the input), and each image would have a label (the output) designating whether it contained the object.
Henry: In special cases, the input may be only partially available, or restricted to special feedback. Semi-supervised learning algorithms develop mathematical models from incomplete training data, where a portion of the sample inputs are missing the desired output.
Mark: Could you please name a few algorithms for supervised learning?
Henry: Sure. Classification algorithms and regression algorithms are types of supervised learning. Classification algorithms are used when the outputs are restricted to a limited set of values. For a classification algorithm that filters emails, the input would be an incoming email, and the output would be the name of the folder in which to file the email. For an algorithm that identifies spam emails, the output would be the prediction of either "spam" or "not spam", represented by the Boolean values true and false.
Sophie: And regression algorithms are named for their continuous outputs, meaning they may have any value within a range. Examples of a continuous value are the temperature, length, or price of an object.
Mark: So, how about unsupervised learning?
Sophie: Well, in unsupervised learning, the algorithm builds a mathematical model of a set of data which contains only inputs and no desired outputs. Unsupervised learning algorithms are used to find structure in the data, like grouping or clustering of data points.
Henry: Moreover, unsupervised learning can discover patterns in the data, and can group the inputs into categories, as in feature learning. Dimensionality reduction is the process of reducing the number of "features", or inputs, in a set of data.
Mark: OK, so what else?
Henry: Well, reinforcement learning algorithms are given feedback in the form of positive or negative reinforcement in a dynamic environment, and are used in autonomous vehicles or in learning to play a game against a human opponent.
Sophie: And other specialized algorithms in machine learning include topic modeling, where the computer program is given a set of natural language documents and finds other documents that cover similar topics. Machine learning algorithms can be used to find the unobservable probability density function in density estimation problems, so on and so forth.
Mark: So much knowledge I'm interested in! Thank you very much!

Exercises
Work in a group, and make up a similar conversation by replacing the statements with other expressions on the right side.

Words
infeasible [ɪnˈfiːzəb(ə)l] adj. 不可行的,不可实行的
designate [ˈdezɪɡneɪt] v. 指定,指派
file [faɪl] v. 把……归档

Phrases
reinforcement learning 强化学习
so on and so forth 等等

在线音频

Listening Comprehension: Supervised Learning
Listen to the article and answer the following 3 questions based on it. After you hear a question, there will be a break of 15 seconds. During the break, you will decide which one is the best answer among the four choices marked (A), (B), (C) and (D).

Questions
1. Which of the following is right? ()
(A) Supervised learning is the machine learning task of learning a function that maps an input to an output based on example input-output pairs.
(B) Supervised learning infers a function from labeled training data consisting of a set of training examples.
(C) A supervised learning algorithm analyzes the training data and produces an inferred function.
(D) All of the above
2. Regarding the handwritten digit recognition problem, which of the following is right? ()
(A) A reasonable data set for this problem is a collection of images of handwritten digits.
(B) A reasonable data set for this problem is, for each image, what the digit actually is.
(C) A set of examples of the form (image, digit) should be considered.
(D) All of the above
3. Which of the following can't supervised learning do? ()
(A) Supervised learning is the machine learning task of learning a function that maps an output to an input based on example output-input pairs.
(B) Supervised learning is the machine learning task of learning a function that maps an input to an output based on example input-output pairs.
(C) Supervised learning infers a function from labeled training data consisting of a set of training examples.
(D) A supervised learning algorithm analyzes the training data and produces an inferred function.

Words
map [mæp] v. 映射
entirety [ɪnˈtaɪərəti] n. 全部,完全
outset [ˈaʊtset] n. 开始,开端

在线音频

Dictation: Unsupervised Learning
This article will be played three times. Listen carefully, and fill in the numbered spaces with the appropriate words you have heard.
Unsupervised learning is a 1 of machine learning that learns from test data that has not been 2 , classified or categorized. Instead of 3 to feedback, unsupervised learning identifies commonalities in the data and reacts based on the presence or 4 of such commonalities in each new piece of data. 5 include supervised learning and reinforcement learning.
In the unsupervised 6 , the training data does not contain any output information at all. We are just given input examples X1, …, XN. You may wonder how we could possibly learn anything from mere inputs. Consider the coin 7 problem. Suppose that we didn't know the denomination of any of the 8 in the data set.
We still get similar 9 , but they are now 10 so all points have the same "color". The decision regions in unsupervised learning may be 11 to those in supervised learning, but without the labels. However, the correct clustering is less 12 now, and even the number of clusters may be 13 .
14 , this example shows that we can learn something from the inputs by themselves. Unsupervised learning can be 15 as the task of spontaneously finding 16 and structure in input data. For instance, if our task is to 17 a set of books into topics, and we only use 18 properties of the 19 books, we can identify books that have similar 20 and put them together in one category, without naming that category.

Words
commonality [ˌkɒməˈnæləti] n. 公共,共性
denomination [dɪˌnɒmɪˈneɪʃn] n. 面额
spontaneously [spɒnˈteɪniəsli] adv. 自发地,自然地