回歸

1.連續(xù)輸出

本課關(guān)于連續(xù)(變量)的監(jiān)督式學(xué)習(xí)
The output variable has been constrained to binary values in our previous setup.
This output is called discrete
但在很多學(xué)習(xí)問(wèn)題中猎荠，輸出也可以是連續(xù)的

image.png

2. 連續(xù)

continuous supervised learning
continuous--output

3. 年齡：連續(xù)還是離散卧晓？

連續(xù)輸出離散輸出

4. 天氣：連續(xù)還是離散昂勉？

我們視為離散的多數(shù)事物其實(shí)在某種程度上是連續(xù)的

7. 收入：連續(xù)還是離散？

我們將一個(gè)變量看成連續(xù)時(shí)精续，我們其實(shí)暗示其有一定的次序，即可以比較大小

8.連續(xù)特征

分類通常意味著離散輸出黔酥，在我們的地形分類問(wèn)題中聋呢，輸出變量是快速/慢速抓艳，如果將輸出推廣為連續(xù)輸出触机，最好的辦法是speed in mile

9.斜率和截距(回歸線性方程)

目標(biāo)變量/嘗試預(yù)測(cè)的變量/輸出 = 斜率(slope)*輸入變量+截距(intercept)
slope -- define how steep the curve goes up 定義了曲線上升的陡度
a larger slope makes it go up faster
in the situation of negative slope,the graph would go down
截距：與縱軸交點(diǎn)的坐標(biāo)

17. 線性回歸編碼

from sklearn import linear_model
clf = linear_model.LinearRegression()
clf.fit([[0,0],[1,1],[2,2]],[0,1,2])
clf.coef_                                 #讀取系數(shù) 斜率

數(shù)據(jù)集(年齡-凈資產(chǎn)) →(分拆)訓(xùn)練集&測(cè)試集
在數(shù)據(jù)中加入噪音，這樣就不是完美的關(guān)系

用訓(xùn)練集擬合直線，得到的就是回歸的結(jié)果儡首，再用這條線預(yù)測(cè)年齡在25-60間任何人的凈資產(chǎn)

18. sklearn中的年齡/凈值回歸

#!/usr/bin/python   studentMain.py

import numpy
import matplotlib
matplotlib.use('agg')

import matplotlib.pyplot as plt
from studentRegression import studentReg
from class_vis import prettyPicture, output_image

from ages_net_worths import ageNetWorthData

ages_train, ages_test, net_worths_train, net_worths_test = ageNetWorthData()



reg = studentReg(ages_train, net_worths_train)


plt.clf()
plt.scatter(ages_train, net_worths_train, color="b", label="train data")
plt.scatter(ages_test, net_worths_test, color="r", label="test data")
plt.plot(ages_test, reg.predict(ages_test), color="black")
plt.legend(loc=2)
plt.xlabel("ages")
plt.ylabel("net worths")


plt.savefig("test.png")
output_image("test.png", "png", open("test.png", "rb").read())

#!/usr/bin/python   studentRegression.py
def studentReg(ages_train, net_worths_train):
    ### import the sklearn regression module, create, and train your regression
    ### name your regression reg
    
    ### your code goes here!
    
    from sklearn import linear_model
    reg = linear_model.LinearRegression()
    reg.fit(ages_train,net_worths_train)
    
    return reg

19. 通過(guò)sklearn提取信息

print "katie's net worth prediction: ", reg.predict([27])  #預(yù)測(cè)結(jié)果
print "slope:", reg.coef_                    #獲取斜率
print "intercept:" ,reg.intercept_              #獲取截距

20. 通過(guò) sklearn 提取分?jǐn)?shù)數(shù)據(jù)

評(píng)估回歸的指標(biāo)：r2片任，sum of errors
r2: 越大，回歸性能越好 max=1

print "\n ######## stats on test dataset ########\n"
print "r-squared score: ",reg.score(ages_test,net_worths_test)  #通過(guò)使用測(cè)試集蔬胯，可以察覺到過(guò)擬合等情況

print "\n ######## stats on training dataset ########\n"
print "r-squared score: ",reg.score(ages_train,net_worths_train)

21. 現(xiàn)在你練習(xí)提取信息

#!/usr/bin/python   regressionQuiz.py
import numpy
import matplotlib.pyplot as plt

from ages_net_worths import ageNetWorthData

ages_train, ages_test, net_worths_train, net_worths_test = ageNetWorthData()



from sklearn.linear_model import LinearRegression

reg = LinearRegression()
reg.fit(ages_train, net_worths_train)

### get Katie's net worth (she's 27)
### sklearn predictions are returned in an array, so you'll want to index into
### the output to get what you want, e.g. net_worth = predict([[27]])[0][0] (not
### exact syntax, the point is the [0] at the end). In addition, make sure the
### argument to your prediction function is in the expected format - if you get
### a warning about needing a 2d array for your data, a list of lists will be
### interpreted by sklearn as such (e.g. [[27]]).
km_net_worth = reg.predict([[27]]) ### fill in the line of code to get the right value

### get the slope
### again, you'll get a 2-D array, so stick the [0][0] at the end
slope = reg.coef_ ### fill in the line of code to get the right value

### get the intercept
### here you get a 1-D array, so stick [0] on the end to access
### the info we want
intercept = reg.intercept_ ### fill in the line of code to get the right value


### get the score on test data
test_score = reg.score(ages_test,net_worths_test) ### fill in the line of code to get the right value


### get the score on the training data
training_score = reg.score(ages_train,net_worths_train) ### fill in the line of code to get the right value



def submitFit():
    # all of the values in the returned dictionary are expected to be
    # numbers for the purpose of the grader.
    return {"networth":km_net_worth,
            "slope":slope,
            "intercept":intercept,
            "stats on test":test_score,
            "stats on training": training_score}

#!/usr/bin/python   ages_net_worths.py
import numpy
import random

def ageNetWorthData():

    random.seed(42)
    numpy.random.seed(42)

    ages = []
    for ii in range(100):
        ages.append( random.randint(20,65) )
    net_worths = [ii * 6.25 + numpy.random.normal(scale=40.) for ii in ages]
### need massage list into a 2d numpy array to get it to work in LinearRegression
    ages       = numpy.reshape( numpy.array(ages), (len(ages), 1))
    net_worths = numpy.reshape( numpy.array(net_worths), (len(net_worths), 1))

    from sklearn.cross_validation import train_test_split
    ages_train, ages_test, net_worths_train, net_worths_test = train_test_split(ages, net_worths)

    return ages_train, ages_test, net_worths_train, net_worths_test

22. 線性回歸誤差

評(píng)估線性回歸：

可視化：將回歸的結(jié)果放在散點(diǎn)圖上
查看線性回歸產(chǎn)生的誤差
線性擬合會(huì)有關(guān)聯(lián)性的誤差
誤差=實(shí)際結(jié)果-輸出的預(yù)測(cè)結(jié)果

image.png

24. 誤差和擬合質(zhì)量

一個(gè)好的擬合可以將以下兩種誤差最小化：

所有誤差的絕對(duì)值的和
所有誤差的平方和

25. 最小化誤差平方和

當(dāng)執(zhí)行線性回歸時(shí)对供，要做的是最大程度地降低誤差平方和
這意味著，最佳回歸是最小化誤差平方和的回歸氛濒、
因此我們要做的是找到能夠使得誤差平方和最小的m和b

image.png

26. 最小化誤差平方和的算法

*ordinary least squares(OLS) 普通最小二乘法
used in sklearn LinearRegression
*gradient descent 梯度下降法

28. 最小化絕對(duì)誤差的問(wèn)題

there can be multiple lines that minimize Σ|error|,but only one line will minimize Σerror2
use sum of squared error also makes implementation much easier.
使用最小化誤差平方和時(shí)产场，更容易找到回歸線

29. 肉眼評(píng)估回歸

哪一個(gè)回歸能更好地?cái)M合數(shù)據(jù)集？

image.png

這兩條線基本上都恰當(dāng)?shù)貙?duì)圖形和數(shù)據(jù)進(jìn)行了描述

30.SSE的問(wèn)題

image.png

上面圖中的兩個(gè)線性回歸都很好地?cái)M合了數(shù)據(jù)舞竿，兩個(gè)擬合結(jié)果間沒有太大差異
但是右邊的擬合會(huì)產(chǎn)生更大的誤差平方和
通常來(lái)講京景，更大的誤差平方和意味著擬合得更差
因此這是誤差平方和的一個(gè)不足之處，因?yàn)樘砑拥臄?shù)據(jù)越多骗奖，誤差平方和幾乎必定會(huì)增加确徙，但并不代表擬合得不好，誤差平方和會(huì)因?yàn)樗褂玫臄?shù)據(jù)點(diǎn)的數(shù)量出現(xiàn)偏差重归，盡管擬合不存在太大問(wèn)題米愿，下面介紹評(píng)估回歸的另一個(gè)指標(biāo)r2.

31. 回歸的 R 平方指標(biāo)

r2 of a regression
描述線性回歸的擬合良好度
-- is a number that effectively ask the question:
how much of my change in the output(y) is explained by the change in my input(x)
-- 0.0＜r2 ＜1.0
-- the number is small,means that your regression line isn't doing a good job of capturing the trend in the data
--the number is large ,close to 1,means your regression line is doing a good job of describing the relationship between your input (x) and your output (y)
--優(yōu)點(diǎn)在于與訓(xùn)練集的數(shù)量無(wú)關(guān)，即不受數(shù)據(jù)集中數(shù)據(jù)數(shù)量的影響
--比誤差平方和更可靠一些鼻吮，尤其是在數(shù)據(jù)集中的數(shù)據(jù)數(shù)量可能會(huì)改變的情況下

32. sklearn中r2

如果我們能夠整合其他特征中的信息，就能更好地進(jìn)行預(yù)測(cè)较鼓，即獲得更高的r2

from sklearn import linear_model
reg = linear_model.LinearRegression()
reg.fit(ages_train,net_worths_train)
print "katie's net worth prediction:" , reg.predict([27])
print "r-squared score:",reg.score(ages_test,net_worths_test)
print "slope:",reg.coef_
print "intercept:",reg.intercept_

33.可視化回歸

plt.scatter(ages,net_worths)
plt.plot(ages,reg.predict(ages),color='blue',linewidth=3)
plt.xlabel('ages')
plt.ylabel('net_worths')
plt.show()

34.什么數(shù)據(jù)適用于線性回歸

可以使用線性回歸的意思是椎木，可以寫出y=mx+b,這個(gè)等式可以很好地描述數(shù)據(jù)中的趨勢(shì)

image.png

對(duì)于右下角的拋物線形狀，可以通過(guò)使用特征轉(zhuǎn)換擬合非線性關(guān)系博烂。例如香椎，添加平方 x 項(xiàng)作為功能會(huì)得到多項(xiàng)式回歸，這可以視為多元線性回歸的特殊情況禽篱，

35. 比較分類與回歸

image.png

輸出類型
監(jiān)督分類：類標(biāo)簽的形式是離散的
回歸：輸出是連續(xù)的
真正嘗試查找的東西：
監(jiān)督分類：
-- 決策邊界
-- 根據(jù)點(diǎn)相對(duì)于決策邊界的位置畜伐，可為其賦予一個(gè)類標(biāo)簽
-- 描述數(shù)據(jù)的邊界
回歸：
-- 最優(yōu)擬合線
-- 擬合數(shù)據(jù)的線條
評(píng)估
監(jiān)督分類：
-- 準(zhǔn)確率 accuracy

回歸：
-- 誤差平方和 sum of squared error
-- r的平方 r2

36.多元回歸/多變量回歸/multi-variable regression

有很多不同的輸入變量來(lái)預(yù)測(cè)輸出

image.png

38.回歸迷你項(xiàng)目簡(jiǎn)介

通過(guò)工資預(yù)測(cè)獎(jiǎng)金
異常值：在我們的模式以外很遠(yuǎn)的一個(gè)點(diǎn)
異常值如何影響通過(guò)回歸得到的結(jié)果

在此項(xiàng)目中，你將使用回歸來(lái)預(yù)測(cè)安然雇員和合伙人的財(cái)務(wù)數(shù)據(jù)躺率。一旦你知道某位雇員的財(cái)務(wù)數(shù)據(jù)玛界，比如工資，你是否會(huì)預(yù)測(cè)他們獎(jiǎng)金的數(shù)額悼吱？

40.獎(jiǎng)金目標(biāo)和特征

運(yùn)行在 regression/finance_regression.py 中找到的初始代碼慎框。這將繪制出一個(gè)散點(diǎn)圖，其中有所有的數(shù)據(jù)點(diǎn)后添。你嘗試預(yù)測(cè)什么目標(biāo)笨枯？用來(lái)預(yù)測(cè)目標(biāo)的輸入特征是什么？
在腦海中描繪出你大致預(yù)測(cè)的回歸線（如果打印散點(diǎn)圖并用紙筆來(lái)描繪，效果會(huì)更好）馅精。

image.png

input:salary
output:bonus

41. 可視化回歸數(shù)據(jù)

就像在分類中一樣严嗜，你需要在回歸中訓(xùn)練和測(cè)試數(shù)據(jù)。這在初始代碼中已被設(shè)定洲敢。將 test_color 的值從“b”改為“r”（針對(duì)“red”）漫玄，然后重新運(yùn)行。
注意：對(duì)于將 Python 2 代碼轉(zhuǎn)換至 Python 3 的學(xué)員沦疾，請(qǐng)參見以下關(guān)于兼容性的重要備注称近。
你將僅使用藍(lán)色（訓(xùn)練）點(diǎn)來(lái)擬合回歸。（你可能已經(jīng)注意到哮塞，我們放入測(cè)試集的是 50% 的數(shù)據(jù)而非標(biāo)準(zhǔn)的 10%—因?yàn)樵诘?5 部分中刨秆，我們將改變訓(xùn)練和測(cè)試數(shù)據(jù)集，并且平均分割數(shù)據(jù)使這種做法更加簡(jiǎn)單忆畅。）

從 Python 3.3 版本開始衡未，字典的鍵值順序有所改變，在每次代碼運(yùn)行時(shí)家凯，字典的鍵值皆為隨機(jī)排序。這會(huì)讓我們?cè)?Python 2.7 環(huán)境下工作的評(píng)分者遭遇一些兼容性的問(wèn)題绊诲。為了避免這個(gè)問(wèn)題送粱，請(qǐng)?jiān)?finance_regression.py 文件的第26行 featureFormat 調(diào)用時(shí)添加一個(gè)參數(shù)
sort_keys = '../tools/python2_lesson06_keys.pkl'
它會(huì)打開 tools 文件夾中帶有 Python 2 鍵值順序的數(shù)據(jù)文件。

42.提取斜率和截距

from sklearn import linear_model
reg = linear_model.LinearRegression()
reg.fit(feature_train,target_train)
print reg.coef_          #5.448
print reg.intercept_    #-102360.54

43. 回歸分?jǐn)?shù)：訓(xùn)練數(shù)據(jù)

假設(shè)你是一名悟性不太高的機(jī)器學(xué)習(xí)者掂之，你沒有在測(cè)試集上進(jìn)行測(cè)試抗俄，而是在你用來(lái)訓(xùn)練的相同數(shù)據(jù)上進(jìn)行了測(cè)試，并且用到的方法是將回歸預(yù)測(cè)值與訓(xùn)練數(shù)據(jù)中的目標(biāo)值（比如：獎(jiǎng)金）做對(duì)比世舰。
你找到的分?jǐn)?shù)是多少动雹？你可能對(duì)“良好”分?jǐn)?shù)還沒有概念；此分?jǐn)?shù)不是非常好（但卻非常糟糕）跟压。
···
print reg.score(feature_train,target_train) #0.0455
···

44. 回歸分?jǐn)?shù)：測(cè)試數(shù)據(jù)

現(xiàn)在胰蝠，在測(cè)試數(shù)據(jù)上計(jì)算回歸的分?jǐn)?shù)。
測(cè)試數(shù)據(jù)的分?jǐn)?shù)是多少震蒋？如果只是錯(cuò)誤地在訓(xùn)練數(shù)據(jù)上進(jìn)行評(píng)估茸塞，你是否會(huì)高估或低估回歸的性能？

print reg.score(feature_test,target_test)   #-1.48

45. 根據(jù) LTI 回歸獎(jiǎng)金

我們有許多可用的財(cái)務(wù)特征喷好，就預(yù)測(cè)個(gè)人獎(jiǎng)金而言翔横，其中一些特征可能比余下的特征更為強(qiáng)大。例如梗搅，假設(shè)你對(duì)數(shù)據(jù)做出了思考禾唁，并且推測(cè)出“l(fā)ong_term_incentive”特征（為公司長(zhǎng)期的健康發(fā)展做出貢獻(xiàn)的雇員應(yīng)該得到這份獎(jiǎng)勵(lì)）可能與獎(jiǎng)金而非工資的關(guān)系更密切效览。
證明你的假設(shè)是正確的一種方式是根據(jù)長(zhǎng)期激勵(lì)回歸獎(jiǎng)金，然后看看回歸是否顯著高于根據(jù)工資回歸獎(jiǎng)金荡短。根據(jù)長(zhǎng)期獎(jiǎng)勵(lì)回歸獎(jiǎng)金—測(cè)試數(shù)據(jù)的分?jǐn)?shù)是多少丐枉？

import sys
import pickle
sys.path.append("../tools/")
from feature_format import featureFormat, targetFeatureSplit
dictionary = pickle.load( open("../final_project/final_project_dataset_modified.pkl", "r") )

### list the features you want to look at--first item in the 
### list will be the "target" feature
features_list = ["bonus", "long_term_incentive"]
data = featureFormat( dictionary, features_list, remove_any_zeroes=True)
target, features = targetFeatureSplit( data )

### training-testing split needed in regression, just like classification
from sklearn.cross_validation import train_test_split
feature_train, feature_test, target_train, target_test = train_test_split(features, target, test_size=0.5, random_state=42)
train_color = "b"
test_color = "r"



### Your regression goes here!
### Please name it reg, so that the plotting code below picks it up and 
### plots it correctly. Don't forget to change the test_color above from "b" to
### "r" to differentiate training points from test points.
from sklearn import linear_model
reg = linear_model.LinearRegression()
reg.fit(feature_train,target_train)
print reg.coef_
print reg.intercept_
print reg.score(feature_test,target_test)    #-0.59

### draw the scatterplot, with color-coded training and testing points
import matplotlib.pyplot as plt
for feature, target in zip(feature_test, target_test):
    plt.scatter( feature, target, color=test_color ) 
for feature, target in zip(feature_train, target_train):
    plt.scatter( feature, target, color=train_color ) 

### labels for the legend
plt.scatter(feature_test[0], target_test[0], color=test_color, label="test")
plt.scatter(feature_test[0], target_test[0], color=train_color, label="train")

### draw the regression line, once it's coded
try:
    plt.plot( feature_test, reg.predict(feature_test) )
except NameError:
    pass
plt.xlabel(features_list[1])
plt.ylabel(features_list[0])
plt.legend()
plt.show()

46. 工資與預(yù)測(cè)獎(jiǎng)金的 LTI

如果你需要預(yù)測(cè)某人的獎(jiǎng)金，你是通過(guò)他們的工資還是長(zhǎng)期獎(jiǎng)金來(lái)進(jìn)行預(yù)測(cè)呢掘托？
long_term_incentive

47.異常值破壞回歸

這是下節(jié)課的內(nèi)容簡(jiǎn)介瘦锹，關(guān)于異常值的識(shí)別和刪除。返回至之前的一個(gè)設(shè)置闪盔，你在其中使用工資預(yù)測(cè)獎(jiǎng)金弯院，并且重新運(yùn)行代碼來(lái)回顧數(shù)據(jù)。你可能注意到泪掀，少量數(shù)據(jù)點(diǎn)落在了主趨勢(shì)之外听绳，即某人拿到高工資（超過(guò) 1 百萬(wàn)美元！）卻拿到相對(duì)較少的獎(jiǎng)金异赫。此為異常值的一個(gè)示例椅挣，我們將在下節(jié)課中重點(diǎn)講述它們。
類似的這種點(diǎn)可以對(duì)回歸造成很大的影響：如果它落在訓(xùn)練集內(nèi)塔拳，它可能顯著影響斜率/截距鼠证。如果它落在測(cè)試集內(nèi)，它可能比落在測(cè)試集外要使分?jǐn)?shù)低得多靠抑。就目前情況來(lái)看量九，此點(diǎn)落在測(cè)試集內(nèi)（而且最終很可能降低分?jǐn)?shù)）。讓我們做一些處理颂碧，看看它落在訓(xùn)練集內(nèi)會(huì)發(fā)生什么娩鹉。在 finance_regression.py 底部附近并且在 plt.xlabel(features_list[1]) 之前添加這兩行代碼：
reg.fit(feature_test, target_test)
plt.plot(feature_train, reg.predict(feature_train), color="b")
現(xiàn)在，我們將繪制兩條回歸線稚伍，一條在測(cè)試數(shù)據(jù)上擬合（有異常值），一條在訓(xùn)練數(shù)據(jù)上擬合（無(wú)異常值）戚宦。來(lái)看看現(xiàn)在的圖形个曙，有很大差別，對(duì)吧受楼？單一的異常值會(huì)引起很大的差異垦搬。
新的回歸線斜率是多少？
（你會(huì)發(fā)現(xiàn)差異很大艳汽，多數(shù)情況下由異常值引起猴贰。下一節(jié)課將詳細(xì)介紹異常值，這樣你就有工具來(lái)檢測(cè)和處理它們了河狐。）

reg.fit(feature_test, target_test)
print reg.coef_   #2.27
plt.plot(feature_train, reg.predict(feature_train), color="b")

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者

人面猴
序言：七十年代末米绕，一起剝皮案震驚了整個(gè)濱河市瑟捣，隨后出現(xiàn)的幾起案子，更是在濱河造成了極大的恐慌栅干，老刑警劉巖迈套，帶你破解...
沈念sama閱讀 217,734評(píng)論 6贊 505
死咒
序言：濱河連續(xù)發(fā)生了三起死亡事件，死亡現(xiàn)場(chǎng)離奇詭異碱鳞，居然都是意外死亡桑李，警方通過(guò)查閱死者的電腦和手機(jī)，發(fā)現(xiàn)死者居然都...
沈念sama閱讀 92,931評(píng)論 3贊 394
救了他兩次的神仙讓他今天三更去死
文/潘曉璐我一進(jìn)店門窿给，熙熙樓的掌柜王于貴愁眉苦臉地迎上來(lái)贵白，“玉大人，你說(shuō)我怎么就攤上這事崩泡〗模” “怎么了？”我有些...
開封第一講書人閱讀 164,133評(píng)論 0贊 354
道士緝兇錄：失蹤的賣姜人
文/不壞的土叔我叫張陵允华，是天一觀的道長(zhǎng)圈浇。經(jīng)常有香客問(wèn)我，道長(zhǎng)靴寂，這世上最難降的妖魔是什么磷蜀？我笑而不...
開封第一講書人閱讀 58,532評(píng)論 1贊 293
?港島之戀（遺憾婚禮）
正文為了忘掉前任，我火速辦了婚禮百炬，結(jié)果婚禮上褐隆，老公的妹妹穿的比我還像新娘。我一直安慰自己剖踊，他們只是感情好庶弃，可當(dāng)我...
茶點(diǎn)故事閱讀 67,585評(píng)論 6贊 392
惡毒庶女頂嫁案：這布局不是一般人想出來(lái)的
文/花漫我一把揭開白布。她就那樣靜靜地躺著德澈，像睡著了一般歇攻。火紅的嫁衣襯著肌膚如雪。梳的紋絲不亂的頭發(fā)上梆造，一...
開封第一講書人閱讀 51,462評(píng)論 1贊 302
城市分裂傳說(shuō)
那天缴守，我揣著相機(jī)與錄音，去河邊找鬼镇辉。笑死屡穗，一個(gè)胖子當(dāng)著我的面吹牛，可吹牛的內(nèi)容都是我干的忽肛。我是一名探鬼主播村砂，決...
沈念sama閱讀 40,262評(píng)論 3贊 418
雙鴛鴦連環(huán)套：你想象不到人心有多黑
文/蒼蘭香墨我猛地睜開眼，長(zhǎng)吁一口氣：“原來(lái)是場(chǎng)噩夢(mèng)啊……” “哼屹逛！你這毒婦竟也來(lái)了础废？” 一聲冷哼從身側(cè)響起汛骂，我...
開封第一講書人閱讀 39,153評(píng)論 0贊 276
萬(wàn)榮殺人案實(shí)錄
序言：老撾萬(wàn)榮一對(duì)情侶失蹤，失蹤者是張志新（化名）和其女友劉穎色迂，沒想到半個(gè)月后香缺，有當(dāng)?shù)厝嗽跇淞掷锇l(fā)現(xiàn)了一具尸體，經(jīng)...
沈念sama閱讀 45,587評(píng)論 1贊 314
?護(hù)林員之死
正文獨(dú)居荒郊野嶺守林人離奇死亡歇僧，尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛以下內(nèi)容為張勛視角年9月15日...
茶點(diǎn)故事閱讀 37,792評(píng)論 3贊 336
?白月光啟示錄
正文我和宋清朗相戀三年图张，在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了。大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片诈悍。...
茶點(diǎn)故事閱讀 39,919評(píng)論 1贊 348
活死人
序言：一個(gè)原本活蹦亂跳的男人離奇死亡祸轮，死狀恐怖，靈堂內(nèi)的尸體忽然破棺而出侥钳，到底是詐尸還是另有隱情适袜，我是刑警寧澤，帶...
沈念sama閱讀 35,635評(píng)論 5贊 345
?日本核電站爆炸內(nèi)幕
正文年R本政府宣布舷夺，位于F島的核電站苦酱，受9級(jí)特大地震影響，放射性物質(zhì)發(fā)生泄漏给猾。R本人自食惡果不足惜疫萤，卻給世界環(huán)境...
茶點(diǎn)故事閱讀 41,237評(píng)論 3贊 329
男人毒藥：我在死后第九天來(lái)索命
文/蒙蒙一、第九天我趴在偏房一處隱蔽的房頂上張望敢伸。院中可真熱鬧扯饶，春花似錦、人聲如沸池颈。這莊子的主人今日做“春日...
開封第一講書人閱讀 31,855評(píng)論 0贊 22
一樁弒父案，背后竟有這般陰謀
文/蒼蘭香墨我抬頭看了看天上的太陽(yáng)躯砰。三九已至每币，卻和暖如春，著一層夾襖步出監(jiān)牢的瞬間琢歇，已是汗流浹背脯爪。一陣腳步聲響...
開封第一講書人閱讀 32,983評(píng)論 1贊 269
情欲美人皮
我被黑心中介騙來(lái)泰國(guó)打工，沒想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留矿微，地道東北人。一個(gè)月前我還...
沈念sama閱讀 48,048評(píng)論 3贊 370
代替公主和親
正文我出身青樓尚揣，卻偏偏與公主長(zhǎng)得像涌矢，于是被迫代替她去往敵國(guó)和親。傳聞我的和親對(duì)象是個(gè)殘疾皇子快骗，可洞房花燭夜當(dāng)晚...
茶點(diǎn)故事閱讀 44,864評(píng)論 2贊 354

回歸

1.連續(xù)輸出

2. 連續(xù)

3. 年齡：連續(xù)還是離散卧晓？

4. 天氣：連續(xù)還是離散昂勉？

7. 收入：連續(xù)還是離散？

8.連續(xù)特征

9.斜率和截距(回歸線性方程)

17. 線性回歸編碼

18. sklearn中的年齡/凈值回歸

19. 通過(guò)sklearn提取信息

20. 通過(guò) sklearn 提取分?jǐn)?shù)數(shù)據(jù)

21. 現(xiàn)在你練習(xí)提取信息

22. 線性回歸誤差

24. 誤差和擬合質(zhì)量

25. 最小化誤差平方和

26. 最小化誤差平方和的算法

28. 最小化絕對(duì)誤差的問(wèn)題

29. 肉眼評(píng)估回歸

30.SSE的問(wèn)題

31. 回歸的 R 平方指標(biāo)

32. sklearn中r2

33.可視化回歸

34.什么數(shù)據(jù)適用于線性回歸

35. 比較分類與回歸

36.多元回歸/多變量回歸/multi-variable regression

38.回歸迷你項(xiàng)目簡(jiǎn)介

40.獎(jiǎng)金目標(biāo)和特征

41. 可視化回歸數(shù)據(jù)

42.提取斜率和截距

43. 回歸分?jǐn)?shù)：訓(xùn)練數(shù)據(jù)

44. 回歸分?jǐn)?shù)：測(cè)試數(shù)據(jù)

45. 根據(jù) LTI 回歸獎(jiǎng)金

46. 工資與預(yù)測(cè)獎(jiǎng)金的 LTI

47.異常值破壞回歸

推薦閱讀更多精彩內(nèi)容