Linear-Regression - Shuai-Xie - GitHub
Multivariate linear regression: predicting house prices
ex1data2.txt (house size, number of bedrooms, house price)
2104,3,399900
1600,3,329900
2400,3,369000
1416,2,232000
3000,4,539900
1985,4,299900
……
Part 1: Feature Normalization
When features span very different ranges, feature scaling brings them to similar scales so that gradient descent converges faster. In this example, house size (in square feet) is roughly 1,000 times the number of bedrooms.
%% ================ Part 1: Feature Normalization ================
fprintf('Loading data ...\n');
data = load('ex1data2.txt');
X = data(:, 1:2); % the 2 house features (m*2)
y = data(:, 3); % price (m*1)
m = length(y); % number of training examples
% print the first 10 examples
fprintf('First 10 examples from the dataset: \n');
% X holds the 2 features, y the price
% fprintf walks a matrix column by column, so transpose first so that each column corresponds to one example
fprintf(' x = [%.0f %.0f], y = %.0f \n', [X(1:10,:) y(1:10,:)]');
% fprintf(' x = [%.0f %.0f], y = %.0f \n', data(1:10, :)'); % also works
fprintf('Program paused. Press enter to continue.\n');
pause;
% feature scaling
fprintf('Normalizing Features ...\n');
% X is the scaled feature matrix; mu is the row vector of column means, sigma of column standard deviations
[X, mu, sigma] = featureNormalize(X);
X = [ones(m, 1), X]; % prepend a column of ones (the intercept term for theta0)
% print the scaled matrix
fprintf(' x = [%.6f %.6f %.6f], y = %.0f \n', [X(1:10,:) y(1:10,:)]');
fprintf('Program paused. Press enter to continue.\n');
pause;
Loading data ...
First 10 examples from the dataset:
x = [2104 3], y = 399900
x = [1600 3], y = 329900
x = [2400 3], y = 369000
x = [1416 2], y = 232000
x = [3000 4], y = 539900
x = [1985 4], y = 299900
x = [1534 3], y = 314900
x = [1427 3], y = 198999
x = [1380 3], y = 212000
x = [1494 3], y = 242500
Program paused. Press enter to continue.
Normalizing Features ...
x = [1.000000 0.130010 -0.223675], y = 399900
x = [1.000000 -0.504190 -0.223675], y = 329900
x = [1.000000 0.502476 -0.223675], y = 369000
x = [1.000000 -0.735723 -1.537767], y = 232000
x = [1.000000 1.257476 1.090417], y = 539900
x = [1.000000 -0.019732 1.090417], y = 299900
x = [1.000000 -0.587240 -0.223675], y = 314900
x = [1.000000 -0.721881 -0.223675], y = 198999
x = [1.000000 -0.781023 -0.223675], y = 212000
x = [1.000000 -0.637573 -0.223675], y = 242500
Program paused. Press enter to continue.
The featureNormalize function
function [X_norm, mu, sigma] = featureNormalize(X)
% Normalize each feature of X to zero mean and unit standard deviation (z-score)
% initialize
X_norm = X; % X (47*2)
% use MATLAB's built-in mean and std
mu = mean(X); % mu (1*2), a row vector containing the mean value of each column
sigma = std(X); % sigma (1*2), the standard deviation of each column
% size(X, 1) is the number of rows, i.e. examples = 47
% size(X, 2) is the number of columns, i.e. features = 2
m = size(X, 1);
% scale both features of X
for i = 1:m
X_norm(i, 1) = ( X_norm(i, 1) - mu(1) ) / sigma(1);
X_norm(i, 2) = ( X_norm(i, 2) - mu(2) ) / sigma(2);
end
end
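As a cross-check, the same z-score scaling can be sketched in NumPy (the Python names and the five-row sample are ours, not part of the exercise code); note `ddof=1`, which matches Octave's `std` (division by m-1):

```python
import numpy as np

def feature_normalize(X):
    """Z-score each column: subtract the column mean, divide by the column std."""
    mu = X.mean(axis=0)
    sigma = X.std(axis=0, ddof=1)  # ddof=1 matches Octave's sample std (m-1)
    return (X - mu) / sigma, mu, sigma

# First few rows of ex1data2.txt: [size, bedrooms]
X = np.array([[2104., 3.], [1600., 3.], [2400., 3.], [1416., 2.], [3000., 4.]])
X_norm, mu, sigma = feature_normalize(X)
# Each normalized column now has mean 0 and sample standard deviation 1.
```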
Part 2: Gradient Descent
%% ================ Part 2: Gradient Descent ================
fprintf('Running gradient descent ...\n');
% Choose some alpha values (3 learning rates)
alpha = 0.01; % learning rate
alpha1 = 0.03;
alpha2 = 0.1;
num_iters = 400; % number of iterations
% Init Theta and Run Gradient Descent
theta = zeros(3, 1); % 3 parameters: intercept + 2 features
theta1 = zeros(3, 1);
theta2 = zeros(3, 1);
[theta, J_history] = gradientDescentMulti(X, y, theta, alpha, num_iters); % multivariate gradient descent
[theta1, J_history1] = gradientDescentMulti(X, y, theta1, alpha1, num_iters);
[theta2, J_history2] = gradientDescentMulti(X, y, theta2, alpha2, num_iters);
% Plot the convergence graph
figure;
plot(1:numel(J_history), J_history, '-b', 'LineWidth', 2);
hold on; % keep plotting on the same axes
plot(1:numel(J_history1), J_history1, '-r', 'LineWidth', 2);
plot(1:numel(J_history2), J_history2, '-k', 'LineWidth', 2);
% numel returns the number of elements, used as the iteration count for the x-axis
xlabel('Number of iterations');
ylabel('Cost J');
legend('alpha = 0.01', 'alpha = 0.03', 'alpha = 0.1'); % legend for the three learning rates
% display the theta values
fprintf('Theta computed from gradient descent: \n');
fprintf(' %f \n', theta); % 3*1
% Estimate the price of a 1650 sq-ft, 3 br house
% x1 is the intercept term (1) and needs no scaling; scale x2 and x3 with the stored mu and sigma
price = [1, (1650 - mu(1,1)) / sigma(1,1), (3 - mu(1,2)) / sigma(1,2)] * theta;
fprintf(['Predicted price of a 1650 sq-ft, 3 br house ' ...
'(using gradient descent): $%f\n'], price);
fprintf('Program paused. Press enter to continue.\n');
pause;
Running gradient descent ...
Theta computed from gradient descent:
334302.063993
100087.116006
3673.548451
Predicted price of a 1650 sq-ft, 3 br house (using gradient descent): $289314.620338
Program paused. Press enter to continue.
The gradientDescentMulti function
function [theta, J_history] = gradientDescentMulti(X, y, theta, alpha, num_iters)
% Initialize some useful values
J_history = zeros(num_iters, 1);
m = length(y);
% iterate
for iter = 1:num_iters
tmp = (X' * ( X * theta - y )); % vectorized summation term: (3*m)*(m*1) = (3*1)
theta = theta - alpha / m * tmp;
% Save the cost J in every iteration
J_history(iter) = computeCostMulti(X, y, theta); % compute J for the updated theta
end
end
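The same vectorized update can be mirrored in NumPy as a sanity check (function name and the tiny toy dataset are ours); with a suitable alpha, the cost should decrease monotonically and theta should approach the true line y = 1 + 2x:

```python
import numpy as np

def gradient_descent_multi(X, y, theta, alpha, num_iters):
    """Vectorized batch gradient descent for linear regression."""
    m = len(y)
    J_history = np.zeros(num_iters)
    for it in range(num_iters):
        grad = X.T @ (X @ theta - y)        # summation term: (n,m)@(m,) -> (n,)
        theta = theta - (alpha / m) * grad  # simultaneous update of all parameters
        err = X @ theta - y
        J_history[it] = err @ err / (2 * m) # cost after the update
    return theta, J_history

# Toy data that lies exactly on y = 1 + 2*x (first column is the intercept)
X = np.array([[1., 0.], [1., 1.], [1., 2.], [1., 3.]])
y = np.array([1., 3., 5., 7.])
theta, J = gradient_descent_multi(X, y, np.zeros(2), alpha=0.1, num_iters=500)
# theta is now close to [1, 2]
```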
The computeCostMulti function
function J = computeCostMulti(X, y, theta)
% Initialize some useful values
m = length(y); % number of training examples
E = X * theta - y; % error vector (m*1)
J = (E' * E) / (2 * m); % E'*E is the sum of squared errors
end
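The cost is easy to hand-check; a NumPy sketch (names are ours): with X = [[1,0],[1,1]], y = [1,2] and theta = 0, the errors are [-1, -2], so J = (1 + 4) / (2*2) = 1.25:

```python
import numpy as np

def compute_cost_multi(X, y, theta):
    E = X @ theta - y            # error vector (m,)
    return E @ E / (2 * len(y))  # (E'E) / (2m)

X = np.array([[1., 0.], [1., 1.]])
y = np.array([1., 2.])
print(compute_cost_multi(X, y, np.zeros(2)))  # -> 1.25
```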
Part 3: Normal Equations
Solving for θ with the normal equation
%% ================ Part 3: Normal Equations ================
fprintf('Solving with normal equations...\n');
% Load Data
data = csvread('ex1data2.txt'); % load would also work
X = data(:, 1:2);
y = data(:, 3);
m = length(y);
% Add intercept term to X
X = [ones(m, 1) X];
% Calculate the parameters from the normal equation
theta = normalEqn(X, y);
% Display normal equation result
fprintf('Theta computed from the normal equations: \n');
fprintf(' %f \n', theta);
% Estimate the price of a 1650 sq-ft, 3 br house
% no feature scaling needed here
price = [1, 1650, 3] * theta;
fprintf(['Predicted price of a 1650 sq-ft, 3 br house ' ...
'(using normal equations): $%f\n'], price);
The prediction differs slightly from the gradient-descent result: the normal equation yields the exact least-squares solution, while gradient descent only approaches it iteratively.
Solving with normal equations...
Theta computed from the normal equations:
89597.909544
139.210674
-8738.019113
Predicted price of a 1650 sq-ft, 3 br house (using normal equations): $293081.464335
The normalEqn function
function [theta] = normalEqn(X, y)
theta = pinv(X' * X) * X' * y; % closed-form normal equation
end
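The exactness claim above can be checked with a small NumPy sketch (names and toy data are ours): on data that lies exactly on y = 1 + 2x, the normal equation recovers the true parameters directly, with no iteration or feature scaling:

```python
import numpy as np

def normal_eqn(X, y):
    """Closed-form least squares: theta = pinv(X'X) X' y."""
    return np.linalg.pinv(X.T @ X) @ X.T @ y

# Tiny exactly-linear dataset: y = 1 + 2*x (first column is the intercept)
X = np.array([[1., 0.], [1., 1.], [1., 2.], [1., 3.]])
y = np.array([1., 3., 5., 7.])
theta = normal_eqn(X, y)
# theta is [1, 2] up to floating-point rounding
```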