如何在MATLAB中一次性使用同一代码中的两个数据集



我有一个预先指定的模型,它运行平稳。问题是,对于同一个模型,我有两个数据集。这两个数据集仅根据行而变化:例如,数据集1从1变为60,数据集2从61变为115。所有的变量都是相同的。我希望避免两次计算相同的代码。我宁愿把它写得很好,一次计算出来。

我将用我的模型给你一个示例数据集:

data = rand(115,5)
Y_data = data(1:60, :) % dataset 1
Y_data = data(61:115, :) % dataset 2
% This is the model that runs nicely on dataset Y_data. I wanted to avoid to run the model twice,
% first with Y_data from row 1 to 60 and then from row to 61 to 100. I would like to do it in one shot
% the code for the model is fully automated so it's just a matter of making it work first on dataset 1 and then
% on dataset 2 in one unique code
T = size(Y_data,1);
P  = 3; % number of lags used in LP for controls
H_min = 1; 
H_max = 25; 
y  = Y_data(:,1); % endogenous variable
x  = Y_data(:,2); % shock 
w  = lagmatrix(Y_data(:,[3:5]), 1:P ); 
newData = cat(2, y, x, w)
% Remove missings from data
newData(any(isnan(newData), 2), :) = [];
% Re-declare variables after removing missings
y  = newData(:,1); % endogenous variable
x  = newData(:,2); % shock
w = newData(:,3:size(newData,2)); % control variables and lags
r = 3; 
lambda = 10000; 
slp    = locproj(y,x,w,H_min,H_max,'smooth',r,lambda); 
%% Cross-Validation Choice of Lambda
slp = locproj(y,x,w,H_min,H_max,'smooth',r,0.01);
lambda = [1:10:1000] * T;
slp    = locproj_cv(slp,5,lambda);
lambda_opt = lambda( min( slp.rss ) == slp.rss );
%% Confidence Intervals
r      = 3;
slp    = locproj(y,x,w,H_min,H_max,'smooth',r,lambda_opt); 
slp    = locproj_conf(slp,H_max,lambda_opt/2);

我认为它可以解决这个问题的是使用if/else,比如:


% This is wrong but it gives you an idea of what I was trying to do and get
% trying to tell MATLAB, fun the code first from dataset 1 (row 1:60) and then the same on dataset 2 (from row 61:115)
k = 1:60
if  k == 1
Y_data = Y_data;

else
Y_data = data(61:115, :);
end
% model code as above just here - not to make it too long
% the output therefore should save both results for dataset1 and dataset2

我被卡住了,没能继续前进。有人能帮我吗?这将是我的一天。

非常感谢!

有很多方法可以实现这一点。这里有两个简单的。

制作一个函数dataAnalysis(Y_data)并简单地执行

[slp,lambdaOpt] = dataAnalysis(data(1:60,:))
[slp2,lambdaOpt2] = dataAnalysis(data(61:115,:))

你的功能可能看起来像

function [slp,lambdaOpt] = dataAnalysis(data)
% This is the model that runs nicely on dataset Y_data. I wanted to avoid to run the model twice,
% first with Y_data from row 1 to 60 and then from row to 61 to 100. I would like to do it in one shot
% the code for the model is fully automated so it's just a matter of making it work first on dataset 1 and then
% on dataset 2 in one unique code
T = size(Y_data,1);
P  = 3; % number of lags used in LP for controls
H_min = 1; 
H_max = 25; 
% etc. etc...
% Add in whatever output variables are important

或者,使用你尝试过的循环:

indices = {1:60, 61:115};
for k = 1:2
Y_Data = data(indices{k},:);
% your model code operating on Y_data
% store any results here as either variable(k) (scalar data)
% or variable {k} (non-scalar data)
end

相关内容

  • 没有找到相关文章