Matlab数组中唯一元素的累积计数



使用Matlab 2019b。

x = [10 10 10 20 20 30]';

如何获得x中唯一元素的累积计数,它应该看起来像:

y = [1 2 3 1 2 1]';

编辑

我的实际数组实际上比上面给出的例子长得多。以下是我测试的方法:

x = randi([1 100], 100000, 1);
x = sort(x);
% method 1: check neighboring values in one loop
tic
y = ones(size(x));
for ii = 2:length(x)
if x(ii) == x(ii-1)
y(ii) = y(ii-1) + 1;
end
end
toc
% method 2 (Wolfie): count occurrence of unique values explicitly
tic
u = unique(x);
y = zeros(size(x));
for ii = 1:numel(u)
idx = (x == u(ii));
y(idx) = 1:nnz(idx);
end
toc
% method 3 (Luis Mendo): triangular matrix
tic
y = sum(triu(x==x'))';
toc

结果:

Method 1: Elapsed time is 0.016847 seconds.
Method 2: Elapsed time is 0.037124 seconds.
Method 3: Elapsed time is 10.350002 seconds.

编辑
假设x已排序:

x = [10 10 10 20 20 30].';
x = sort(x);
d = [1 ;diff(x)];
f = find(d);
d(f) = f;
ic = cummax(d);
y = (2 : numel(x) + 1).' - ic;

x未排序时,使用此:

[s, is] = sort(x);
d = [1 ;diff(s)];
f = find(d);
d(f) = f;
ic = cummax(d);
y(is) = (2 : numel(s) + 1).' - ic;

仅适用于GNU Octave的原始答案:
假设x已排序:

x = [10 10 10 20 20 30].';
x = sort(x);
[~, ic] = cummax(x);
y = (2 : numel(x) + 1).' - ic;

x未排序时,使用此:

[s, is] = sort(x);
[~, ic] = cummax(s);
y(is) = (2 : numel(s) + 1).' - ic;

您可以循环遍历唯一元素,并每次将其索引设置为1:n。。。

u = unique(x);
y = zeros(size(x));
for ii = 1:numel(u)
idx = (x == u(ii));
y(idx) = 1:nnz(idx);
end

这有点低效,因为它生成了一个中间矩阵,而实际上只需要一个三角形的一半:

y = sum(triu(x==x.')).';

这里有一个不适用于循环的版本。在我的机器上,它比以前的工作方法快一点:

% if already sorted, can omit this first and last line
[s, is] = sort(x);
[u,~,iu] = unique(s);
c = accumarray(iu,1);
cs = cumsum([0;c]);
z = (1:numel(x))'-repelem(cs(1:end-1),c);
y(is) = z;

最新更新