Sorting an Awk array by last name



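In my script I start with a file of campaign contributors, and anyone who donated more than $500 is eligible for a contest. I add each person who meets that condition to an array with an incrementing index, so it grows as needed. Each entry has the format shown below, where X stands for a phone number. In the END section of the script I need to sort this array by last name ($2) for printing. I have done some searching but found nothing useful. I am not asking anyone to write the script for me, only to point me toward better search terms or give advice. I need help sorting the contestants array, since at the moment it is populated with the correct string values in the way I need.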

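Here v1, v2, and v3 are the campaign contributions, and I use -F'[ :]' on the command line so that both spaces and colons act as field separators.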
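To make the field numbering concrete, here is a minimal sketch that runs one line of lab4.data through awk with that separator. The shell function name show_fields is only an illustrative assumption and is not part of the original script.

```shell
# Hypothetical helper: show how -F'[ :]' splits one record of lab4.data.
show_fields() {
    printf '%s\n' 'Mike Harrington:(510) 548-1278:250:100:175' |
    awk -F'[ :]' '{
        print NF             # number of fields: 7
        print $2             # last name: Harrington
        print $5 + $6 + $7   # total donated: 525
    }'
}
show_fields
```

With this separator the area code and the rest of the phone number land in $3 and $4, which is why the three contributions are $5, $6, and $7.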

Input file lab4.data

Fname Lname:Phone__Number:v1:v2:v3   
Mike Harrington:(510) 548-1278:250:100:175 
Christian Dobbins:(408) 538-2358:155:90:201 
Susan Dalsass:(206) 654-6279:250:60:50 
Archie McNichol:(206) 548-1348:250:100:175 
Jody Savage:(206) 548-1278:15:188:150 
Guy Quigley:(916) 343-6410:250:100:175 
Dan Savage:(406) 298-7744:450:300:275 
Nancy McNeil:(206) 548-1278:250:80:75 
John Goldenrod:(916) 348-4278:250:100:175 
Chet Main:(510) 548-5258:50:95:135   
Tom Savage:(408) 926-3456:250:168:200  
Elizabeth Stachelin:(916) 440-1763:175:75:300 

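The array holds anyone who donated more than $500. $8 is created to hold the value of $5+$6+$7. The array is initialized and populated in the loop given below.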

$8 = $5+$6+$7;
contestants[len++]

Loop that checks whether to add a person to the contestants array. name and number are arrays that hold the respective values for later use.

for(i=0;i<=NR;i++)if(contrib[i]>500){contestants[len++]= name[i]"   "number[i] }

Index format (the array values expected for contestants[len++]):

[0]   Mike Harrington (510) 548-1278
[1]   Archie McNichol (206) 548-1348 
[2]   Guy Quigley (916) 343-6410
[3]   Dan Savage (406) 298-7744
[4]   John Goldenrod (916) 348-4278
[5]   Tom Savage (408) 926-3456
[6]   Elizabeth Stachelin (916) 440-1763

Loop to print and check that the array was populated correctly (it was)

for (i=0; i <len; i++) {print contestants[i]}

Output:

Mike Harrington (510) 548-1278
Archie McNichol (206) 548-1348
Guy Quigley (916) 343-6410
Dan Savage (406) 298-7744
John Goldenrod (916) 348-4278
Tom Savage (408) 926-3456
Elizabeth Stachelin (916) 440-1763

Desired final output: ignore the formatting, since it displays correctly in my terminal. I just had trouble getting it all to line up here.

               ***FIRST QUARTERLY REPORT***                          
            ***CAMPAIGN 2004 CONTRIBUTIONS***                       
   Name            Phone             Jan  |  Feb  |  Mar  |  Total Donated 
Mike Harrington     (510)548-1278   $ 250   $ 100   $ 175   $ 525
Christian Dobbins   (408)538-2358   $ 155   $ 90    $ 201   $ 446
Susan Dalsass       (206)654-6279   $ 250   $ 60    $ 50    $ 360
Archie McNichol     (206)548-1348   $ 250   $ 100   $ 175   $ 525
Jody Savage         (206)548-1278   $ 15    $ 188   $ 150   $ 353
Guy Quigley         (916)343-6410   $ 250   $ 100   $ 175   $ 525
Dan Savage          (406)298-7744   $ 450   $ 300   $ 275   $ 1025
Nancy McNeil        (206)548-1278   $ 250   $ 80    $ 75    $ 405
John Goldenrod      (916)348-4278   $ 250   $ 100   $ 175   $ 525
Chet Main           (510)548-5258   $ 50    $ 95    $ 135   $ 280
Tom Savage          (408)926-3456   $ 250   $ 168   $ 200   $ 618
Elizabeth Stachelin (916)440-1763   $ 175   $ 75    $ 300   $ 550
-----------------------------------------------------------------------------
SUMMARY
-----------------------------------------------------------------------------
The campaign received a total of $6137.00 for this quarter.
The average donation for the 12 contributors was $511.42.
The highest total contribution was $1025.00 made by Dan Savage.
                ***Thank you Dan Savage***                           
The following people donated over $500 to the campaign.
They are eligible for the quarterly drawing!!
Listed are their names(sorted by last names) and phone numbers.
John Goldenrod (916) 348-4278
Mike Harrington (510) 548-1278
Archie McNichol (206) 548-1348
Guy Quigley (916) 343-6410
Dan Savage (406) 298-7744
Tom Savage (408) 926-3456
Elizabeth Stachelin (916) 440-1763
Thank you all for your continued support!!

With gawk, you can use the built-in sort functions directly, for example

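BEGIN {
    data["Jane Doe (123) 456-7890"] = 600;
    data["Fred Adams (123) 456-7891"] = 800;
    data["John Smith (123) 456-7892"] = 900;
    exit;
    }
END {
    for (i in data) {
        split(i,x," ")
        # build a key with the last name first so asorti() orders by surname
        data1[x[2] " " x[1] " " x[3] " " x[4]] = i;
        }
    n = asorti(data1,sdata1);
    # walk the sorted keys by number; the order of "for (i in ...)" is unspecified
    for (i = 1; i <= n; i++) {
        print data1[sdata1[i]],"\t",data[data1[sdata1[i]]];
    }
}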

which produces:

Fred Adams (123) 456-7891        800
Jane Doe (123) 456-7890          600
John Smith (123) 456-7892        900

In plain awk, you can get the same result by writing the array indices to a file, sorting that file, and then reading it back with getline.
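A minimal sketch of that approach, assuming a POSIX awk and the external sort command are available. The function name sort_contestants, the temporary file from mktemp, and the three hard-coded entries are illustrative assumptions. In the real script the contestants array would already have been filled while reading lab4.data, and this sketch writes the array values rather than the indices because that is where the question stores the strings.

```shell
# Hypothetical helper: sort "First Last Phone" strings by last name
# using plain awk, a temporary file, sort(1), and getline.
sort_contestants() {
    tmp=$(mktemp) || return 1
    awk -v tmp="$tmp" '
    BEGIN {
        # Sample entries copied from the question (assumption: the real
        # script fills this array in its main rule instead).
        contestants[0] = "Mike Harrington (510) 548-1278"
        contestants[1] = "Dan Savage (406) 298-7744"
        contestants[2] = "John Goldenrod (916) 348-4278"
        len = 3

        # Step 1: write every entry to the temporary file.
        for (i = 0; i < len; i++)
            print contestants[i] > tmp
        close(tmp)    # flush and close before sort reads the file

        # Step 2: sort on the second field (the last name) and read
        # the result back one line at a time with getline.
        cmd = "sort -k2,2 " tmp
        n = 0
        while ((cmd | getline line) > 0)
            sorted[n++] = line
        close(cmd)

        # Step 3: print the entries in sorted order.
        for (i = 0; i < n; i++)
            print sorted[i]
    }'
    rm -f "$tmp"
}
sort_contestants
# prints:
# John Goldenrod (916) 348-4278
# Mike Harrington (510) 548-1278
# Dan Savage (406) 298-7744
```

If the sorted list only needs to be printed and not stored, piping the print statements straight into "sort -k2,2" and closing that pipe avoids the temporary file altogether.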

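The way to do this is to produce the pre-SUMMARY output while you read the data. That way you do not need to store all of the data in an array, only the people who donated more than $500, and you can insert each of them into the array in the required order using an insertion sort.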

You could do it like this:

awk -F':' '
NR==1 {
    print "header stuff"
    next
}
{
    tot = $3 + $4 + $5
    printf "%-20s%10s $%5s $%5s $%5s $%5s\n", $1, $2, $3, $4, $5, tot
}
tot > 500 {
    split($1,name,/ /)
    surname = name[2]
    numContribs++
    # insertion sort: find the first slot whose surname sorts after the
    # new one (or the new last slot), shift the tail up by one, insert
    for (i=1; i<=numContribs; i++) {
        if (i == numContribs || surname < surnames[i]) {
            for (j=numContribs; j>i; j--) {
                surnames[j] = surnames[j-1]
                contribs[j] = contribs[j-1]
            }
            surnames[i] = surname
            contribs[i] = $1 " " $2
            break
        }
    }
}
END {
    print "SUMMARY and text below it and then the list of $500+ contributors:"
    for (i=1; i<=numContribs; i++) {
        print contribs[i]
    }
}
' lab4.data

The above is not a fully functional program. It is only meant to show you the right approach for what you asked.
