所以我有一个函数,它可以抓取instagram数据并将点赞数存储到字典中。我认为我已经正确设置了函数,但当我尝试运行print语句时,我不断收到一个错误,即它找不到我要运行该函数的文件。
这是我用来装载谷歌驱动器并设置目录以存储json文件的代码。我在驱动器中验证了这一切都是正确的。
# Mount data location. Do not edit this cell except as indicated.
# Be sure you have a folder in the root of your Google Drive called APRD6342/Data.
# Data files for the course should be uploaded to that folder.
from pathlib import Path
try:
from google.colab import drive
drive.mount('/content/drive')
datadir = Path('drive/My Drive/APRD6342/Data') # you may edit this location ...
except ModuleNotFoundError:
datadir = Path('../../Data') # ... but don't change this!!!
INSTAGRAM_DIR = datadir / 'instagram'
这是我的功能:
def engagement(filename, follower_count):
"""Return the brand post engagement for the Instagram metadata file,
filename, given follower_count as the number of Instagram followers for
the brand.
Returns a decimal engagement rate rounded to 4 decimal places. Python's
standard `round` function should be used. E.g.:
>>> engagement('instagram/volvocars.json', volvocars_follower_count)
0.0125
"""
with open(datadir / filename) as f:
for line in f: #for each line in the file
line_object = json.loads(line)
likes = line_object["GraphImages"]["edge_media_preview_like"]["count"] #find count
if num in likes > 1:
engagement.append(i) #add count to engagement
comments = filename["GraphImages"]["edge_media_to_comment"]["count"] #find other count
if num in comments > 1:
engagement.append(i) #add count
engage_perc = sum(engagement)/follower_count #sum all counts and divide by the number of followers
return engage_perc
这是我试图运行的打印语句和错误:
print('Honda:',
as_percent(engagement(INSTAGRAM_DIR / 'honda.json', honda_follower_count)))
---> 12 with open(datadir / filename) as f:
13 for line in f: #for each line in the file
14 line_object = json.loads(line)
FileNotFoundError: [Errno 2] No such file or directory: 'drive/My Drive/APRD6342/Data/drive/My Drive/APRD6342/Data/instagram/honda.json'
我以前运行过装载代码,但由于某种原因,我的目录正在嵌套自己或其他什么。谢谢
使用调用函数时
print('Honda:',
as_percent(engagement(INSTAGRAM_DIR / 'honda.json', honda_follower_count)))
你提供了完整的路径,但在函数中,该路径被再次添加
with open(datadir / filename) as f:
只传递子文件夹instagram
和文件名。注意文档字符串-他们提供'instagram/volvocars.json'
,所以您需要提供'instagram/honda.json'
因此能够找到一种方法,不必使用jsonl文件,而只使用json。混淆了一些语法。
def engagement(filename, follower_count):
"""Return the brand post engagement for the Instagram metadata file,
filename, given follower_count as the number of Instagram followers for
the brand.
Returns a decimal engagement rate rounded to 4 decimal places. Python's
standard `round` function should be used. E.g.:
>>> engagement('instagram/volvocars.json', volvocars_follower_count)
0.0125
"""
engagement = [] #set empty list for like counts
data = filename #store filename to variable
with open(data) as f: #save open file as outfile
parse = json.load(f)
for n in range(0,200):
likes = parse["GraphImages"][n]["edge_media_preview_like"]["count"] #find count
if likes > 1:
engagement.append(likes) #add count to engagement
comments = parse["GraphImages"][n]["edge_media_to_comment"]["count"] #find other count
if comments > 1:
engagement.append(comments) #add count
engage_perc = (sum(engagement)/len(range(0,200)))/follower_count #sum all counts and divide by the number of followers
return round(engage_perc,4)