Skip to content

DropsFix的ExtractImages函数提取图片,文件,视频等资源不全的bug #1

@coffeehb

Description

@coffeehb

下面的代码似乎没有将所有静态资源的url提取出来,暂时不知道怎么处理比较好。

`def ExtractImages(self,webContent):
extList = ['m4u','m3u','mid','wma','flv','3g2','mkv','3gp','mp4','mov','avi','asf','mpeg','vob','mpg','wmv','fla','swf','wav','mp3','qcow2','vdi','vmdk','vmx','gpg','aes','ARC','PAQ','tar.bz2','tbk','bak','tar','tgz','gz','7z','rar','zip','djv','djvu','svg','bmp','png','gif','raw','cgm','jpeg','jpg','tif','tiff','NEF','psd','cmd','bat','sh','class','jar','java','rb','asp','cs','brd','sch','dch','dip','pl','vbs','vb','js','asm','pas','cpp','php','ldf','mdf','ibd','MYI','MYD','frm','odb','dbf','db','mdb','sql','SQLITEDB','SQLITE3','asc','lay6','lay','ms11','sldm','sldx','ppsm','ppsx','ppam','docb','mml','sxm','otg','odg','uop','potx','potm','pptx','pptm','std','sxd','pot','pps','sti','sxi','otp','odp','wb2','wks','wk1','xltx','xltm','xlsx','xlsm','xlsb','slk','xlw','xlt','xlm','xlc','dif','stc','sxc','ots','ods','hwp','dotm','dotx','docm','docx','DOT','3dm','max','3ds','xml','txt','CSV','uot','RTF','pdf','XLS','PPT','stw','sxw','ott','odt','DOC','pem','p12','csr','crt','key']
images_url = []
for ext in extList:
imgs = re.findall('http://static.wooyun.org[^<][^>]+?.'+ext, webContent)
images_url = images_url + imgs
return images_url

`

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions