70 articles found
Wireless Autonomous Soft Crawlers for Adjustable Climbing Actuation
1
Authors: Lei Tian, Ji-Ji Tan, Wei-Liang Dong, Bo Yang, Cui-Hua Li, Dai Wang, Hai-Yu Huang, Xin-Tong Li, Cai-Zhen Zhu, Jian Xu. Chinese Journal of Polymer Science (SCIE, EI, CAS, CSCD), 2023, No. 3, pp. 405-413, I0008 (10 pages).
Artificial soft actuators, featuring a non-equilibrium internal state and fast, programmable shape transformations, have recently attracted strong research interest owing to their flexibility, high controllability, and designability. However, wireless soft actuators that achieve locomotion on slopes of different steep angles with multiple energy conversion have rarely been reported. Herein, we create an asymmetric bilayer strategy to construct an autonomous soft crawler that “breathes” moisture to drive mechanical deformation. The soft crawlers show conspicuous performance, including periodic tumbler locomotion predicted via an improved Timoshenko's equation, multiple reversible shape-morphing modes (circle, helix, despiralization, etc.) determined by their fiber orientation, a controlled drive mode (front drive and rear drive), and a rapid climbing speed (4.76 cm/min) over a wide range of slope angles. Through architecture design, they can be series-wound or shunt-wound to construct multijoint complex actuators. Besides climbing, an intelligent soft ring-pull with admirable cycle performance, intended to guard against overheated or otherwise untouchable objects, is proposed. The soft crawlers also realize multiple energy conversion and can be actuated by light irradiation. We envision that this soft crawler system has enormous potential in intelligent machines, microscopic diagnosis and treatment, biosensing, and energy harvesting and conversion.
Keywords: Soft crawler; Tumbler locomotion; Reversible shape-morphing; Autonomous; Energy conversion
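The abstract of entry 1 refers to an "improved Timoshenko's equation" for the bilayer crawler but does not give it. As a rough point of reference only, the sketch below evaluates the classical Timoshenko bilayer (bimetal) curvature formula, which is an assumption standing in for the paper's improved version. The function name `timoshenko_curvature` and its parameter names are hypothetical.

```python
def timoshenko_curvature(strain_mismatch, h1, h2, e1, e2):
    """Curvature (1/length) of a two-layer strip, classical Timoshenko formula.

    strain_mismatch: difference in free expansion strain between the layers
    h1, h2: layer thicknesses (same length unit)
    e1, e2: layer elastic moduli (same pressure unit)
    NOTE: this is the classical formula, not the paper's "improved" equation.
    """
    if min(h1, h2, e1, e2) <= 0:
        raise ValueError("thicknesses and moduli must be positive")
    m = h1 / h2          # thickness ratio
    n = e1 / e2          # modulus ratio
    h = h1 + h2          # total thickness
    numerator = 6.0 * strain_mismatch * (1.0 + m) ** 2
    denominator = h * (3.0 * (1.0 + m) ** 2
                       + (1.0 + m * n) * (m ** 2 + 1.0 / (m * n)))
    return numerator / denominator


# Equal thickness and stiffness reduce the formula to 1.5 * strain / h.
kappa = timoshenko_curvature(0.01, 0.5, 0.5, 1.0, 1.0)
```

For two identical layers the expression collapses to 3/2 times the strain mismatch divided by the total thickness, which is a convenient sanity check.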
Modular Soft Robotic Crawlers Based on Fluidic Prestressed Composite Actuators
2
Authors: Zefeng Xu, Linkai Hu, Longya Xiao, Hongjie Jiang, Yitong Zhou. Journal of Bionic Engineering (SCIE, EI, CSCD), 2024, No. 2, pp. 694-706 (13 pages).
Soft robotic crawlers have limited payload capacity and crawling speed. This study proposes a high-performance inchworm-like modular robotic crawler based on fluidic prestressed composite (FPC) actuators. The FPC actuator is precurved, and a pneumatic source is used to flatten it, so no energy is required to maintain the equilibrium curved shape. Pressurizing and depressurizing the actuators generates alternating stretching and bending motions, which produce the crawling motion of the robot. Multi-modal locomotion (crawling, turning, and pipe climbing) is achieved by modular reconfiguration and gait design. An analytical kinematic model is proposed to characterize the quasi-static curvature and step size of a single-module crawler. Multiple configurations of robotic crawlers are fabricated to demonstrate the crawling ability of the proposed design. A set of systematic experiments is conducted to understand how crawler responses vary as a function of FPC prestrains, input pressures, and actuation frequencies. In the experiments, the maximum carrying load ratio (carrying load divided by robot weight) is 22.32, and the highest crawling velocity is 3.02 body lengths (BL) per second (392 mm/s). Multi-modal capabilities are demonstrated by reconfiguring three soft crawlers: a matrix crawler robot crawling in amphibious environments, an inching crawler turning at an angular velocity of 2°/s, and earthworm-like crawling robots climbing a 20° slope and a pipe.
Keywords: Soft robot; Soft crawler; Fluidic prestressed composite; Kinematic model; Enhanced loading; Multi-modal capability
PathMarker: protecting web contents against inside crawlers
3
Authors: Shengye Wan, Yue Li, Kun Sun. Cybersecurity (CSCD), 2019, No. 1, pp. 100-116 (17 pages).
Web crawlers have been misused for several malicious purposes, such as downloading server data without permission from the website administrator. Moreover, armoured crawlers are evolving against new anti-crawler mechanisms in the arms race between crawler developers and crawler defenders. In this paper, based on the observation that normal users and malicious crawlers have different short-term and long-term download behaviours, we develop a new anti-crawler mechanism called PathMarker to detect and constrain persistent distributed crawlers. By adding a marker to each Uniform Resource Locator (URL), we can trace the page that leads to the access of this URL and the identity of the user who accesses it. With this supporting information, we can not only perform more accurate heuristic detection using path-related features, but also develop a Support Vector Machine based machine learning detection model that distinguishes malicious crawlers from normal users by inspecting their different patterns of URL visiting paths and URL visiting timings. In addition to effectively detecting crawlers at the earliest stage, PathMarker can dramatically suppress the scraping efficiency of crawlers before they are detected. We deploy our approach on an online forum website, and the evaluation results show that PathMarker can quickly capture all 6 open-source and in-house crawlers, plus two external crawlers (i.e., Googlebots and Yahoo Slurp).
Keywords: Anti-crawler mechanism; Stealthy distributed inside crawler; Confidential website content protection
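The PathMarker abstract describes attaching a marker to each URL that records the parent page and the requesting user. The abstract does not say how the marker is encoded, so the following is only a minimal sketch under stated assumptions: the marker is a base64 JSON payload protected by an HMAC signature (the real system may well encrypt it instead), and the names `add_marker`, `read_marker`, `SECRET_KEY`, and the query parameters `pm` and `sig` are all hypothetical.

```python
import base64
import hashlib
import hmac
import json
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit

SECRET_KEY = b"demo-key-not-for-production"  # hypothetical server-side secret


def add_marker(url, parent_url, user_id, key=SECRET_KEY):
    """Append a signed marker recording the parent page and the user."""
    payload = json.dumps({"parent": parent_url, "user": user_id},
                         sort_keys=True).encode("utf-8")
    token = base64.urlsafe_b64encode(payload).decode("ascii")
    sig = hmac.new(key, payload, hashlib.sha256).hexdigest()
    parts = urlsplit(url)
    extra = urlencode({"pm": token, "sig": sig})
    query = parts.query + "&" + extra if parts.query else extra
    return urlunsplit((parts.scheme, parts.netloc, parts.path,
                       query, parts.fragment))


def read_marker(marked_url, key=SECRET_KEY):
    """Return the marker dict, or None if it is missing or was tampered with."""
    params = parse_qs(urlsplit(marked_url).query)
    try:
        payload = base64.urlsafe_b64decode(params["pm"][0])
        sig = params["sig"][0]
    except (KeyError, ValueError):
        return None
    expected = hmac.new(key, payload, hashlib.sha256).hexdigest()
    if not hmac.compare_digest(sig, expected):
        return None
    return json.loads(payload)


marked = add_marker("https://forum.example/thread?id=7",
                    "https://forum.example/index", "user42")
info = read_marker(marked)
```

With markers like this, a server can log, for every request, which page led to it and which account issued it; those logs are the kind of path and timing features the abstract says feed the heuristic and SVM detectors.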
PathMarker: protecting web contents against inside crawlers
4
Authors: Shengye Wan, Yue Li, Kun Sun. Cybersecurity, 2018, No. 1, pp. 375-391 (17 pages).
Web crawlers have been misused for several malicious purposes, such as downloading server data without permission from the website administrator. Moreover, armoured crawlers are evolving against new anti-crawler mechanisms in the arms race between crawler developers and crawler defenders. In this paper, based on the observation that normal users and malicious crawlers have different short-term and long-term download behaviours, we develop a new anti-crawler mechanism called PathMarker to detect and constrain persistent distributed crawlers. By adding a marker to each Uniform Resource Locator (URL), we can trace the page that leads to the access of this URL and the identity of the user who accesses it. With this supporting information, we can not only perform more accurate heuristic detection using path-related features, but also develop a Support Vector Machine based machine learning detection model that distinguishes malicious crawlers from normal users by inspecting their different patterns of URL visiting paths and URL visiting timings. In addition to effectively detecting crawlers at the earliest stage, PathMarker can dramatically suppress the scraping efficiency of crawlers before they are detected. We deploy our approach on an online forum website, and the evaluation results show that PathMarker can quickly capture all 6 open-source and in-house crawlers, plus two external crawlers (i.e., Googlebots and Yahoo Slurp).
Keywords: Anti-crawler mechanism; Stealthy distributed inside crawler; Confidential website content protection
Deep Learning-Based NLP Framework for Public Sentiment Analysis on Green Consumption: Evidence from Social Media
5
Authors: Luyu Ma, Xiu Cheng, Zongyan Xing, Yue Wu, Weiwei Jiang. Computers, Materials & Continua, 2025, No. 11, pp. 3921-3943 (23 pages).
Green consumption (GC) is crucial for achieving the Sustainable Development Goals (SDGs). However, few studies have explored public attitudes toward GC using social media data, missing potential public concerns captured through big data. To address this gap, this study collects and analyzes public attention toward GC using web crawler technology. Based on data from Sina Weibo, we applied RoBERTa, an advanced NLP model based on the transformer architecture, to conduct fine-grained sentiment analysis of the public's attention, attitudes, and hot topics on GC, demonstrating the potential of deep learning methods in capturing dynamic and contextual emotional shifts across time and regions. Among the sample (N=188,509), 53.91% expressed a positive attitude, with variation across different times and regions. Temporally, public interest in GC has shown an annual growth rate of 30.23%, gradually shifting from fulfilling basic needs to prioritizing entertainment consumption. Spatially, GC is most prevalent in the southeast coastal regions of China, with Beijing ranking first across five evaluated domains. Individuals and government-affiliated accounts play a key role in public discussions on social networks, accounting for 45.89% and 30.01% of user reviews, respectively. A significant positive correlation exists between economic development and public attention to GC, as indicated by a Pearson correlation coefficient of 0.55. Companies, in particular, exhibit cautious behavior in the early stages of green product adoption, prioritizing profitability before making substantial investments. These findings provide valuable insights into the evolving public perception of GC, contributing to the development of more effective environmental policies in China.
Keywords: Green consumption; RoBERTa; Web crawler; Text sentiment analysis; Stakeholder
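Entry 5 reports a Pearson correlation coefficient of 0.55 between economic development and public attention. For readers unfamiliar with the statistic, here is a minimal sketch of how such a coefficient is computed. The function name `pearson_r` is hypothetical, and the sample numbers are made up for illustration; they are not the study's data.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Illustrative values only (hypothetical regional GDP index vs. post counts).
gdp_index = [1.0, 2.0, 3.0, 4.0]
post_counts = [2.0, 4.0, 6.0, 8.0]
r = pearson_r(gdp_index, post_counts)
```

A value near 1 indicates a strong positive linear relationship, a value near 0 indicates none, so the reported 0.55 would usually be read as a moderate positive association.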
Teaching Reform and Practice of the “Data Collection and Web Crawler” Course Based on the Blended Teaching Mode
6
Authors: Simin Wu. Journal of Contemporary Educational Research, 2025, No. 7, pp. 116-122 (7 pages).
The data collection and web crawling course combines a large body of theoretical knowledge with a strong practical component, and traditional teaching methods are no longer sufficient to meet its teaching needs. Based on the characteristics of the course, this article constructs a blended teaching environment built on “Learning Pass + Hongya Platform + Offline Course,” integrates teaching resource libraries with ideological and political cases, and develops a suitable evaluation system in order to cultivate students' innovative and critical thinking abilities, stimulate their learning initiative, improve their teamwork, and enhance their professional level and data literacy.
Keywords: Blended learning mode; Crawler; Course teaching reform
On-line topical importance estimation: an effective focused crawling algorithm combining link and content analysis (cited by: 7)
7
Authors: Can WANG, Zi-yu GUAN, Chun CHEN, Jia-jun BU, Jun-feng WANG, Huai-zhong LIN. Journal of Zhejiang University-Science A (Applied Physics & Engineering) (SCIE, EI, CAS, CSCD), 2009, No. 8, pp. 1114-1124 (11 pages).
Focused crawling is an important technique for topical resource discovery on the Web. The key issue in focused crawling is to prioritize uncrawled uniform resource locators (URLs) in the frontier so that the crawl concentrates on relevant pages. Traditional focused crawlers rely mainly on content analysis, and link-based techniques are not effectively exploited despite their usefulness. In this paper, we propose a new frontier prioritizing algorithm, namely the on-line topical importance estimation (OTIE) algorithm. OTIE combines link- and content-based analysis to evaluate the priority of an uncrawled URL in the frontier. We performed real crawling experiments over 30 topics selected from the Open Directory Project (ODP) and compared the harvest rate and target recall of four crawling algorithms: breadth-first, link-context-prediction, on-line page importance computation (OPIC), and our OTIE. Experimental results showed that OTIE significantly outperforms the other three algorithms on average target recall while maintaining an acceptable harvest rate. Moreover, OTIE is much faster than the traditional focused crawling algorithm.
Keywords: Focused crawlers; Topical crawlers; PageRank; Classifiers; On-line topical importance estimation (OTIE) algorithm
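Entry 7 describes prioritizing uncrawled URLs in the frontier by combining link-based and content-based evidence. The abstract does not give OTIE's scoring rule, so the sketch below is not OTIE; it only illustrates the general idea with a simple weighted sum feeding a priority queue. The class name `Frontier`, the weight `alpha`, and the two score inputs are hypothetical.

```python
import heapq


class Frontier:
    """Crawl frontier that pops the URL with the highest combined score.

    The priority is alpha * content_score + (1 - alpha) * link_score, a
    placeholder combination chosen for illustration (not the OTIE formula).
    """

    def __init__(self, alpha=0.5):
        self.alpha = alpha
        self._heap = []        # entries: (-priority, insertion order, url)
        self._seen = set()     # URLs already queued, to avoid duplicates
        self._counter = 0

    def push(self, url, content_score, link_score):
        """Queue a URL once; return False if it was already queued."""
        if url in self._seen:
            return False
        self._seen.add(url)
        priority = (self.alpha * content_score
                    + (1.0 - self.alpha) * link_score)
        heapq.heappush(self._heap, (-priority, self._counter, url))
        self._counter += 1
        return True

    def pop(self):
        """Remove and return (url, priority) for the best queued URL."""
        neg_priority, _, url = heapq.heappop(self._heap)
        return url, -neg_priority

    def __len__(self):
        return len(self._heap)


frontier = Frontier(alpha=0.5)
frontier.push("http://example.org/a", content_score=0.9, link_score=0.1)
frontier.push("http://example.org/b", content_score=0.8, link_score=0.8)
frontier.push("http://example.org/c", content_score=0.2, link_score=0.3)
first_url, _ = frontier.pop()
```

Negating the priority turns Python's min-heap into a max-heap, and the insertion counter breaks ties so that equally scored URLs come out in first-in order.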
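A Method for Crawling Domain-Relevant Web Sites (cited by: 5)
8
Authors: 李刚, 周立柱, 郭奇, 林玲. 《计算机科学》 (Computer Science; CSCD, PKU Core), 2007, No. 2, pp. 137-140, 148 (5 pages).
This paper proposes a method for crawling domain-relevant Web sites that accurately collects the sites in a user's domain of interest at low cost. The method mainly improves on traditional focused crawler technology. It first uses Meta-Search to improve on the traditional crawler's approach of fetching pages through link analysis, then uses heuristic search to greatly reduce the search cost, and, by introducing a scoring method for evaluating domain relevance, achieves good accuracy. The paper describes the algorithm in detail and verifies its efficiency and effectiveness through detailed experiments.
Keywords: Meta-Search; Focused crawler; Heuristic search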
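Research on Dynamic Web Monitoring and Automatic Acquisition of Science and Technology Information (cited by: 6)
9
Authors: 赵燕平, 朱东华. 《科学学研究》 (Studies in Science of Science; CSSCI, PKU Core), 2003, No. z1, pp. 230-237 (8 pages).
This article reviews theories, methods, and techniques, in China and abroad, for robot-based Web information retrieval (IR), topic-specific retrieval, intelligent information retrieval and its agents, and related automatic information acquisition technologies. It discusses methods and strategies for the intelligent acquisition of online science and technology information that suit the needs of technology forecasting and assessment and can be used to collect topic-specific information in scientific fields. It constructs an overall framework for a system that dynamically monitors and automatically acquires online science and technology information, implements a Web-based system prototype (BIT for short), and analyzes the system's characteristics.
Keywords: Science and technology forecasting; Technology management; Information retrieval; Topic-specific retrieval; Intelligent retrieval; Intelligent agent; Web; Crawler; Agent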
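Research on Methods for Constructing and Maintaining Network Information Resources (cited by: 2)
10
Authors: 周宁, 严亚兰, 刘玮, 张芳芳. 《图书情报知识》 (Documentation, Information & Knowledge; CSSCI, PKU Core), 2003, No. 5, pp. 33-35 (3 pages).
Organizing and constructing Web-based information resources is a major issue facing today's IT industry. Designing an optimized organizational model and automatically constructing and maintaining network information resources are basic infrastructure work. This paper explains the organizing principles and construction methods for network information and argues that using Web robots is an effective mode for organizing and maintaining network information resources.
Keywords: Network information resources; Network information organization; Construction; Crawler; Construction mode; Maintenance methods; Web robot; Web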
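Research and Implementation of a Deep Web Resource Detection System (cited by: 7)
11
Authors: 李涛, 陈鹏, 李哲. 《微计算机信息》 (Microcomputer Information; PKU Core), 2007, No. 33, pp. 185-187 (3 pages).
This paper introduces the importance of deep Web resources and the working principles of traditional crawlers. To make better use of traditional crawlers for acquiring deep Web resources and to address their shortcomings, it proposes a crawler framework with customizable tasks and, based on those customizable tasks, implements the detection of deep Web resources.
Keywords: Crawler; Deep search; Deep Web; Site-based crawling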
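Design of a URL Assignment Algorithm for a Parallel Crawler System (cited by: 1)
12
Authors: 万源, 万方, 王大震. 《计算机工程与应用》 (Computer Engineering and Applications; CSCD, PKU Core), 2006, No. A01, pp. 117-119 (3 pages).
This paper studies a parallel crawler collection model under a distributed architecture, analyzes the function of each component and the basic rules that the crawlers should follow during parallel search to keep the system load balanced, and proposes a hash-based URL scheduling algorithm.
Keywords: Distributed crawler; Hash algorithm; URL assignment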
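Entry 12 proposes a hash-based URL scheduling algorithm for parallel crawlers but the abstract gives no details. A common way to realize the idea, shown here purely as a sketch under that assumption, is to hash the host name of each URL and take the result modulo the number of crawlers, so that all URLs of one site go to the same crawler. The function name `assign_crawler` is hypothetical.

```python
import hashlib
from urllib.parse import urlsplit


def assign_crawler(url, num_crawlers):
    """Map a URL to a crawler index in [0, num_crawlers) by hashing its host.

    A cryptographic digest is used instead of the built-in hash() because
    hash() of strings is randomized per process, and every node in a
    distributed crawler must compute the same assignment.
    """
    if num_crawlers < 1:
        raise ValueError("num_crawlers must be at least 1")
    host = urlsplit(url).hostname or ""
    digest = hashlib.sha1(host.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_crawlers


index_a = assign_crawler("http://example.com/a", 4)
index_b = assign_crawler("http://EXAMPLE.com/b?x=1", 4)
```

Hashing by host keeps per-site politeness limits local to one crawler. A plain modulo scheme does reshuffle most assignments when the number of crawlers changes, which is a known trade-off of this simple approach.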
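A Crawler Architecture for Dynamic Web Page Crawling (cited by: 7)
13
Authors: 严亚兰. 《图书情报知识》 (Documentation, Information & Knowledge; CSSCI, PKU Core), 2003, No. 4, pp. 51-53 (3 pages).
This paper analyzes crawler functionality for crawling dynamic Web pages, proposes a crawler architecture oriented toward dynamic Web page crawling, and discusses the corresponding modules.
Keywords: Crawler architecture; Crawling; Dynamic Web pages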
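A Neural-Network-Based Study of Revisit Frequency for Incremental Crawlers (cited by: 1)
14
Authors: 周英飚, 王军. 《华中科技大学学报(自然科学版)》 (Journal of Huazhong University of Science and Technology, Natural Science Edition; EI, CAS, CSCD, PKU Core), 2004, No. 12, pp. 32-33, 45 (3 pages).
The crawler is an essential core component of a search engine, and the main problem an incremental crawler must solve is how often to revisit changing Web pages. This paper builds a page-change model using an artificial neural network and lets the model determine the incremental crawler's revisit times. It also analyzes the model's use in practice and proposes an application scheme with good adaptivity.
Keywords: Search engine; Crawler; Incremental crawler; Neural network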
A New Framework for Focused Web Crawling (cited by: 3)
15
Authors: PENG Tao, HE Fengling, ZUO Wanli. Wuhan University Journal of Natural Sciences (CAS), 2006, No. 5, pp. 1394-1397 (4 pages).
Focused crawlers are important tools to support applications such as specialized Web portals, online searching, and Web search engines. A topic-driven crawler chooses the best URLs and relevant pages to pursue during Web crawling, but it is difficult to deal with irrelevant pages. This paper presents a novel focused crawler framework. In our focused crawler, we propose a method to overcome some of the limitations of dealing with irrelevant pages. We also introduce the implementation of our focused crawler and present some important metrics and an evaluation function for ranking page relevance. Experimental results show that our crawler can obtain more "important" pages and has high precision and recall.
Keywords: Focused crawlers; Irrelevant pages; Relevance metrics
Weighted PageRank Algorithm Search Engine Ranking Model for Web Pages (cited by: 2)
16
Authors: S. Samsudeen Shaffi, I. Muthulakshmi. Intelligent Automation & Soft Computing (SCIE), 2023, No. 4, pp. 183-192 (10 pages).
As data grows in size, search engines face new challenges in extracting more relevant content for users' searches. As a result, a number of retrieval and ranking algorithms have been employed to ensure that the results are relevant to the user's requirements. Unfortunately, most existing indexes and ranking algorithms crawl documents and web pages based on a limited set of criteria designed to meet user expectations, making it impossible to deliver exceptionally accurate results. This study therefore investigates and analyses how search engines work, as well as the elements that contribute to higher ranks. This paper addresses the issue of bias by proposing a new ranking algorithm based on the PageRank (PR) algorithm, which is one of the most widely used page ranking algorithms. We propose weighted PageRank (WPR) algorithms to test the relationship between these various measures. The Weighted PageRank (WPR) model was used in three distinct trials to compare the rankings of documents and pages based on one or more user preference criteria. The findings of utilizing the Weighted PageRank model showed that using multiple criteria to rank final pages is better than using only one, and that some criteria had a greater impact on ranking results than others.
Keywords: Weighted PageRank algorithms; Search engines; Web pages; Web crawlers; World Wide Web
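Entry 16 builds on a weighted PageRank (WPR) model without stating the weighting rule in its abstract. As background only, the sketch below implements one well-known WPR variant in which each link's share of rank is weighted by the target's in-link and out-link counts relative to its sibling targets. That choice is an assumption and may differ from the paper's user-preference criteria. The function name `weighted_pagerank` and the toy graph are hypothetical.

```python
def weighted_pagerank(links, damping=0.85, iterations=50):
    """Weighted PageRank over a dict {page: [pages it links to]}.

    For a link v -> u the weight is W_in * W_out, where
      W_in  = in_count(u)  / sum of in_count  over all targets of v
      W_out = out_count(u) / sum of out_count over all targets of v
    and PR(u) = (1 - d) + d * sum over v of PR(v) * W_in * W_out.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    # Deduplicate link lists while keeping order.
    out_links = {p: list(dict.fromkeys(links.get(p, []))) for p in pages}
    in_count = {p: 0 for p in pages}
    for targets in out_links.values():
        for t in targets:
            in_count[t] += 1
    out_count = {p: len(out_links[p]) for p in pages}

    rank = {p: 1.0 for p in pages}
    for _ in range(iterations):
        new_rank = {p: 1.0 - damping for p in pages}
        for v, targets in out_links.items():
            total_in = sum(in_count[p] for p in targets)
            total_out = sum(out_count[p] for p in targets)
            for u in targets:
                w_in = in_count[u] / total_in if total_in else 0.0
                w_out = out_count[u] / total_out if total_out else 0.0
                new_rank[u] += damping * rank[v] * w_in * w_out
        rank = new_rank
    return rank


# Toy graph: a links to b and c, b links to c, c links back to a.
ranks = weighted_pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]})
```

In the toy graph, page c receives weighted contributions from both a and b, so it ends up ranked above b, which is linked only from a.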
Monitoring Peer-to-Peer Botnets: Requirements, Challenges, and Future Works (cited by: 1)
17
Authors: Arkan Hammoodi Hasan Kabla, Mohammed Anbar, Selvakumar Manickam, Alwan Ahmed Abdulrahman Alwan, Shankar Karuppayah. Computers, Materials & Continua (SCIE, EI), 2023, No. 5, pp. 3375-3398 (24 pages).
Cyber-criminals compromise end-hosts (bots) to form a network of bots (a botnet). They also look for evolved architectures, such as Peer-to-Peer (P2P) networks, that make their techniques more resilient and stealthier. P2P botnets take advantage of the decentralized nature of P2P networks and exploit the resilience of this architecture to resist take-down procedures. Some P2P botnets are also stealthy in their Command-and-Control (C2) mechanisms and elude the standard discovery mechanisms. The other side of this cyberwar is therefore the monitor. P2P botnet monitoring is an exacting mission because the monitoring must address many aspects simultaneously. Some aspects pertain to the existing monitoring approaches, some to the nature of P2P networks, and some to the botnets' countermeasures, i.e., the anti-monitoring mechanisms. All these challenges should be considered in P2P botnet monitoring. To begin with, this paper provides an anatomy of P2P botnets. Thereafter, it exhaustively reviews the existing monitoring approaches for P2P botnets and thoroughly discusses each to reveal its advantages and disadvantages. In addition, it groups the monitoring approaches into three categories: passive, active, and hybrid. Furthermore, it discusses the functional and non-functional requirements of advanced monitoring. The paper concludes by summarizing the challenges across these aspects and suggesting future avenues for better monitoring of P2P botnets.
Keywords: P2P networks; Botnet; P2P botnet; Botnet monitoring; Honeypot; Crawlers
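Topic-Oriented Web Information Collection Technology (cited by: 1)
18
Authors: 杜欢. 《四川理工学院学报(自然科学版)》 (Journal of Sichuan University of Science & Engineering, Natural Science Edition; CAS), 2007, No. 5, pp. 10-13 (4 pages).
With the Internet developing rapidly, search engines have gradually become users' main tool for obtaining information on the Web. A traditional general-purpose search engine uses a crawler program to collect information across the entire Web. Its drawbacks are untargeted collection, a high rate of stale pages, and an inability to meet the needs of specific professional groups. This situation calls for a topic-oriented search engine with fine-grained, accurate classification, comprehensive and in-depth data, and timely updates.
Keywords: Search engine; Web; Crawler; Topic-oriented search engine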
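A System for Automatically Discovering Resources on a Specified Topic
19
Authors: 朱炜, 李俊, 王超, 潘金贵. 《计算机应用研究》 (Application Research of Computers; CSCD, PKU Core), 2004, No. 11, pp. 87-90 (4 pages).
This paper introduces the key technologies in the design and implementation of the NDDS (NanDa Dolphin Searcher) system. The system uses VSM (Vector Space Model) techniques to determine the search topic automatically. Its intelligent crawler discovers new relevant resources selectively and in a goal-directed way. Link analysis is used to find the most important resources and to rank resources by importance. The two operating modes of NDDS provide a personalized search service and a shared resource service, respectively.
Keywords: World Wide Web; Vector space model; Hyperlink; Intelligent crawler; Anchor text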
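Entry 19 says NDDS uses the Vector Space Model (VSM) to characterize the search topic. The abstract gives no implementation details, so the following is a generic sketch of the VSM idea rather than NDDS's code: texts become term-frequency vectors, and relevance to the topic is the cosine of the angle between vectors. The function names `term_vector` and `cosine_similarity`, the simple tokenizer, and the sample strings are all assumptions.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Lower-case the text and count alphanumeric tokens (raw term frequency)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse term vectors, in [0, 1]."""
    shared = vec_a.keys() & vec_b.keys()
    dot = sum(vec_a[term] * vec_b[term] for term in shared)
    norm_a = math.sqrt(sum(count * count for count in vec_a.values()))
    norm_b = math.sqrt(sum(count * count for count in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


topic = term_vector("focused web crawler topic search")
page_on_topic = term_vector("A focused crawler for topic-specific web search")
page_off_topic = term_vector("Recipes for summer salads")
score_on = cosine_similarity(topic, page_on_topic)
score_off = cosine_similarity(topic, page_off_topic)
```

A crawler could compare each fetched page against the topic vector and follow links only from pages whose score exceeds a threshold. Production systems usually replace raw counts with TF-IDF weights.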
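Design and Application of a Lightweight Focused Crawler Algorithm Based on Node.JS (cited by: 1)
20
Authors: 刘书影. 《哈尔滨师范大学自然科学学报》 (Natural Science Journal of Harbin Normal University; CAS), 2016, No. 6, pp. 26-29 (4 pages).
The paper first introduces the definition of a Web crawler and gives its classification and working principles. Then, after introducing the vertical crawler framework Web magic, it designs and implements a lightweight Web crawler based on Node.JS and applies it to the news-crawling module of a traffic emergency Web site, with good results.
Keywords: Web crawler; Web Magic; Search engine; Light Crawler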