Content-based filtering E-commerce recommender system was discussed fully in this paper. Users' unique features can be explored by means of vector space model firstly. Then based on the qualitative value of products ...Content-based filtering E-commerce recommender system was discussed fully in this paper. Users' unique features can be explored by means of vector space model firstly. Then based on the qualitative value of products informa tion, the recommender lists were obtained. Since the system can adapt to the users' feedback automatically, its performance were enhanced comprehensively. Finally the evaluation of the system and the experimental results were presented.展开更多
With the flooding of pornographic information on the Internet, how to keep people away from that offensive information is becoming one of the most important research areas in network information security. Some applica...With the flooding of pornographic information on the Internet, how to keep people away from that offensive information is becoming one of the most important research areas in network information security. Some applications which can block or filter such information are used. Approaches in those systems can be roughly classified into two kinds: metadata based and content based. With the development of distributed technologies, content based filtering technologies will play a more and more important role in filtering systems. Keyword matching is a content based method used widely in harmful text filtering. Experiments to evaluate the recall and precision of the method showed that the precision of the method is not satisfactory, though the recall of the method is rather high. According to the results, a new pornographic text filtering model based on reconfirming is put forward. Experiments showed that the model is practical, has less loss of recall than the single keyword matching method, and has higher precision.展开更多
The standalone Global Positioning System (GPS) does not meet the higher accuracy requirements needed for approach and landing phase of an aircraft. To meet the Category-I Precision Approach (CAT-I PA) requirements of ...The standalone Global Positioning System (GPS) does not meet the higher accuracy requirements needed for approach and landing phase of an aircraft. To meet the Category-I Precision Approach (CAT-I PA) requirements of civil aviation, satellite based augmentation system (SBAS) has been planned by various countries including USA, Europe, Japan and India. The Indian SBAS is named as GPS Aided Geo Augmented Navigation (GAGAN). The GAGAN network consists of several dual frequency GPS receivers located at various airports around the Indian subcontinent. The ionospheric delay, which is a function of the total electron content (TEC), is one of the main sources of error affecting GPS/SBAS accuracy. A dual frequency GPS receiver can be used to estimate the TEC. However, line-of-sight TEC derived from dual frequency GPS data is corrupted by the instrumental biases of the GPS receiver and satellites. The estimation of receiver instrumental bias is particularly important for obtaining accurate estimates of ionospheric delay. In this paper, two prominent techniques based on Kalman filter and Self-Calibration Of pseudo Range Error (SCORE) algorithm are used for estimation of instrumental biases. The estimated instrumental bias and TEC results for the GPS Aided Geo Augmented Navigation (GAGAN) station at Hyderabad (78.47°E, 17.45°N), India are presented.展开更多
The coal filter cake is a product of fine coal after floatation which has an ash content of 7-13%, water content of 30±2%, and a particle size of less than 1 mm. The ash content was measured by the intensity of t...The coal filter cake is a product of fine coal after floatation which has an ash content of 7-13%, water content of 30±2%, and a particle size of less than 1 mm. The ash content was measured by the intensity of the single backscattered gamma-ray, and its accuracy is mainly dependent on the energy of the gamma-ray. The 238Pu low energy photon source is selected in this work. The energy of its gamma-ray is 15 keV, which can result not only in the best sensitivity, but also in the lowest contribution to the environment radiation. The root mean square deviation of the ash measurement is±0.33% (±1σ).展开更多
The washing of filter cakes, especially the displacement washing, represents an important aspect in science researches and industrial applications. A lot of ongoing researches are focussed on impurities, which are dis...The washing of filter cakes, especially the displacement washing, represents an important aspect in science researches and industrial applications. A lot of ongoing researches are focussed on impurities, which are dissolved in the mother liquid (e.g. sodium chloride) and washed out with the identical pure liquid without impurities. The project flushing focuses on systems with two chemically different liquids. The main aim is to exchange an organic solvent by water. This article focuses on the adsorption effects during a washing process with solid systems of different wetting behaviours.展开更多
In deduplication, index-lookup disk bottleneck is a major obstacle which limits the throughput of backup processes. One way to minimize the effect of this issue and boost speed is to use very high course-grained chunk...In deduplication, index-lookup disk bottleneck is a major obstacle which limits the throughput of backup processes. One way to minimize the effect of this issue and boost speed is to use very high course-grained chunks for deduplication at a cost of low storage saving and limited scalability. Another way is to distribute the deduplication process among multiple nodes but this approach introduces storage node island effect and also incurs high communication cost. In this paper, we explore dCACH, a content-aware clustered and hierarchical deduplication system, which implements a hybrid of inline course grained and offline fine-grained distributed deduplication where routing decisions are made for a set of files instead of single files. It utilizes bloom filters for detecting similarity between a data stream and previous data streams and performs stateful routing which solves the storage node island problem. Moreover, it exploits the negligibly small amount of content shared among chunks from different file types to create groups of files and deduplicate each group in their own fingerprint index space. It implements hierarchical deduplication to reduce the size of fingerprint indexes at the global level, where only files and big sized segments are deduplicated. Locality is created and exploited first using the big sized segments deduplicated at the global level and second by routing a set of consecutive files together to one storage node. Furthermore, the use of bloom filter for similarity detection between streams has low communication and computation cost while it enables to achieve duplicate elimination performance comparable to single node deduplication. dCACH is evaluated using a prototype deployed on a server environment distributed over four separate machines. It is shown to have 10× the speed of Extreme_Binn with a minimal communication overhead, while its duplicate elimination effectiveness is on a par with a single node deduplication system.展开更多
基金Supported bythe Hunan Teaching Reformand Re-search Project of Colleges and Universities (2003-B72) the HunanBoard of Review on Philosophic and Social Scientific Pay-off Project(0406035) the Hunan Soft Science Research Project(04ZH6005)
文摘Content-based filtering E-commerce recommender system was discussed fully in this paper. Users' unique features can be explored by means of vector space model firstly. Then based on the qualitative value of products informa tion, the recommender lists were obtained. Since the system can adapt to the users' feedback automatically, its performance were enhanced comprehensively. Finally the evaluation of the system and the experimental results were presented.
文摘With the flooding of pornographic information on the Internet, how to keep people away from that offensive information is becoming one of the most important research areas in network information security. Some applications which can block or filter such information are used. Approaches in those systems can be roughly classified into two kinds: metadata based and content based. With the development of distributed technologies, content based filtering technologies will play a more and more important role in filtering systems. Keyword matching is a content based method used widely in harmful text filtering. Experiments to evaluate the recall and precision of the method showed that the precision of the method is not satisfactory, though the recall of the method is rather high. According to the results, a new pornographic text filtering model based on reconfirming is put forward. Experiments showed that the model is practical, has less loss of recall than the single keyword matching method, and has higher precision.
文摘The standalone Global Positioning System (GPS) does not meet the higher accuracy requirements needed for approach and landing phase of an aircraft. To meet the Category-I Precision Approach (CAT-I PA) requirements of civil aviation, satellite based augmentation system (SBAS) has been planned by various countries including USA, Europe, Japan and India. The Indian SBAS is named as GPS Aided Geo Augmented Navigation (GAGAN). The GAGAN network consists of several dual frequency GPS receivers located at various airports around the Indian subcontinent. The ionospheric delay, which is a function of the total electron content (TEC), is one of the main sources of error affecting GPS/SBAS accuracy. A dual frequency GPS receiver can be used to estimate the TEC. However, line-of-sight TEC derived from dual frequency GPS data is corrupted by the instrumental biases of the GPS receiver and satellites. The estimation of receiver instrumental bias is particularly important for obtaining accurate estimates of ionospheric delay. In this paper, two prominent techniques based on Kalman filter and Self-Calibration Of pseudo Range Error (SCORE) algorithm are used for estimation of instrumental biases. The estimated instrumental bias and TEC results for the GPS Aided Geo Augmented Navigation (GAGAN) station at Hyderabad (78.47°E, 17.45°N), India are presented.
文摘The coal filter cake is a product of fine coal after floatation which has an ash content of 7-13%, water content of 30±2%, and a particle size of less than 1 mm. The ash content was measured by the intensity of the single backscattered gamma-ray, and its accuracy is mainly dependent on the energy of the gamma-ray. The 238Pu low energy photon source is selected in this work. The energy of its gamma-ray is 15 keV, which can result not only in the best sensitivity, but also in the lowest contribution to the environment radiation. The root mean square deviation of the ash measurement is±0.33% (±1σ).
文摘The washing of filter cakes, especially the displacement washing, represents an important aspect in science researches and industrial applications. A lot of ongoing researches are focussed on impurities, which are dissolved in the mother liquid (e.g. sodium chloride) and washed out with the identical pure liquid without impurities. The project flushing focuses on systems with two chemically different liquids. The main aim is to exchange an organic solvent by water. This article focuses on the adsorption effects during a washing process with solid systems of different wetting behaviours.
文摘In deduplication, index-lookup disk bottleneck is a major obstacle which limits the throughput of backup processes. One way to minimize the effect of this issue and boost speed is to use very high course-grained chunks for deduplication at a cost of low storage saving and limited scalability. Another way is to distribute the deduplication process among multiple nodes but this approach introduces storage node island effect and also incurs high communication cost. In this paper, we explore dCACH, a content-aware clustered and hierarchical deduplication system, which implements a hybrid of inline course grained and offline fine-grained distributed deduplication where routing decisions are made for a set of files instead of single files. It utilizes bloom filters for detecting similarity between a data stream and previous data streams and performs stateful routing which solves the storage node island problem. Moreover, it exploits the negligibly small amount of content shared among chunks from different file types to create groups of files and deduplicate each group in their own fingerprint index space. It implements hierarchical deduplication to reduce the size of fingerprint indexes at the global level, where only files and big sized segments are deduplicated. Locality is created and exploited first using the big sized segments deduplicated at the global level and second by routing a set of consecutive files together to one storage node. Furthermore, the use of bloom filter for similarity detection between streams has low communication and computation cost while it enables to achieve duplicate elimination performance comparable to single node deduplication. dCACH is evaluated using a prototype deployed on a server environment distributed over four separate machines. It is shown to have 10× the speed of Extreme_Binn with a minimal communication overhead, while its duplicate elimination effectiveness is on a par with a single node deduplication system.