Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting corre...Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting correlations, frequent patterns, associations, or causal structures between items hidden in a large database. By exploiting quantum computing, we propose an efficient quantum search algorithm design to discover the maximum frequent patterns. We modified Grover’s search algorithm so that a subspace of arbitrary symmetric states is used instead of the whole search space. We presented a novel quantum oracle design that employs a quantum counter to count the maximum frequent items and a quantum comparator to check with a minimum support threshold. The proposed derived algorithm increases the rate of the correct solutions since the search is only in a subspace. Furthermore, our algorithm significantly scales and optimizes the required number of qubits in design, which directly reflected positively on the performance. Our proposed design can accommodate more transactions and items and still have a good performance with a small number of qubits.展开更多
The advent of the big data era has provided many types of transportation datasets,such as metro smart card data,for studying residents’mobility and understanding how their mobility has been shaped and is shaping the ...The advent of the big data era has provided many types of transportation datasets,such as metro smart card data,for studying residents’mobility and understanding how their mobility has been shaped and is shaping the urban space.In this paper,we use metro smart card data from two Chinese metropolises,Shanghai and Shenzhen.Five metro mobility indicators are introduced,and association rules are established to explore the mobility patterns.The proportion of people entering and exiting the station is used to measure the jobs-housing balance.It is found that the average travel distance and duration of Shanghai passengers are higher than those of Shenzhen,and the proportion of metro commuters in Shanghai is higher than that of Shenzhen.The jobs-housing spatial relationship in Shenzhen based on metro travel is more balanced than that in Shanghai.The fundamental reason for the differences between the two cities is the difference in urban morphology.Compared with the monocentric structure of Shanghai,the polycentric structure of Shenzhen results in more scattered travel hotspots and more diverse travel routes,which helps Shenzhen to have a better jobs-housing balance.This paper fills a gap in comparative research among Chinese cities based on transportation big data analysis.The results provide support for planning metro routes,adjusting urban structure and land use to form a more reasonable metro network,and balancing the jobs-housing spatial relationship.展开更多
In this paper, we propose an efficient algorithm, called FFP-Growth (shortfor fast FP-Growth) , to mine frequent itemsets. Similar to FP-Growth, FFP-Growth searches theFP-tree in the bottom-up order, but need not cons...In this paper, we propose an efficient algorithm, called FFP-Growth (shortfor fast FP-Growth) , to mine frequent itemsets. Similar to FP-Growth, FFP-Growth searches theFP-tree in the bottom-up order, but need not construct conditional pattern bases and sub-FP-trees,thus, saving a substantial amount of time and space, and the FP-tree created by it is much smallerthan that created by TD-FP-Growth, hence improving efficiency. At the same time, FFP-Growth can beeasily extended for reducing the search space as TD-FP-Growth (M) and TD-FP-Growth (C). Experimentalresults show that the algorithm of this paper is effective and efficient.展开更多
文摘Maximum frequent pattern generation from a large database of transactions and items for association rule mining is an important research topic in data mining. Association rule mining aims to discover interesting correlations, frequent patterns, associations, or causal structures between items hidden in a large database. By exploiting quantum computing, we propose an efficient quantum search algorithm design to discover the maximum frequent patterns. We modified Grover’s search algorithm so that a subspace of arbitrary symmetric states is used instead of the whole search space. We presented a novel quantum oracle design that employs a quantum counter to count the maximum frequent items and a quantum comparator to check with a minimum support threshold. The proposed derived algorithm increases the rate of the correct solutions since the search is only in a subspace. Furthermore, our algorithm significantly scales and optimizes the required number of qubits in design, which directly reflected positively on the performance. Our proposed design can accommodate more transactions and items and still have a good performance with a small number of qubits.
基金National Key R&D Program of China(No.2019YFB2103102)Hong Kong Polytechnic University(No.CD06,P0042540)。
文摘The advent of the big data era has provided many types of transportation datasets,such as metro smart card data,for studying residents’mobility and understanding how their mobility has been shaped and is shaping the urban space.In this paper,we use metro smart card data from two Chinese metropolises,Shanghai and Shenzhen.Five metro mobility indicators are introduced,and association rules are established to explore the mobility patterns.The proportion of people entering and exiting the station is used to measure the jobs-housing balance.It is found that the average travel distance and duration of Shanghai passengers are higher than those of Shenzhen,and the proportion of metro commuters in Shanghai is higher than that of Shenzhen.The jobs-housing spatial relationship in Shenzhen based on metro travel is more balanced than that in Shanghai.The fundamental reason for the differences between the two cities is the difference in urban morphology.Compared with the monocentric structure of Shanghai,the polycentric structure of Shenzhen results in more scattered travel hotspots and more diverse travel routes,which helps Shenzhen to have a better jobs-housing balance.This paper fills a gap in comparative research among Chinese cities based on transportation big data analysis.The results provide support for planning metro routes,adjusting urban structure and land use to form a more reasonable metro network,and balancing the jobs-housing spatial relationship.
文摘In this paper, we propose an efficient algorithm, called FFP-Growth (shortfor fast FP-Growth) , to mine frequent itemsets. Similar to FP-Growth, FFP-Growth searches theFP-tree in the bottom-up order, but need not construct conditional pattern bases and sub-FP-trees,thus, saving a substantial amount of time and space, and the FP-tree created by it is much smallerthan that created by TD-FP-Growth, hence improving efficiency. At the same time, FFP-Growth can beeasily extended for reducing the search space as TD-FP-Growth (M) and TD-FP-Growth (C). Experimentalresults show that the algorithm of this paper is effective and efficient.