期刊文献+
共找到50篇文章
< 1 2 3 >
每页显示 20 50 100
Reconstructing the 3D digital core with a fully convolutional neural network 被引量:1
1
作者 Li Qiong Chen Zheng +4 位作者 He Jian-Jun Hao Si-Yu Wang Rui Yang Hao-Tao Sun Hua-Jun 《Applied Geophysics》 SCIE CSCD 2020年第3期401-410,共10页
In this paper, the complete process of constructing 3D digital core by fullconvolutional neural network is described carefully. A large number of sandstone computedtomography (CT) images are used as training input for... In this paper, the complete process of constructing 3D digital core by fullconvolutional neural network is described carefully. A large number of sandstone computedtomography (CT) images are used as training input for a fully convolutional neural networkmodel. This model is used to reconstruct the three-dimensional (3D) digital core of Bereasandstone based on a small number of CT images. The Hamming distance together with theMinkowski functions for porosity, average volume specifi c surface area, average curvature,and connectivity of both the real core and the digital reconstruction are used to evaluate theaccuracy of the proposed method. The results show that the reconstruction achieved relativeerrors of 6.26%, 1.40%, 6.06%, and 4.91% for the four Minkowski functions and a Hammingdistance of 0.04479. This demonstrates that the proposed method can not only reconstructthe physical properties of real sandstone but can also restore the real characteristics of poredistribution in sandstone, is the ability to which is a new way to characterize the internalmicrostructure of rocks. 展开更多
关键词 Fully convolutional neural network 3d digital core numerical simulation training set
在线阅读 下载PDF
Image-Based Flow Prediction of Vocal Folds Using 3D Convolutional Neural Networks
2
作者 Yang Zhang Tianmei Pu +1 位作者 Jiasen Xu Chunhua Zhou 《Journal of Bionic Engineering》 SCIE EI CSCD 2024年第2期991-1002,共12页
In this work,a three dimensional(3D)convolutional neural network(CNN)model based on image slices of various normal and pathological vocal folds is proposed for accurate and efficient prediction of glottal flows.The 3D... In this work,a three dimensional(3D)convolutional neural network(CNN)model based on image slices of various normal and pathological vocal folds is proposed for accurate and efficient prediction of glottal flows.The 3D CNN model is composed of the feature extraction block and regression block.The feature extraction block is capable of learning low dimensional features from the high dimensional image data of the glottal shape,and the regression block is employed to flatten the output from the feature extraction block and obtain the desired glottal flow data.The input image data is the condensed set of 2D image slices captured in the axial plane of the 3D vocal folds,where these glottal shapes are synthesized based on the equations of normal vibration modes.The output flow data is the corresponding flow rate,averaged glottal pressure and nodal pressure distributions over the glottal surface.The 3D CNN model is built to establish the mapping between the input image data and output flow data.The ground-truth flow variables of each glottal shape in the training and test datasets are obtained by a high-fidelity sharp-interface immersed-boundary solver.The proposed model is trained to predict the concerned flow variables for glottal shapes in the test set.The present 3D CNN model is more efficient than traditional Computational Fluid Dynamics(CFD)models while the accuracy can still be retained,and more powerful than previous data-driven prediction models because more details of the glottal flow can be provided.The prediction performance of the trained 3D CNN model in accuracy and efficiency indicates that this model could be promising for future clinical applications. 展开更多
关键词 Vocal folds Computational fluid dynamics Machine learning 3d convolutional neural network
在线阅读 下载PDF
Review of Artificial Intelligence for Oil and Gas Exploration: Convolutional Neural Network Approaches and the U-Net 3D Model
3
作者 Weiyan Liu 《Open Journal of Geology》 CAS 2024年第4期578-593,共16页
Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Ou... Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Our review traces the evolution of CNN, emphasizing the adaptation and capabilities of the U-Net 3D model in automating seismic fault delineation with unprecedented accuracy. We find: 1) The transition from basic neural networks to sophisticated CNN has enabled remarkable advancements in image recognition, which are directly applicable to analyzing seismic data. The U-Net 3D model, with its innovative architecture, exemplifies this progress by providing a method for detailed and accurate fault detection with reduced manual interpretation bias. 2) The U-Net 3D model has demonstrated its superiority over traditional fault identification methods in several key areas: it has enhanced interpretation accuracy, increased operational efficiency, and reduced the subjectivity of manual methods. 3) Despite these achievements, challenges such as the need for effective data preprocessing, acquisition of high-quality annotated datasets, and achieving model generalization across different geological conditions remain. Future research should therefore focus on developing more complex network architectures and innovative training strategies to refine fault identification performance further. Our findings confirm the transformative potential of deep learning, particularly CNN like the U-Net 3D model, in geosciences, advocating for its broader integration to revolutionize geological exploration and seismic analysis. 展开更多
关键词 deep Learning convolutional neural networks (cnn) Seismic Fault Identification U-Net 3d Model Geological Exploration
在线阅读 下载PDF
Audiovisual speech recognition based on a deep convolutional neural network 被引量:2
4
作者 Shashidhar Rudregowda Sudarshan Patilkulkarni +2 位作者 Vinayakumar Ravi Gururaj H.L. Moez Krichen 《Data Science and Management》 2024年第1期25-34,共10页
Audiovisual speech recognition is an emerging research topic.Lipreading is the recognition of what someone is saying using visual information,primarily lip movements.In this study,we created a custom dataset for India... Audiovisual speech recognition is an emerging research topic.Lipreading is the recognition of what someone is saying using visual information,primarily lip movements.In this study,we created a custom dataset for Indian English linguistics and categorized it into three main categories:(1)audio recognition,(2)visual feature extraction,and(3)combined audio and visual recognition.Audio features were extracted using the mel-frequency cepstral coefficient,and classification was performed using a one-dimension convolutional neural network.Visual feature extraction uses Dlib and then classifies visual speech using a long short-term memory type of recurrent neural networks.Finally,integration was performed using a deep convolutional network.The audio speech of Indian English was successfully recognized with accuracies of 93.67%and 91.53%,respectively,using testing data from 200 epochs.The training accuracy for visual speech recognition using the Indian English dataset was 77.48%and the test accuracy was 76.19%using 60 epochs.After integration,the accuracies of audiovisual speech recognition using the Indian English dataset for training and testing were 94.67%and 91.75%,respectively. 展开更多
关键词 Audiovisual speech recognition Custom dataset 1d convolution neural network(cnn) deep cnn(dcnn) Long short-term memory(LSTM) LIPREAdING dlib Mel-frequency cepstral coefficient(MFCC)
在线阅读 下载PDF
Enhancing SS-OCT 3D image reconstruction:A real-time system with stripe artifact suppression and GPU parallel acceleration
5
作者 Dandan LIU 《虚拟现实与智能硬件(中英文)》 2026年第1期115-130,共16页
Optical coherence tomography(OCT),particularly Swept-Source OCT,is widely employed in medical diagnostics and industrial inspections owing to its high-resolution imaging capabilities.However,Swept-Source OCT 3D imagin... Optical coherence tomography(OCT),particularly Swept-Source OCT,is widely employed in medical diagnostics and industrial inspections owing to its high-resolution imaging capabilities.However,Swept-Source OCT 3D imaging often suffers from stripe artifacts caused by unstable light sources,system noise,and environmental interference,posing challenges to real-time processing of large-scale datasets.To address this issue,this study introduces a real-time reconstruction system that integrates stripe-artifact suppression and parallel computing using a graphics processing unit.This approach employs a frequency-domain filtering algorithm with adaptive anti-suppression parameters,dynamically adjusted through an image quality evaluation function and optimized using a convolutional neural network for complex frequency-domain feature learning.Additionally,a graphics processing unit integrated 3D reconstruction framework is developed,enhancing data processing throughput and real-time performance via a dual-queue decoupling mechanism.Experimental results demonstrate significant improvements in structural similarity(0.92),peak signal-to-noise ratio(31.62 dB),and stripe suppression ratio(15.73 dB)compared with existing methods.On the RTX 4090 platform,the proposed system achieved an end-to-end delay of 94.36 milliseconds,a frame rate of 10.3 frames per second,and a throughput of 121.5 million voxels per second,effectively suppressing artifacts while preserving image details and enhancing real-time 3D reconstruction performance. 展开更多
关键词 Stripe artifact suppression 3d reconstruction GPU parallel computing Adaptive frequency domain filtering convolutional neural network
在线阅读 下载PDF
快速3D-CNN结合深度可分离卷积对高光谱图像分类 被引量:2
6
作者 王燕 梁琦 《计算机科学与探索》 CSCD 北大核心 2022年第12期2860-2869,共10页
针对卷积神经网络在高光谱图像特征提取和分类的过程中,存在空谱特征提取不充分以及网络层数太多引起的参数量大、计算复杂的问题,提出快速三维卷积神经网络(3D-CNN)结合深度可分离卷积(DSC)的轻量型卷积模型。该方法首先利用增量主成... 针对卷积神经网络在高光谱图像特征提取和分类的过程中,存在空谱特征提取不充分以及网络层数太多引起的参数量大、计算复杂的问题,提出快速三维卷积神经网络(3D-CNN)结合深度可分离卷积(DSC)的轻量型卷积模型。该方法首先利用增量主成分分析(IPCA)对输入的数据进行降维预处理;其次将输入模型的像素分割成小的重叠的三维小卷积块,在分割的小块上基于中心像素形成地面标签,利用三维核函数进行卷积处理,形成连续的三维特征图,保留空谱特征。用3D-CNN同时提取空谱特征,然后在三维卷积中加入深度可分离卷积对空间特征再次提取,丰富空谱特征的同时减少参数量,从而减少计算时间,分类精度也有所提高。所提模型在Indian Pines、Salinas Scene和University of Pavia公开数据集上验证,并且同其他经典的分类方法进行比较。实验结果表明,该方法不仅能大幅度节省可学习的参数,降低模型复杂度,而且表现出较好的分类性能,其中总体精度(OA)、平均分类精度(AA)和Kappa系数均可达99%以上。 展开更多
关键词 高光谱图像分类 空谱特征提取 三维卷积神经网络(3d-cnn) 深度可分离卷积(dSC) 深度学习
在线阅读 下载PDF
基于因果分析与CNN模型的临汾市O_(3)浓度预测
7
作者 宋朕 营娜 +2 位作者 王璟煦 朱向哲 薛志钢 《中国环境监测》 北大核心 2025年第S1期34-40,共7页
近年来,我国O_(3)浓度上升明显。临汾市既是我国三大焦煤生产基地之一,又属于大气污染防治重点区域,因此开展临汾市O_(3)浓度预测研究对于该区域O_(3)污染防控以及空气质量进一步改善具有重要意义。基于2020—2022年临汾国控点的污染物... 近年来,我国O_(3)浓度上升明显。临汾市既是我国三大焦煤生产基地之一,又属于大气污染防治重点区域,因此开展临汾市O_(3)浓度预测研究对于该区域O_(3)污染防控以及空气质量进一步改善具有重要意义。基于2020—2022年临汾国控点的污染物监测数据及气象数据,采用因果分析方法研究站点间的O_(3)空间分布及联系,通过卷积神经网络(CNN)模型预测未来O_(3)浓度。结果显示,因果分析可为模型筛选空间特征,使结合该空间特征构建的预测模型CNN-1的预测精度得到有效提升。临汾市各站点间存在显著的O_(3)传输规律,市委与城南两个站点对其他站点影响大,临钢医院站点受其他站点影响较小。所构建的CNN模型在夏秋季拟合更佳。削减市委和城南站点的O_(3)浓度,可有效改善临钢医院站点的空气质量。该方法可精准识别O_(3)传输源,为准确预测并提前应对O_(3)污染提供技术支撑。 展开更多
关键词 O_(3) 收敛交叉映射 空间特征 卷积神经网络 浓度预测
在线阅读 下载PDF
CurveNet:Curvature-Based Multitask Learning Deep Networks for 3D Object Recognition 被引量:4
8
作者 A.A.M.Muzahid Wanggen Wan +2 位作者 Ferdous Sohel Lianyao Wu Li Hou 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第6期1177-1187,共11页
In computer vision fields,3D object recognition is one of the most important tasks for many real-world applications.Three-dimensional convolutional neural networks(CNNs)have demonstrated their advantages in 3D object ... In computer vision fields,3D object recognition is one of the most important tasks for many real-world applications.Three-dimensional convolutional neural networks(CNNs)have demonstrated their advantages in 3D object recognition.In this paper,we propose to use the principal curvature directions of 3D objects(using a CAD model)to represent the geometric features as inputs for the 3D CNN.Our framework,namely CurveNet,learns perceptually relevant salient features and predicts object class labels.Curvature directions incorporate complex surface information of a 3D object,which helps our framework to produce more precise and discriminative features for object recognition.Multitask learning is inspired by sharing features between two related tasks,where we consider pose classification as an auxiliary task to enable our CurveNet to better generalize object label classification.Experimental results show that our proposed framework using curvature vectors performs better than voxels as an input for 3D object classification.We further improved the performance of CurveNet by combining two networks with both curvature direction and voxels of a 3D object as the inputs.A Cross-Stitch module was adopted to learn effective shared features across multiple representations.We evaluated our methods using three publicly available datasets and achieved competitive performance in the 3D object recognition task. 展开更多
关键词 3d shape analysis convolutional neural network dNNs object classification volumetric cnn
在线阅读 下载PDF
Behavior recognition algorithm based on the improved R3D and LSTM network fusion 被引量:1
9
作者 Wu Jin An Yiyuan +1 位作者 Dai Wei Zhao Bo 《High Technology Letters》 EI CAS 2021年第4期381-387,共7页
Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the... Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the residual module is extended to three dimensions,which can extract features in the time and space domain at the same time.Second,by changing the size of the pooling layer window the integrity of the time domain features is preserved,at the same time,in order to overcome the difficulty of network training and over-fitting problems,the batch normalization(BN)layer and the dropout layer are added.After that,because the global average pooling layer(GAP)is affected by the size of the feature map,the network cannot be further deepened,so the convolution layer and maxpool layer are added to the R3D network.Finally,because LSTM has the ability to memorize information and can extract more abstract timing features,the LSTM network is introduced into the R3D network.Experimental results show that the R3D+LSTM network achieves 91%recognition rate on the UCF-101 dataset. 展开更多
关键词 behavior recognition three-dimensional residual convolutional neural network(R3d) long short-term memory(LSTM) dROPOUT batch normalization(BN)
在线阅读 下载PDF
Short‐term and long‐term memory self‐attention network for segmentation of tumours in 3D medical images
10
作者 Mingwei Wen Quan Zhou +3 位作者 Bo Tao Pavel Shcherbakov Yang Xu Xuming Zhang 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第4期1524-1537,共14页
Tumour segmentation in medical images(especially 3D tumour segmentation)is highly challenging due to the possible similarity between tumours and adjacent tissues,occurrence of multiple tumours and variable tumour shap... Tumour segmentation in medical images(especially 3D tumour segmentation)is highly challenging due to the possible similarity between tumours and adjacent tissues,occurrence of multiple tumours and variable tumour shapes and sizes.The popular deep learning‐based segmentation algorithms generally rely on the convolutional neural network(CNN)and Transformer.The former cannot extract the global image features effectively while the latter lacks the inductive bias and involves the complicated computation for 3D volume data.The existing hybrid CNN‐Transformer network can only provide the limited performance improvement or even poorer segmentation performance than the pure CNN.To address these issues,a short‐term and long‐term memory self‐attention network is proposed.Firstly,a distinctive self‐attention block uses the Transformer to explore the correlation among the region features at different levels extracted by the CNN.Then,the memory structure filters and combines the above information to exclude the similar regions and detect the multiple tumours.Finally,the multi‐layer reconstruction blocks will predict the tumour boundaries.Experimental results demonstrate that our method outperforms other methods in terms of subjective visual and quantitative evaluation.Compared with the most competitive method,the proposed method provides Dice(82.4%vs.76.6%)and Hausdorff distance 95%(HD95)(10.66 vs.11.54 mm)on the KiTS19 as well as Dice(80.2%vs.78.4%)and HD95(9.632 vs.12.17 mm)on the LiTS. 展开更多
关键词 3d medical images convolutional neural network self‐attention network TRANSFORMER tumor segmentation
在线阅读 下载PDF
BTDGCNN:面向三维点云拓扑结构的BallTree动态图卷积神经网络 被引量:4
11
作者 张学典 方慧 《小型微型计算机系统》 CSCD 北大核心 2022年第11期2342-2347,共6页
点云卷积网络对点云进行分割分类时,独立提取点云特征却忽略了点之间的几何关联,从而丢失了许多局部特征.而对稀疏、无结构、无序的点云进行输入转换则会导致数据变得更加庞大,卷积效率降低.为此构建了面向三维点云拓扑结构的BallTree... 点云卷积网络对点云进行分割分类时,独立提取点云特征却忽略了点之间的几何关联,从而丢失了许多局部特征.而对稀疏、无结构、无序的点云进行输入转换则会导致数据变得更加庞大,卷积效率降低.为此构建了面向三维点云拓扑结构的BallTree动态图卷积神经网络,利用Bat-Net变换网络(BallTree transfromation network)对初始无序点云进行空间变换,恢复点云的拓扑结构和距离向量,提高点云中各个点间的关联性,结合三层BAT边卷积模块(BallTree edge convolution network),提升其信息表征能力,以便更好地进行分类分割任务.实验结果表明,该方法在ModelNet40数据集上的分类性能均优于其他五种方法,分别提高了4.4%、2.9%、1.3%、2%和1.4%.同时在ShapeNet Parts数据集上的分割的平均交并比分别提高了1.7%、0.3%、0.3%、0.3%、0.3%,有效地提升了三维点云的分类分割性能. 展开更多
关键词 三维点云 图卷积神经网络 分类 分割
在线阅读 下载PDF
Integrating deep learning and logging data analytics for lithofacies classification and 3D modeling of tight sandstone reservoirs 被引量:5
12
作者 Jing-Jing Liu Jian-Chao Liu 《Geoscience Frontiers》 SCIE CAS CSCD 2022年第1期350-363,共14页
The lithofacies classification is essential for oil and gas reservoir exploration and development.The traditional method of lithofacies classification is based on"core calibration logging"and the experience ... The lithofacies classification is essential for oil and gas reservoir exploration and development.The traditional method of lithofacies classification is based on"core calibration logging"and the experience of geologists.This approach has strong subjectivity,low efficiency,and high uncertainty.This uncertainty may be one of the key factors affecting the results of 3 D modeling of tight sandstone reservoirs.In recent years,deep learning,which is a cutting-edge artificial intelligence technology,has attracted attention from various fields.However,the study of deep-learning techniques in the field of lithofacies classification has not been sufficient.Therefore,this paper proposes a novel hybrid deep-learning model based on the efficient data feature-extraction ability of convolutional neural networks(CNN)and the excellent ability to describe time-dependent features of long short-term memory networks(LSTM)to conduct lithological facies-classification experiments.The results of a series of experiments show that the hybrid CNN-LSTM model had an average accuracy of 87.3%and the best classification effect compared to the CNN,LSTM or the three commonly used machine learning models(Support vector machine,random forest,and gradient boosting decision tree).In addition,the borderline synthetic minority oversampling technique(BSMOTE)is introduced to address the class-imbalance issue of raw data.The results show that processed data balance can significantly improve the accuracy of lithofacies classification.Beside that,based on the fine lithofacies constraints,the sequential indicator simulation method is used to establish a three-dimensional lithofacies model,which completes the fine description of the spatial distribution of tight sandstone reservoirs in the study area.According to this comprehensive analysis,the proposed CNN-LSTM model,which eliminates class imbalance,can be effectively applied to lithofacies classification,and is expected to improve the reality of the geological model for the tight sandstone reservoirs. 展开更多
关键词 deep learning convolutional neural networks LSTM Lithological-facies classification 3d modeling Class imbalance
在线阅读 下载PDF
Panicle-3D: A low-cost 3D-modeling method for rice panicles based on deep learning, shape from silhouette, and supervoxel clustering 被引量:3
13
作者 Dan Wu Lejun Yu +10 位作者 Junli Ye Ruifang Zhai Lingfeng Duan Lingbo Liu Nai Wu Zedong Geng Jingbo Fu Chenglong Huang Shangbin Chen Qian Liu Wanneng Yang 《The Crop Journal》 SCIE CSCD 2022年第5期1386-1398,共13页
Self-occlusions are common in rice canopy images and strongly influence the calculation accuracies of panicle traits. Such interference can be largely eliminated if panicles are phenotyped at the 3 D level.Research on... Self-occlusions are common in rice canopy images and strongly influence the calculation accuracies of panicle traits. Such interference can be largely eliminated if panicles are phenotyped at the 3 D level.Research on 3 D panicle phenotyping has been limited. Given that existing 3 D modeling techniques do not focus on specified parts of a target object, an efficient method for panicle modeling of large numbers of rice plants is lacking. This paper presents an automatic and nondestructive method for 3 D panicle modeling. The proposed method integrates shoot rice reconstruction with shape from silhouette, 2 D panicle segmentation with a deep convolutional neural network, and 3 D panicle segmentation with ray tracing and supervoxel clustering. A multiview imaging system was built to acquire image sequences of rice canopies with an efficiency of approximately 4 min per rice plant. The execution time of panicle modeling per rice plant using 90 images was approximately 26 min. The outputs of the algorithm for a single rice plant are a shoot rice model, surface shoot rice model, panicle model, and surface panicle model, all represented by a list of spatial coordinates. The efficiency and performance were evaluated and compared with the classical structure-from-motion algorithm. The results demonstrated that the proposed method is well qualified to recover the 3 D shapes of rice panicles from multiview images and is readily adaptable to rice plants of diverse accessions and growth stages. The proposed algorithm is superior to the structure-from-motion method in terms of texture preservation and computational efficiency. The sample images and implementation of the algorithm are available online. This automatic, cost-efficient, and nondestructive method of 3 D panicle modeling may be applied to high-throughput 3 D phenotyping of large rice populations. 展开更多
关键词 Panicle phenotyping deep convolutional neural network 3d reconstruction Shape from silhouette Point-cloud segmentation Ray tracing Supervoxel clustering
在线阅读 下载PDF
基于CNN+D-S证据理论的多维信息源局部放电故障识别 被引量:22
14
作者 王磊 张磊 +3 位作者 牛荣泽 孙芊 李丰君 张周胜 《电力工程技术》 北大核心 2022年第5期172-179,共8页
基于多维信息源融合的局部放电故障识别方法对提高故障识别的准确性和容错性具有重要意义。文中以开关柜中的典型局部放电类型为识别对象,设置4种典型的局部放电模型(电晕放电、沿面放电、悬浮放电和气隙放电),利用超声波(Ultra)法、甚... 基于多维信息源融合的局部放电故障识别方法对提高故障识别的准确性和容错性具有重要意义。文中以开关柜中的典型局部放电类型为识别对象,设置4种典型的局部放电模型(电晕放电、沿面放电、悬浮放电和气隙放电),利用超声波(Ultra)法、甚-特高频(V-UHF)法以及脉冲电流法(PCM)采集不同放电类型产生的局放信号。首先利用深度卷积神经网络(CNN)算法对不同传感器测量数据进行训练,之后利用Dempster-Shafer(D-S)证据理论对多维信息源识别结果进行融合,并作出最终决策。结果表明,相比于基于单一信息源的故障识别模式,基于多维信息源的故障识别模式准确率更高,且当多维信息源中某一信息源出现误判时仍能正确识别放电类型,对信息源的容错性更好,识别效果良好。 展开更多
关键词 局部放电 故障识别 深度卷积神经网络(cnn) dempster-Shafer(d-S)证据理论 多维信息源 信息融合
在线阅读 下载PDF
Web3D Learning Framework for 3D Shape Retrieval Based on Hybrid Convolutional Neural Networks 被引量:1
15
作者 Wen Zhou Jinyuan Jia +1 位作者 Chengxi Huang Yongqing Cheng 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2020年第1期93-102,共10页
With the rapid development of Web3 D technologies, sketch-based model retrieval has become an increasingly important challenge, while the application of Virtual Reality and 3 D technologies has made shape retrieval of... With the rapid development of Web3 D technologies, sketch-based model retrieval has become an increasingly important challenge, while the application of Virtual Reality and 3 D technologies has made shape retrieval of furniture over a web browser feasible. In this paper, we propose a learning framework for shape retrieval based on two Siamese VGG-16 Convolutional Neural Networks(CNNs), and a CNN-based hybrid learning algorithm to select the best view for a shape. In this algorithm, the AlexNet and VGG-16 CNN architectures are used to perform classification tasks and to extract features, respectively. In addition, a feature fusion method is used to measure the similarity relation of the output features from the two Siamese networks. The proposed framework can provide new alternatives for furniture retrieval in the Web3 D environment. The primary innovation is in the employment of deep learning methods to solve the challenge of obtaining the best view of 3 D furniture,and to address cross-domain feature learning problems. We conduct an experiment to verify the feasibility of the framework and the results show our approach to be superior in comparison to many mainstream state-of-the-art approaches. 展开更多
关键词 WEB3d sketch-based model RETRIEVAL convolutional neural networks(cnns) best VIEW cross-domain
原文传递
3-Dimensional Bag of Visual Words Framework on Action Recognition 被引量:1
16
作者 Shiqi Wang Yimin Yang +1 位作者 Ruizhong Wei Qingming Jonathan Wu 《Computers, Materials & Continua》 SCIE EI 2020年第6期1081-1091,共11页
Human motion recognition plays a crucial role in the video analysis framework.However,a given video may contain a variety of noises,such as an unstable background and redundant actions,that are completely different fr... Human motion recognition plays a crucial role in the video analysis framework.However,a given video may contain a variety of noises,such as an unstable background and redundant actions,that are completely different from the key actions.These noises pose a great challenge to human motion recognition.To solve this problem,we propose a new method based on the 3-Dimensional(3D)Bag of Visual Words(BoVW)framework.Our method includes two parts:The first part is the video action feature extractor,which can identify key actions by analyzing action features.In the video action encoder,by analyzing the action characteristics of a given video,we use the deep 3D CNN pre-trained model to obtain expressive coding information.A classifier with subnetwork nodes is used for the final classification.The extensive experiments demonstrate that our method leads to an impressive effect on complex video analysis.Our approach achieves state-of-the-art performance on the datasets of UCF101(85.3%)and HMDB51(54.5%). 展开更多
关键词 Action recognition 3d cnns recurrent neural networks residual networks subnetwork nodes
在线阅读 下载PDF
An Interactive platform for low-cost 3D building modeling from VGI data using convolutional neural network 被引量:1
17
作者 Hongchao Fan Gefei Kong Chaoquan Zhang 《Big Earth Data》 EI 2021年第1期49-65,共17页
The applications of 3D building models are limited as producing them requires massive labor and time costs as well as expensive devices.In this paper,we aim to propose a novel and web-based interactive platform,VGI3D,... The applications of 3D building models are limited as producing them requires massive labor and time costs as well as expensive devices.In this paper,we aim to propose a novel and web-based interactive platform,VGI3D,to overcome these challenges.The platform is designed to reconstruct 3D building models by using free images from internet users or volunteered geographic informa-tion(VGI)platform,even though not all these images are of high quality.Our interactive platform can effectively obtain each 3D building model from images in 30 seconds,with the help of user interaction module and convolutional neural network(CNN).The user interaction module provides the boundary of building facades for 3D building modeling.And this CNN can detect facade elements even though multiple architectural styles and complex scenes are within the images.Moreover,user interaction module is designed as simple as possible to make it easier to use for both of expert and non-expert users.Meanwhile,we conducted a usability testing and collected feedback from participants to better optimize platform and user experience.In general,the usage of VGI data reduces labor and device costs,and CNN simplifies the process of elements extraction in 3D building modeling.Hence,our proposed platform offers a promising solution to the 3D modeling community. 展开更多
关键词 3d building modeling VGI convolutional neural network user interaction low cost
原文传递
基于1-D CNN的二阶段OFDM系统定时同步方法 被引量:1
18
作者 卿朝进 杨娜 +1 位作者 唐书海 饶川贵 《计算机应用研究》 CSCD 北大核心 2023年第2期565-570,共6页
针对存在多径干扰的正交频分复用系统的定时同步准确性低的问题,提出基于一维卷积神经网络(1-D CNN)的二阶段OFDM系统定时同步方法。在第一阶段,利用经典互相关方法实现路径特征初始抽取,捕获可分辨路径上的定时辅助同步点;基于定时辅... 针对存在多径干扰的正交频分复用系统的定时同步准确性低的问题,提出基于一维卷积神经网络(1-D CNN)的二阶段OFDM系统定时同步方法。在第一阶段,利用经典互相关方法实现路径特征初始抽取,捕获可分辨路径上的定时辅助同步点;基于定时辅助同步点构建1-D CNN网络学习第二阶段中的定时偏移;最后,结合两阶段处理,获得系统最终的定时同步偏移估计。相比于基于压缩感知的定时同步方法和基于极限学习机的定时同步方法,所研究的二阶段OFDM系统定时同步方法提高了定时同步准确性,并有效地降低计算复杂度与处理延迟。 展开更多
关键词 二阶段定时同步 一维卷积神经网络 正交频分复用
在线阅读 下载PDF
3D Filtering by Block Matching and Convolutional Neural Network for Image Denoising
19
作者 Bei-Ji Zou Yun-Di Guo +3 位作者 Qi He Ping-Bo Ouyang Ke Liu Zai-Liang Chen 《Journal of Computer Science & Technology》 SCIE EI CSCD 2018年第4期838-848,共11页
Block matching based 3D filtering methods have achieved great success in image denoising tasks. However the manually set filtering operation could not well describe a good model to transform noisy images to clean imag... Block matching based 3D filtering methods have achieved great success in image denoising tasks. However the manually set filtering operation could not well describe a good model to transform noisy images to clean images. In this paper, we introduce convolutional neural network (CNN) for the 3D filtering step to learn a well fitted model for denoising. With a trainable model, prior knowledge is utilized for better mapping from noisy images to clean images. This block matching and CNN joint model (BMCNN) could denoise images with different sizes and different noise intensity well, especially images with high noise levels. The experimental results demonstrate that among all competing methods, this method achieves the highest peak signal to noise ratio (PSNR) when denoising images with high noise levels (σ 〉 40), and the best visual quality when denoising images with all the tested noise levels. 展开更多
关键词 block matching convolutional neural network cnn dENOISING 3d filtering
原文传递
Mural Anomaly Region Detection Algorithm Based on Hyperspectral Multiscale Residual Attention Network
20
作者 Bolin Guo Shi Qiu +1 位作者 Pengchang Zhang Xingjia Tang 《Computers, Materials & Continua》 SCIE EI 2024年第10期1809-1833,共25页
Mural paintings hold significant historical information and possess substantial artistic and cultural value.However,murals are inevitably damaged by natural environmental factors such as wind and sunlight,as well as b... Mural paintings hold significant historical information and possess substantial artistic and cultural value.However,murals are inevitably damaged by natural environmental factors such as wind and sunlight,as well as by human activities.For this reason,the study of damaged areas is crucial for mural restoration.These damaged regions differ significantly from undamaged areas and can be considered abnormal targets.Traditional manual visual processing lacks strong characterization capabilities and is prone to omissions and false detections.Hyperspectral imaging can reflect the material properties more effectively than visual characterization methods.Thus,this study employs hyperspectral imaging to obtain mural information and proposes a mural anomaly detection algorithm based on a hyperspectral multi-scale residual attention network(HM-MRANet).The innovations of this paper include:(1)Constructing mural painting hyperspectral datasets.(2)Proposing a multi-scale residual spectral-spatial feature extraction module based on a 3D CNN(Convolutional Neural Networks)network to better capture multiscale information and improve performance on small-sample hyperspectral datasets.(3)Proposing the Enhanced Residual Attention Module(ERAM)to address the feature redundancy problem,enhance the network’s feature discrimination ability,and further improve abnormal area detection accuracy.The experimental results show that the AUC(Area Under Curve),Specificity,and Accuracy of this paper’s algorithm reach 85.42%,88.84%,and 87.65%,respectively,on this dataset.These results represent improvements of 3.07%,1.11%and 2.68%compared to the SSRN algorithm,demonstrating the effectiveness of this method for mural anomaly detection. 展开更多
关键词 MURALS anomaly detection HYPERSPECTRAL 3d cnn(convolutional neural networks) residual network
在线阅读 下载PDF
上一页 1 2 3 下一页 到第
使用帮助 返回顶部