Brooding is a major breeding investment of parental birds during the early nestling stage, and has important effects on the development and survival of nestlings. Investigating brooding behavior can help to understand...Brooding is a major breeding investment of parental birds during the early nestling stage, and has important effects on the development and survival of nestlings. Investigating brooding behavior can help to understand avian breeding investment strategies. From January to June in 2013 and 2014, we studied the brooding behaviors of long-tailed tits (Aegithalos caudatus glaucogularis) in Dongzhai National Nature Reserve, Henan Province, China. We analyzed the relationships between parental diurnal brooding duration and nestling age, brood size, temperature, relative breeding season, time of day and nestling frequencies during brooding duration. Results showed that female and male long-tailed tit parents had different breeding investment strategies during the early nestling stage. Female parents bore most of the brooding investment, while male parents performed most of the nestling feedings. In addition, helpers were not found to brood nestlings at the two cooperative breeding nests. Parental brooding duration was significantly associated with the food delivered to nestlings (F=86.10, dr=l, 193.94, P〈0.001), and was longer when the nestlings received more food. We found that parental brooding duration declined significantly as nestlings aged (F=5.99, dr=-1, 50.13, P=0.018). When nestlings were six days old, daytime parental brooding almost ceased, implying that long- tailed tit nestlings might be able to maintain their own body temperature by this age. In addition, brooding duration was affected by both brood size (F=12.74, dr=-1,32.08, P=0.001) and temperature (F=5.83, df=-l, 39.59, P=-0.021), with it being shorter in larger broods and when ambient temperature was higher.展开更多
Jinfengopteryx is a newly uncovered Archaeopteryx-like avialan bird outside Germany, which was found from the Jehol Biota of northern Hebei in northeastern China. It shares many characters only with Archaeopteryx by t...Jinfengopteryx is a newly uncovered Archaeopteryx-like avialan bird outside Germany, which was found from the Jehol Biota of northern Hebei in northeastern China. It shares many characters only with Archaeopteryx by the possession of three fenestrae in the antorbital cavity, 23 caudal vertebrae and long tail feathers attached to all the caudal vertebrae. But the former differs from the latter in the relatively short and high preorbital region of skull, more and closely packed teeth, much shorter forelimb compared to hindlimb. Such differences indicate Jinfengopteryx is even slightly more primitive than Archaeopteryx, although both birds can be placed at the root position of the avialan tree based on cladistic analysis. Shenzhouraptor is suggested to be slightly more advanced than Jinfengopteryx + Archaeopteryx, supported by some derived features in teeth, shoulder girdles and forelimbs such as the reduction of tooth number, dorsolaterally directed glenoid facet, very long forelimb and comparatively short manus. Meanwhile, the tail of Shenzhouraptor shows more primitive characters than those of Jinfengopteryx and Archaeopteryx, e.g., the strikingly longer tail composed of more caudal vertebrae and the long tail feathers attached only to distal caudal segments. The mixed primitive and advanced characters reveal the evident mosaic evolution among long-tailed avialan birds.展开更多
With the rapid increase of large-scale problems, the distribution of real-world datasets tends to be long-tailed. Existing solutions typically involve re-balancing strategies (i.e., re-sampling and re-weighting). Alth...With the rapid increase of large-scale problems, the distribution of real-world datasets tends to be long-tailed. Existing solutions typically involve re-balancing strategies (i.e., re-sampling and re-weighting). Although they can significantly promote the classifier learning of deep networks, they will unexpectedly impair the representative ability of the learned deep features to a certain extent. Therefore, this paper proposes a dual-channel learning algorithm with involution neural networks (DC-Invo) to take care of representation learning and classifier learning concurrently. In this work, the most important thing is to combine ResNet and involution to obtain higher classification accuracy because of involution’s wider coverage in the spatial dimension. The paper conducted extensive experiments on several benchmark vision tasks including Cifar-LT, Imagenet-LT, and Places-LT, showing that DC-Invo is able to achieve significant performance gained on long-tailed datasets.展开更多
Leveraging deep learning-based techniques to classify diseases has attracted extensive research interest in recent years.Nevertheless,most of the current studies only consider single-modal medical images,and the numbe...Leveraging deep learning-based techniques to classify diseases has attracted extensive research interest in recent years.Nevertheless,most of the current studies only consider single-modal medical images,and the number of ophthalmic diseases that can be classified is relatively small.Moreover,imbalanced data distribution of different ophthalmic diseases is not taken into consideration,which limits the application of deep learning techniques in realistic clinical scenes.In this paper,we propose a Multimodal Multi-disease Long-tailed Classification Network(M^(2)LC-Net)in response to the challenges mentioned above.M^(2)LC-Net leverages ResNet18-CBAM to extract features from fundus images and Optical Coherence Tomography(OCT)images,respectively,and conduct feature fusion to classify 11 common ophthalmic diseases.Moreover,Class Activation Mapping(CAM)is employed to visualize each mode to improve interpretability of M^(2)LC-Net.We conduct comprehensive experiments on realistic dataset collected from a Grade III Level A ophthalmology hospital in China,including 34,396 images of 11 disease labels.Experimental results demonstrate effectiveness of our proposed model M^(2)LC-Net.Compared with the stateof-the-art,various performance metrics have been improved significantly.Specifically,Cohen’s kappa coefficient κ has been improved by 3.21%,which is a remarkable improvement.展开更多
The long-tailed data distribution poses an enormous challenge for training neural networks in classification.A classification network can be decoupled into a feature extractor and a classifier.This paper takes a semi-...The long-tailed data distribution poses an enormous challenge for training neural networks in classification.A classification network can be decoupled into a feature extractor and a classifier.This paper takes a semi-discrete optimal transport(OT)perspective to analyze the long-tailed classification problem,where the feature space is viewed as a continuous source domain,and the classifier weights are viewed as a discrete target domain.The classifier is indeed to find a cell decomposition of the feature space with each cell corresponding to one class.An imbalanced training set causes the more frequent classes to have larger volume cells,which means that the classifier's decision boundary is biased towards less frequent classes,resulting in reduced classification performance in the inference phase.Therefore,we propose a novel OTdynamic softmax loss,which dynamically adjusts the decision boundary in the training phase to avoid overfitting in the tail classes.In addition,our method incorporates the supervised contrastive loss so that the feature space can satisfy the uniform distribution condition.Extensive and comprehensive experiments demonstrate that our method achieves state-ofthe-art performance on multiple long-tailed recognition benchmarks,including CIFAR-LT,ImageNet-LT,iNaturalist 2018,and Places-LT.展开更多
Skeleton-based action recognition has recently made significant progress.However,data imbalance is still a great challenge in real-world scenarios.The performance of current action recognition algorithms declines shar...Skeleton-based action recognition has recently made significant progress.However,data imbalance is still a great challenge in real-world scenarios.The performance of current action recognition algorithms declines sharply when training data suffers from heavy class imbalance.The imbalanced data actually degrades the representations learned by these methods and becomes the bottleneck for action recognition.How to learn unbiased representations from imbalanced action data is the key to long-tailed action recognition.In this paper,we propose a novel balanced representation learning method to address the long-tailed problem in action recognition.Firstly,a spatial-temporal action exploration strategy is presented to expand the sample space effectively,generating more valuable samples in a rebalanced manner.Secondly,we design a detached action-aware learning schedule to further mitigate the bias in the representation space.The schedule detaches the representation learning of tail classes from training and proposes an action-aware loss to impose more effective constraints.Additionally,a skip-type representation is proposed to provide complementary structural information.The proposed method is validated on four skeleton datasets,NTU RGB+D 60,NTU RGB+D 120,NW-UCLA and Kinetics.It not only achieves consistently large improvement compared to the state-of-the-art(SOTA)methods,but also demonstrates a superior generalization capacity through extensive experiments.Our code is available at https://github.com/firework8/BRL.展开更多
Federated learning(FL),a cutting-edge distributed machine learning training paradigm,aims to generate a global model by collaborating on the training of client models without revealing local private data.The co-occurr...Federated learning(FL),a cutting-edge distributed machine learning training paradigm,aims to generate a global model by collaborating on the training of client models without revealing local private data.The co-occurrence of non-independent and identically distributed(non-IID)and long-tailed distribution in FL is one challenge that substantially degrades aggregate performance.In this paper,we present a corresponding solution called federated dual-decoupling via model and logit calibration(FedDDC)for non-IID and long-tailed distributions.The model is characterized by three aspects.First,we decouple the global model into the feature extractor and the classifier to fine-tune the components affected by the joint problem.For the biased feature extractor,we propose a client confidence re-weighting scheme to assist calibration,which assigns optimal weights to each client.For the biased classifier,we apply the classifier re-balancing method for fine-tuning.Then,we calibrate and integrate the client confidence re-weighted logits with the re-balanced logits to obtain the unbiased logits.Finally,we use decoupled knowledge distillation for the first time in the joint problem to enhance the accuracy of the global model by extracting the knowledge of the unbiased model.Numerous experiments demonstrate that on non-IID and long-tailed data in FL,our approach outperforms state-of-the-art methods.展开更多
The prevalence of long-tailed distributions in real-world data often results in classification models favoring the dominant classes,neglecting the less frequent ones.Current approaches address the issues in long-taile...The prevalence of long-tailed distributions in real-world data often results in classification models favoring the dominant classes,neglecting the less frequent ones.Current approaches address the issues in long-tailed image classification by rebalancing data,optimizing weights,and augmenting information.However,these methods often struggle to balance the performance between dominant and minority classes because of inadequate representation learning of the latter.To address these problems,we introduce descriptional words into images as cross-modal privileged information and propose a cross-modal enhanced method for long-tailed image classification,referred to as CMLTNet.CMLTNet improves the learning of intraclass similarity of tail-class representations by cross-modal alignment and captures the difference between the head and tail classes in semantic space by cross-modal inference.After fusing the above information,CMLTNet achieved an overall performance that was better than those of benchmark long-tailed and cross-modal learning methods on the long-tailed cross-modal datasets,NUS-WIDE and VireoFood-172.The effectiveness of the proposed modules was further studied through ablation experiments.In a case study of feature distribution,the proposed model was better in learning representations of tail classes,and in the experiments on model attention,CMLTNet has the potential to help learn some rare concepts in the tail class through mapping to the semantic space.展开更多
Real-world data always exhibit an imbalanced and long-tailed distribution,which leads to poor performance for neural network-based classification.Existing methods mainly tackle this problem by reweighting the loss fun...Real-world data always exhibit an imbalanced and long-tailed distribution,which leads to poor performance for neural network-based classification.Existing methods mainly tackle this problem by reweighting the loss function or rebalancing the classifier.However,one crucial aspect overlooked by previous research studies is the imbalanced feature space problem caused by the imbalanced angle distribution.In this paper,the authors shed light on the significance of the angle distribution in achieving a balanced feature space,which is essential for improving model performance under long-tailed distributions.Nevertheless,it is challenging to effectively balance both the classifier norms and angle distribution due to problems such as the low feature norm.To tackle these challenges,the authors first thoroughly analyse the classifier and feature space by decoupling the classification logits into three key components:classifier norm(i.e.the magnitude of the classifier vector),feature norm(i.e.the magnitude of the feature vector),and cosine similarity between the classifier vector and feature vector.In this way,the authors analyse the change of each component in the training process and reveal three critical problems that should be solved,that is,the imbalanced angle distribution,the lack of feature discrimination,and the low feature norm.Drawing from this analysis,the authors propose a novel loss function that incorporates hyperspherical uniformity,additive angular margin,and feature norm regularisation.Each component of the loss function addresses a specific problem and synergistically contributes to achieving a balanced classifier and feature space.The authors conduct extensive experiments on three popular benchmark datasets including CIFAR-10/100-LT,ImageNet-LT,and iNaturalist 2018.The experimental results demonstrate that the authors’loss function outperforms several previous state-of-the-art methods in addressing the challenges posed by imbalanced and longtailed datasets,that is,by improving upon the best-performing baselines on CIFAR-100-LT by 1.34,1.41,1.41 and 1.33,respectively.展开更多
Intelligent detection and classification of kitchen waste can promote ecological sustainability by replacing inefficient manual processes.However,the presence of non-degradable waste mixed in kitchen waste often follo...Intelligent detection and classification of kitchen waste can promote ecological sustainability by replacing inefficient manual processes.However,the presence of non-degradable waste mixed in kitchen waste often follows a long-tailed distribution,making it challenging to train convolutional neural network-based object detectors,which results in the unsatisfactory detection of tailclass waste.To address this challenge,we propose a class-instance balanced detector(CIB-Det) for intelligent detection and classification of kitchen waste.CIB-Det implements two strategies for the loss function:the class-balanced strategy(CBS)and the instance-balanced strategy(IBS).The CBS focuses more on tail classes,and the IBS concentrates on hard-to-classify instances adaptively during training.Consequently,CIB-Det comprehensively and adaptively addresses the long-tailed issue.Our experiments on a real dataset of kitchen waste images support the effectiveness of CIB-Det for kitchen waste detection.展开更多
Objective To explore the semi-supervised learning(SSL) algorithm for long-tail endoscopic image classification with limited annotations.Method We explored semi-supervised long-tail endoscopic image classification in H...Objective To explore the semi-supervised learning(SSL) algorithm for long-tail endoscopic image classification with limited annotations.Method We explored semi-supervised long-tail endoscopic image classification in HyperKvasir,the largest gastrointestinal public dataset with 23 diverse classes.Semi-supervised learning algorithm FixMatch was applied based on consistency regularization and pseudo-labeling.After splitting the training dataset and the test dataset at a ratio of 4:1,we sampled 20%,50%,and 100% labeled training data to test the classification with limited annotations.Results The classification performance was evaluated by micro-average and macro-average evaluation metrics,with the Mathews correlation coefficient(MCC) as the overall evaluation.SSL algorithm improved the classification performance,with MCC increasing from 0.8761 to 0.8850,from 0.8983 to 0.8994,and from 0.9075 to 0.9095 with 20%,50%,and 100% ratio of labeled training data,respectively.With a 20% ratio of labeled training data,SSL improved both the micro-average and macro-average classification performance;while for the ratio of 50% and 100%,SSL improved the micro-average performance but hurt macro-average performance.Through analyzing the confusion matrix and labeling bias in each class,we found that the pseudo-based SSL algorithm exacerbated the classifier’ s preference for the head class,resulting in improved performance in the head class and degenerated performance in the tail class.Conclusion SSL can improve the classification performance for semi-supervised long-tail endoscopic image classification,especially when the labeled data is extremely limited,which may benefit the building of assisted diagnosis systems for low-volume hospitals.However,the pseudo-labeling strategy may amplify the effect of class imbalance,which hurts the classification performance for the tail class.展开更多
Consider a discrete-time insurance risk model. Within period i, i≥ 1, Xi and Yi denote the net insurance loss and the stochastic discount factor of an insurer, respectively. Assume that {(Xi, Yi), i≥1) form a seq...Consider a discrete-time insurance risk model. Within period i, i≥ 1, Xi and Yi denote the net insurance loss and the stochastic discount factor of an insurer, respectively. Assume that {(Xi, Yi), i≥1) form a sequence of independent and identically distributed random vectors following a common bivariate Sarmanov distribution. In the presence of heavy-tailed net insurance losses, an asymptotic formula is derived for the finite-time ruin probability.展开更多
Based on the previous bird and ecological environment survey and bird strike statistics of Chengdu Shuangliu International Airport, the author selected Long-tailed Shrike, one of the bird species causing the accident,...Based on the previous bird and ecological environment survey and bird strike statistics of Chengdu Shuangliu International Airport, the author selected Long-tailed Shrike, one of the bird species causing the accident, for stomach content analysis. The results showed that the feeding objects of Long-tailed Shrike were different in spring and summer, which was directly related to the seasonal food sources. However, they have something in common. The species that Long-tailed Shrike eats directly or indirectly in spring and summer are Carabidae, Columbidae, Muridae, Acridoidea, Cruciferae, Leguminosae, Moraceae, Gramineae, etc. These animal and plant species are widely distributed in the airport. We speculate that the plants and animals in the airport, as the food source of Long-tailed Shrike, to some extent attracted Long-tailed Shrike to come for food and activities. Therefore, we suggest to cut off the food source of brown-backed shrimps in the airport, such as turf management in the flight area, prevention and control of insects, soil animals and rodents, etc., so as to reduce the number of Long-tailed Shrike in the flight area and reduce its impact on flight safety.展开更多
The Waring distribution is an important two-parameter discrete distribution,commonly used in fields such as ecology,linguistics,and information science,where heavy tails are often observed.In this paper,we propose a n...The Waring distribution is an important two-parameter discrete distribution,commonly used in fields such as ecology,linguistics,and information science,where heavy tails are often observed.In this paper,we propose a new goodness-of-fit test for the Waring distribution,which is established through the hazard rate and a linear equivalent definition of the Waring distribution.We establish an asymptotic Chi-square null distribution for the proposed test and show that it is more powerful than classical methods in simulation studies.Finally,we apply the test to analyze the authorships of published papers on computer science.展开更多
Recently advancements in deep learning models have significantly facilitated the development of sequential recommender systems(SRS).However,the current deep model structures are limited in their ability to learn high-...Recently advancements in deep learning models have significantly facilitated the development of sequential recommender systems(SRS).However,the current deep model structures are limited in their ability to learn high-quality embeddings with insufficient data.Meanwhile,highly skewed long-tail distribution is very common in recommender systems.Therefore,in this paper,we focus on enhancing the representation of tail items to improve sequential recommendation performance.Through empirical studies on benchmarks,we surprisingly observe that both the ranking performance and training procedure are greatly hindered by the poorly optimized tail item embeddings.To address this issue,we propose a sequential recommendation framework named TailRec that enables contextual information of tail item well-leveraged and greatly improves its corresponding representation.Given the characteristics of the sequential recommendation task,the surrounding interaction records of each tail item are regarded as contextual information without leveraging any additional side information.This approach allows for the mining of contextual information from cross-sequence behaviors to boost the performance of sequential recommendations.Such a light contextual filtering component is plug-and-play for a series of SRS models.To verify the effectiveness of the proposed TailRec,we conduct extensive experiments over several popular benchmark recommenders.The experimental results demonstrate that TailRec can greatly improve the recommendation results and speed up the training process.The codes of our methods have been available.展开更多
The scatter-hoarding behavior of granivorous rodents plays an important role in seed dispersal and seedling regeneration of trees,as well as the evolution of several well-known mutualisms between trees and roden...The scatter-hoarding behavior of granivorous rodents plays an important role in seed dispersal and seedling regeneration of trees,as well as the evolution of several well-known mutualisms between trees and rodents in forest ecosystems.Because it is difficult to identify seed hoarders and pilferers under field conditions by traditional methods,the full costs incurred and benefits accrued by scatter-hoarding have not been fully evaluated in most systems.By using infrared radiation camera tracking and seed tagging,we investigated the benefits and losses of scatter-hoarded seeds(Camellia oleifera)for 3 sympatric rodent species(Apodemus draco,Niviventer confucianus and Leopoldamys edwardsi)in a subtropical forest of Southwest China during 2013 to 2015.We established the relationships between the rodents and the seeds at the individual level.For each rodent species,we calculated the cache recovery rate of cache owners,as well as conspecific and interspecific pilferage rates.We found that all 3 sympatric rodent species had a cache recovery advantage with rates that far exceeded average pilferage rates over a 30-day tracking period.The smallest species(A.draco)showed the highest rate of scatter-hoarding and the highest recovery advantage compared with the other 2 larger species(N.confucianus and L.edwardsi).Our results suggest that scatter-hoarding benefits cache owners in food competition,supporting the pilferage avoidance hypothesis.Therefore,scatter-hoarding behavior should be favored by natural selection,and plays a significant role in species coexistence of rodent community and in the formation of mutualism between seeds and rodents in forest ecosystems.展开更多
In this paper, we obtain sufficient and necessary conditions for local asymptotics for the maximum of a Markov modulated random walk with long-tailed increments and negative drifts, where the local asymptotics means a...In this paper, we obtain sufficient and necessary conditions for local asymptotics for the maximum of a Markov modulated random walk with long-tailed increments and negative drifts, where the local asymptotics means asymptotic behaviour of P( ∈ (x,x + z]) for each z 〉 0, as x→∞ Our results extend and improve the existing ones in the literature.展开更多
The genetic diversity and the spatial structure of a species are likely consequences of both past and recent evolutionary processes,but relevant studies are still rare in East Asia where the Pleistocene climate has un...The genetic diversity and the spatial structure of a species are likely consequences of both past and recent evolutionary processes,but relevant studies are still rare in East Asia where the Pleistocene climate has unique influences.In this study,we examined the impact of past climate change and recent anthropogenic activities on the genetic structure and population size of the greater long-tailed hamster(Tscherskia triton),an agricultural rodent pest species in northern China.DNA sequence data of 2 mitochondrial genes and genotypic data of 11 microsatellite DNA loci from 41 populations(545 individuals)were gathered.Phylogenetic and population genetic analyses,as well as species distribution modeling and coalescent simulations,were conducted to infer its historical and demographic patterns and processes.Two deeply diverged mitochondrial clades were recovered.A small one was restricted to the Shandong Peninsula while the main clade was further divided into 3 geographic clusters by their microsatellite DNA genotypes:Northwest,North-center and Northeast.Divergence dating indicated a Middle-to-Late Pleistocene divergence between the 2 clades.Demographic analysis indicated that all 3 and pooled populations showed consistent long-period expansions during last glacial period;but not during the Holocene,probably due to the impact of climate warming and human disturbances.Conflicting patterns between mtDNA and microsatellite markers imply an anthropogenic impact on North-center populations due to intensified agricultural cultivation in this region.Our study demonstrated that the impact of past glaciation on organisms in East Asia significantly differs from that of Europe and North America,and human activity is an important factor in determining the genetic diversity of a species,as well as its spatial structure.展开更多
Vehicle Color Recognition(VCR)plays a vital role in intelligent traffic management and criminal investigation assistance.However,the existing vehicle color datasets only cover 13 classes,which can not meet the current...Vehicle Color Recognition(VCR)plays a vital role in intelligent traffic management and criminal investigation assistance.However,the existing vehicle color datasets only cover 13 classes,which can not meet the current actual demand.Besides,although lots of efforts are devoted to VCR,they suffer from the problem of class imbalance in datasets.To address these challenges,in this paper,we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion(SMNN-MSFF).Specifically,to construct the benchmark of model training and evaluation,we first present a new VCR dataset with 24 vehicle classes,Vehicle Color-24,consisting of 10091 vehicle images from a 100-hour urban road surveillance video.Then,to tackle the problem of long-tail distribution and improve the recognition performance,we propose the SMNN-MSFF model with multiscale feature fusion and smooth modulation.The former aims to extract feature information from local to global,and the latter could increase the loss of the images of tail class instances for training with class-imbalance.Finally,comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR methods.And extensive ablation studies also demonstrate that each module of our method is effective,especially,the smooth modulation efficiently help feature learning of the minority or tail classes.Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.展开更多
基金Foundation item: This study was supported by the National Natural Science Foundation of China (31472011)ACKNOWLEDGEMENTS We are grateful to Peng ZHANG, Zheng CHEN, Jia-Hui WANG, and Hui-Jia YUAN of Beijing Normal University for field assistance, and staff from Henan Dongzhai National Nature Reserve for help during field work. We also thank editor for revising the English, and the two reviewers for their constructive comments, which have helped to improve the manuscript.
文摘Brooding is a major breeding investment of parental birds during the early nestling stage, and has important effects on the development and survival of nestlings. Investigating brooding behavior can help to understand avian breeding investment strategies. From January to June in 2013 and 2014, we studied the brooding behaviors of long-tailed tits (Aegithalos caudatus glaucogularis) in Dongzhai National Nature Reserve, Henan Province, China. We analyzed the relationships between parental diurnal brooding duration and nestling age, brood size, temperature, relative breeding season, time of day and nestling frequencies during brooding duration. Results showed that female and male long-tailed tit parents had different breeding investment strategies during the early nestling stage. Female parents bore most of the brooding investment, while male parents performed most of the nestling feedings. In addition, helpers were not found to brood nestlings at the two cooperative breeding nests. Parental brooding duration was significantly associated with the food delivered to nestlings (F=86.10, dr=l, 193.94, P〈0.001), and was longer when the nestlings received more food. We found that parental brooding duration declined significantly as nestlings aged (F=5.99, dr=-1, 50.13, P=0.018). When nestlings were six days old, daytime parental brooding almost ceased, implying that long- tailed tit nestlings might be able to maintain their own body temperature by this age. In addition, brooding duration was affected by both brood size (F=12.74, dr=-1,32.08, P=0.001) and temperature (F=5.83, df=-l, 39.59, P=-0.021), with it being shorter in larger broods and when ambient temperature was higher.
基金financially supported by the National Basic Research Program of China(973 Project,Grant No.2006CB701405)the China Geological Survey,and the National Natural Science Foundation of China(Grant No.40272008).
文摘Jinfengopteryx is a newly uncovered Archaeopteryx-like avialan bird outside Germany, which was found from the Jehol Biota of northern Hebei in northeastern China. It shares many characters only with Archaeopteryx by the possession of three fenestrae in the antorbital cavity, 23 caudal vertebrae and long tail feathers attached to all the caudal vertebrae. But the former differs from the latter in the relatively short and high preorbital region of skull, more and closely packed teeth, much shorter forelimb compared to hindlimb. Such differences indicate Jinfengopteryx is even slightly more primitive than Archaeopteryx, although both birds can be placed at the root position of the avialan tree based on cladistic analysis. Shenzhouraptor is suggested to be slightly more advanced than Jinfengopteryx + Archaeopteryx, supported by some derived features in teeth, shoulder girdles and forelimbs such as the reduction of tooth number, dorsolaterally directed glenoid facet, very long forelimb and comparatively short manus. Meanwhile, the tail of Shenzhouraptor shows more primitive characters than those of Jinfengopteryx and Archaeopteryx, e.g., the strikingly longer tail composed of more caudal vertebrae and the long tail feathers attached only to distal caudal segments. The mixed primitive and advanced characters reveal the evident mosaic evolution among long-tailed avialan birds.
文摘With the rapid increase of large-scale problems, the distribution of real-world datasets tends to be long-tailed. Existing solutions typically involve re-balancing strategies (i.e., re-sampling and re-weighting). Although they can significantly promote the classifier learning of deep networks, they will unexpectedly impair the representative ability of the learned deep features to a certain extent. Therefore, this paper proposes a dual-channel learning algorithm with involution neural networks (DC-Invo) to take care of representation learning and classifier learning concurrently. In this work, the most important thing is to combine ResNet and involution to obtain higher classification accuracy because of involution’s wider coverage in the spatial dimension. The paper conducted extensive experiments on several benchmark vision tasks including Cifar-LT, Imagenet-LT, and Places-LT, showing that DC-Invo is able to achieve significant performance gained on long-tailed datasets.
基金the National Natural Science Foundation of China(No.62076035)。
文摘Leveraging deep learning-based techniques to classify diseases has attracted extensive research interest in recent years.Nevertheless,most of the current studies only consider single-modal medical images,and the number of ophthalmic diseases that can be classified is relatively small.Moreover,imbalanced data distribution of different ophthalmic diseases is not taken into consideration,which limits the application of deep learning techniques in realistic clinical scenes.In this paper,we propose a Multimodal Multi-disease Long-tailed Classification Network(M^(2)LC-Net)in response to the challenges mentioned above.M^(2)LC-Net leverages ResNet18-CBAM to extract features from fundus images and Optical Coherence Tomography(OCT)images,respectively,and conduct feature fusion to classify 11 common ophthalmic diseases.Moreover,Class Activation Mapping(CAM)is employed to visualize each mode to improve interpretability of M^(2)LC-Net.We conduct comprehensive experiments on realistic dataset collected from a Grade III Level A ophthalmology hospital in China,including 34,396 images of 11 disease labels.Experimental results demonstrate effectiveness of our proposed model M^(2)LC-Net.Compared with the stateof-the-art,various performance metrics have been improved significantly.Specifically,Cohen’s kappa coefficient κ has been improved by 3.21%,which is a remarkable improvement.
基金supported by the National Key Research and Development Program of China under Grant No.2021YFA1003003the National Natural Science Foundation of China under Grant Nos.61936002 and T2225012.
文摘The long-tailed data distribution poses an enormous challenge for training neural networks in classification.A classification network can be decoupled into a feature extractor and a classifier.This paper takes a semi-discrete optimal transport(OT)perspective to analyze the long-tailed classification problem,where the feature space is viewed as a continuous source domain,and the classifier weights are viewed as a discrete target domain.The classifier is indeed to find a cell decomposition of the feature space with each cell corresponding to one class.An imbalanced training set causes the more frequent classes to have larger volume cells,which means that the classifier's decision boundary is biased towards less frequent classes,resulting in reduced classification performance in the inference phase.Therefore,we propose a novel OTdynamic softmax loss,which dynamically adjusts the decision boundary in the training phase to avoid overfitting in the tail classes.In addition,our method incorporates the supervised contrastive loss so that the feature space can satisfy the uniform distribution condition.Extensive and comprehensive experiments demonstrate that our method achieves state-ofthe-art performance on multiple long-tailed recognition benchmarks,including CIFAR-LT,ImageNet-LT,iNaturalist 2018,and Places-LT.
基金supported by the National Natural Science Foundation of China(Nos.62276263,62006225 and 62071468)the Strategic Priority Research Program of Chinese Academy of Sciences(CAS),China(No.XDA27040700)the National Key Research and Development Program of China(No.2022YFC3310400).
文摘Skeleton-based action recognition has recently made significant progress.However,data imbalance is still a great challenge in real-world scenarios.The performance of current action recognition algorithms declines sharply when training data suffers from heavy class imbalance.The imbalanced data actually degrades the representations learned by these methods and becomes the bottleneck for action recognition.How to learn unbiased representations from imbalanced action data is the key to long-tailed action recognition.In this paper,we propose a novel balanced representation learning method to address the long-tailed problem in action recognition.Firstly,a spatial-temporal action exploration strategy is presented to expand the sample space effectively,generating more valuable samples in a rebalanced manner.Secondly,we design a detached action-aware learning schedule to further mitigate the bias in the representation space.The schedule detaches the representation learning of tail classes from training and proposes an action-aware loss to impose more effective constraints.Additionally,a skip-type representation is proposed to provide complementary structural information.The proposed method is validated on four skeleton datasets,NTU RGB+D 60,NTU RGB+D 120,NW-UCLA and Kinetics.It not only achieves consistently large improvement compared to the state-of-the-art(SOTA)methods,but also demonstrates a superior generalization capacity through extensive experiments.Our code is available at https://github.com/firework8/BRL.
基金supported by the National Natural Science Foundation of China(No.61702321)。
文摘Federated learning(FL),a cutting-edge distributed machine learning training paradigm,aims to generate a global model by collaborating on the training of client models without revealing local private data.The co-occurrence of non-independent and identically distributed(non-IID)and long-tailed distribution in FL is one challenge that substantially degrades aggregate performance.In this paper,we present a corresponding solution called federated dual-decoupling via model and logit calibration(FedDDC)for non-IID and long-tailed distributions.The model is characterized by three aspects.First,we decouple the global model into the feature extractor and the classifier to fine-tune the components affected by the joint problem.For the biased feature extractor,we propose a client confidence re-weighting scheme to assist calibration,which assigns optimal weights to each client.For the biased classifier,we apply the classifier re-balancing method for fine-tuning.Then,we calibrate and integrate the client confidence re-weighted logits with the re-balanced logits to obtain the unbiased logits.Finally,we use decoupled knowledge distillation for the first time in the joint problem to enhance the accuracy of the global model by extracting the knowledge of the unbiased model.Numerous experiments demonstrate that on non-IID and long-tailed data in FL,our approach outperforms state-of-the-art methods.
基金supported in part by the National Natural Science Foundation of China(62006141)the National Key R&D Program of China(2021YFC3300203)+1 种基金the Overseas Innovation Team Project of the“20 Regulations for New Universities”Funding Program of Jinan(2021GXRC073)the Excellent Youth Scholars Program of Shandong Province(2022HWYQ-048).
文摘The prevalence of long-tailed distributions in real-world data often results in classification models favoring the dominant classes,neglecting the less frequent ones.Current approaches address the issues in long-tailed image classification by rebalancing data,optimizing weights,and augmenting information.However,these methods often struggle to balance the performance between dominant and minority classes because of inadequate representation learning of the latter.To address these problems,we introduce descriptional words into images as cross-modal privileged information and propose a cross-modal enhanced method for long-tailed image classification,referred to as CMLTNet.CMLTNet improves the learning of intraclass similarity of tail-class representations by cross-modal alignment and captures the difference between the head and tail classes in semantic space by cross-modal inference.After fusing the above information,CMLTNet achieved an overall performance that was better than those of benchmark long-tailed and cross-modal learning methods on the long-tailed cross-modal datasets,NUS-WIDE and VireoFood-172.The effectiveness of the proposed modules was further studied through ablation experiments.In a case study of feature distribution,the proposed model was better in learning representations of tail classes,and in the experiments on model attention,CMLTNet has the potential to help learn some rare concepts in the tail class through mapping to the semantic space.
基金National Key Research and Development Program of China,Grant/Award Numbers:2022YFB3103900,2023YFB3106504Major Key Project of PCL,Grant/Award Numbers:PCL2022A03,PCL2023A09+5 种基金Shenzhen Basic Research,Grant/Award Number:JCYJ20220531095214031Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies,Grant/Award Number:2022B1212010005Shenzhen International Science and Technology Cooperation Project,Grant/Award Number:GJHZ20220913143008015Natural Science Foundation of Guangdong Province,Grant/Award Number:2023A1515011959Shenzhen-Hong Kong Jointly Funded Project,Grant/Award Number:SGDX20230116091246007Shenzhen Science and Technology Program,Grant/Award Numbers:RCBS20221008093131089,ZDSYS20210623091809029。
文摘Real-world data always exhibit an imbalanced and long-tailed distribution,which leads to poor performance for neural network-based classification.Existing methods mainly tackle this problem by reweighting the loss function or rebalancing the classifier.However,one crucial aspect overlooked by previous research studies is the imbalanced feature space problem caused by the imbalanced angle distribution.In this paper,the authors shed light on the significance of the angle distribution in achieving a balanced feature space,which is essential for improving model performance under long-tailed distributions.Nevertheless,it is challenging to effectively balance both the classifier norms and angle distribution due to problems such as the low feature norm.To tackle these challenges,the authors first thoroughly analyse the classifier and feature space by decoupling the classification logits into three key components:classifier norm(i.e.the magnitude of the classifier vector),feature norm(i.e.the magnitude of the feature vector),and cosine similarity between the classifier vector and feature vector.In this way,the authors analyse the change of each component in the training process and reveal three critical problems that should be solved,that is,the imbalanced angle distribution,the lack of feature discrimination,and the low feature norm.Drawing from this analysis,the authors propose a novel loss function that incorporates hyperspherical uniformity,additive angular margin,and feature norm regularisation.Each component of the loss function addresses a specific problem and synergistically contributes to achieving a balanced classifier and feature space.The authors conduct extensive experiments on three popular benchmark datasets including CIFAR-10/100-LT,ImageNet-LT,and iNaturalist 2018.The experimental results demonstrate that the authors’loss function outperforms several previous state-of-the-art methods in addressing the challenges posed by imbalanced and longtailed datasets,that is,by improving upon the best-performing baselines on CIFAR-100-LT by 1.34,1.41,1.41 and 1.33,respectively.
基金supported by the National Key Research and Development Program of China (Grant No. 2021YFC1910402)。
文摘Intelligent detection and classification of kitchen waste can promote ecological sustainability by replacing inefficient manual processes.However,the presence of non-degradable waste mixed in kitchen waste often follows a long-tailed distribution,making it challenging to train convolutional neural network-based object detectors,which results in the unsatisfactory detection of tailclass waste.To address this challenge,we propose a class-instance balanced detector(CIB-Det) for intelligent detection and classification of kitchen waste.CIB-Det implements two strategies for the loss function:the class-balanced strategy(CBS)and the instance-balanced strategy(IBS).The CBS focuses more on tail classes,and the IBS concentrates on hard-to-classify instances adaptively during training.Consequently,CIB-Det comprehensively and adaptively addresses the long-tailed issue.Our experiments on a real dataset of kitchen waste images support the effectiveness of CIB-Det for kitchen waste detection.
文摘Objective To explore the semi-supervised learning(SSL) algorithm for long-tail endoscopic image classification with limited annotations.Method We explored semi-supervised long-tail endoscopic image classification in HyperKvasir,the largest gastrointestinal public dataset with 23 diverse classes.Semi-supervised learning algorithm FixMatch was applied based on consistency regularization and pseudo-labeling.After splitting the training dataset and the test dataset at a ratio of 4:1,we sampled 20%,50%,and 100% labeled training data to test the classification with limited annotations.Results The classification performance was evaluated by micro-average and macro-average evaluation metrics,with the Mathews correlation coefficient(MCC) as the overall evaluation.SSL algorithm improved the classification performance,with MCC increasing from 0.8761 to 0.8850,from 0.8983 to 0.8994,and from 0.9075 to 0.9095 with 20%,50%,and 100% ratio of labeled training data,respectively.With a 20% ratio of labeled training data,SSL improved both the micro-average and macro-average classification performance;while for the ratio of 50% and 100%,SSL improved the micro-average performance but hurt macro-average performance.Through analyzing the confusion matrix and labeling bias in each class,we found that the pseudo-based SSL algorithm exacerbated the classifier’ s preference for the head class,resulting in improved performance in the head class and degenerated performance in the tail class.Conclusion SSL can improve the classification performance for semi-supervised long-tail endoscopic image classification,especially when the labeled data is extremely limited,which may benefit the building of assisted diagnosis systems for low-volume hospitals.However,the pseudo-labeling strategy may amplify the effect of class imbalance,which hurts the classification performance for the tail class.
基金Supported by the National Natural Science Foundation of China(11001052,11171065,11326175)China Postdoctoral Science Foundation(2012M520964)+2 种基金Natural Science Foundation of Jiangsu Province ofChina(BK20131339)Postdoctoral Research Program of Jiangsu Province(1302015C)Qing Lan Project and Project of Construction for Superior Subjects of Statistics&Audit Science and Technology of Jiangsu Higher Education Institutions
文摘Consider a discrete-time insurance risk model. Within period i, i≥ 1, Xi and Yi denote the net insurance loss and the stochastic discount factor of an insurer, respectively. Assume that {(Xi, Yi), i≥1) form a sequence of independent and identically distributed random vectors following a common bivariate Sarmanov distribution. In the presence of heavy-tailed net insurance losses, an asymptotic formula is derived for the finite-time ruin probability.
文摘Based on the previous bird and ecological environment survey and bird strike statistics of Chengdu Shuangliu International Airport, the author selected Long-tailed Shrike, one of the bird species causing the accident, for stomach content analysis. The results showed that the feeding objects of Long-tailed Shrike were different in spring and summer, which was directly related to the seasonal food sources. However, they have something in common. The species that Long-tailed Shrike eats directly or indirectly in spring and summer are Carabidae, Columbidae, Muridae, Acridoidea, Cruciferae, Leguminosae, Moraceae, Gramineae, etc. These animal and plant species are widely distributed in the airport. We speculate that the plants and animals in the airport, as the food source of Long-tailed Shrike, to some extent attracted Long-tailed Shrike to come for food and activities. Therefore, we suggest to cut off the food source of brown-backed shrimps in the airport, such as turf management in the flight area, prevention and control of insects, soil animals and rodents, etc., so as to reduce the number of Long-tailed Shrike in the flight area and reduce its impact on flight safety.
基金supported by the National Natural Science Foundation of China 12371265Natural Science Foundation of Shanghai 21ZR1420700Fundamental Research Funds for the Central Universities(2022QKT001).
文摘The Waring distribution is an important two-parameter discrete distribution,commonly used in fields such as ecology,linguistics,and information science,where heavy tails are often observed.In this paper,we propose a new goodness-of-fit test for the Waring distribution,which is established through the hazard rate and a linear equivalent definition of the Waring distribution.We establish an asymptotic Chi-square null distribution for the proposed test and show that it is more powerful than classical methods in simulation studies.Finally,we apply the test to analyze the authorships of published papers on computer science.
基金the National Key R&D Program of China(No.2021YFF0901003)。
文摘Recently advancements in deep learning models have significantly facilitated the development of sequential recommender systems(SRS).However,the current deep model structures are limited in their ability to learn high-quality embeddings with insufficient data.Meanwhile,highly skewed long-tail distribution is very common in recommender systems.Therefore,in this paper,we focus on enhancing the representation of tail items to improve sequential recommendation performance.Through empirical studies on benchmarks,we surprisingly observe that both the ranking performance and training procedure are greatly hindered by the poorly optimized tail item embeddings.To address this issue,we propose a sequential recommendation framework named TailRec that enables contextual information of tail item well-leveraged and greatly improves its corresponding representation.Given the characteristics of the sequential recommendation task,the surrounding interaction records of each tail item are regarded as contextual information without leveraging any additional side information.This approach allows for the mining of contextual information from cross-sequence behaviors to boost the performance of sequential recommendations.Such a light contextual filtering component is plug-and-play for a series of SRS models.To verify the effectiveness of the proposed TailRec,we conduct extensive experiments over several popular benchmark recommenders.The experimental results demonstrate that TailRec can greatly improve the recommendation results and speed up the training process.The codes of our methods have been available.
基金supported by the key project of the Chinese Academy of Sciences(XDB11050300,KSZD-EW-TZ-008)the key project of the National Natural Science Foundation of China(31330013).
文摘The scatter-hoarding behavior of granivorous rodents plays an important role in seed dispersal and seedling regeneration of trees,as well as the evolution of several well-known mutualisms between trees and rodents in forest ecosystems.Because it is difficult to identify seed hoarders and pilferers under field conditions by traditional methods,the full costs incurred and benefits accrued by scatter-hoarding have not been fully evaluated in most systems.By using infrared radiation camera tracking and seed tagging,we investigated the benefits and losses of scatter-hoarded seeds(Camellia oleifera)for 3 sympatric rodent species(Apodemus draco,Niviventer confucianus and Leopoldamys edwardsi)in a subtropical forest of Southwest China during 2013 to 2015.We established the relationships between the rodents and the seeds at the individual level.For each rodent species,we calculated the cache recovery rate of cache owners,as well as conspecific and interspecific pilferage rates.We found that all 3 sympatric rodent species had a cache recovery advantage with rates that far exceeded average pilferage rates over a 30-day tracking period.The smallest species(A.draco)showed the highest rate of scatter-hoarding and the highest recovery advantage compared with the other 2 larger species(N.confucianus and L.edwardsi).Our results suggest that scatter-hoarding benefits cache owners in food competition,supporting the pilferage avoidance hypothesis.Therefore,scatter-hoarding behavior should be favored by natural selection,and plays a significant role in species coexistence of rodent community and in the formation of mutualism between seeds and rodents in forest ecosystems.
基金Supported by National Natural Science Foundation of China (Grant Nos. 10901164 and 10771216) and the Scientific Research Foundation for the Returned Overseas Chinese Scholars, Ministry of Education We are grateful to Professor Zhenting Hou and the referees for many valuable comments on the draft of this paper. In particular, we wish to thank a referee for pointing out a related new topic stated in Remark 3.9.
文摘In this paper, we obtain sufficient and necessary conditions for local asymptotics for the maximum of a Markov modulated random walk with long-tailed increments and negative drifts, where the local asymptotics means asymptotic behaviour of P( ∈ (x,x + z]) for each z 〉 0, as x→∞ Our results extend and improve the existing ones in the literature.
基金This research is supported by a grant from Knowledge Innovation Program of The Chinese Academy of Sciences(number KSCX3-EW-N-005)a grant from the National Basic Research Program(2007CB109101)of the Ministry of Science and Technology,China.We are grateful to Lifeng Zhu,Yibo Hu and Liling Huang for their technical advice on data analysis,to Alexei Drummond for his suggestions on Bayesian skyline plot analysis,to Fuwen Wei for valuable discussion and comments on this manuscriptand to Cam Hudson and Terry Boyd-Zhang for improving the English writing of this manuscript.
文摘The genetic diversity and the spatial structure of a species are likely consequences of both past and recent evolutionary processes,but relevant studies are still rare in East Asia where the Pleistocene climate has unique influences.In this study,we examined the impact of past climate change and recent anthropogenic activities on the genetic structure and population size of the greater long-tailed hamster(Tscherskia triton),an agricultural rodent pest species in northern China.DNA sequence data of 2 mitochondrial genes and genotypic data of 11 microsatellite DNA loci from 41 populations(545 individuals)were gathered.Phylogenetic and population genetic analyses,as well as species distribution modeling and coalescent simulations,were conducted to infer its historical and demographic patterns and processes.Two deeply diverged mitochondrial clades were recovered.A small one was restricted to the Shandong Peninsula while the main clade was further divided into 3 geographic clusters by their microsatellite DNA genotypes:Northwest,North-center and Northeast.Divergence dating indicated a Middle-to-Late Pleistocene divergence between the 2 clades.Demographic analysis indicated that all 3 and pooled populations showed consistent long-period expansions during last glacial period;but not during the Holocene,probably due to the impact of climate warming and human disturbances.Conflicting patterns between mtDNA and microsatellite markers imply an anthropogenic impact on North-center populations due to intensified agricultural cultivation in this region.Our study demonstrated that the impact of past glaciation on organisms in East Asia significantly differs from that of Europe and North America,and human activity is an important factor in determining the genetic diversity of a species,as well as its spatial structure.
基金This work was supported by the National Natural Science Foundation of China(Grant No.62071378)the Shaanxi Province International Science and Technology Cooperation Program(2022KW-04)the Xi’an Science and Technology Plan Project(21XJZZ0072).
文摘Vehicle Color Recognition(VCR)plays a vital role in intelligent traffic management and criminal investigation assistance.However,the existing vehicle color datasets only cover 13 classes,which can not meet the current actual demand.Besides,although lots of efforts are devoted to VCR,they suffer from the problem of class imbalance in datasets.To address these challenges,in this paper,we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion(SMNN-MSFF).Specifically,to construct the benchmark of model training and evaluation,we first present a new VCR dataset with 24 vehicle classes,Vehicle Color-24,consisting of 10091 vehicle images from a 100-hour urban road surveillance video.Then,to tackle the problem of long-tail distribution and improve the recognition performance,we propose the SMNN-MSFF model with multiscale feature fusion and smooth modulation.The former aims to extract feature information from local to global,and the latter could increase the loss of the images of tail class instances for training with class-imbalance.Finally,comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR methods.And extensive ablation studies also demonstrate that each module of our method is effective,especially,the smooth modulation efficiently help feature learning of the minority or tail classes.Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.