Rozdziały
Stały URI dla kolekcjihttp://hdl.handle.net/11652/4775
Przeglądaj
7 wyniki
collection.search.results.head
Pozycja Mixing Synthetic and Real-world Datasets Strategy for Improved Generalization of the CNN(Wydawnictwo Politechniki Łódzkiej, 2023) Młodzikowski, Kamil; Belter, DominikIn this paper, we deal with the problem of supervised training neural networks with an insufficient number of real-world training examples. We propose a method that at the beginning trains the neural network using a relatively simple synthetic dataset. In the following epochs, we add more challenging and real-life images to the training dataset. We compare the proposed strategy with other methods of using artificial and real-world datasets for training the neural network. The obtained results show that the proposed strategy allows for obtaining the neural network with higher generalization capabilities than competitive methods.Pozycja Improving RGB-D Visual Odometry with Depth Learned from a Better Sensor’s Output(Wydawnictwo Politechniki Łódzkiej, 2023) Kostusiak, AleksanderThis paper compares the results obtained from an indoor Visual Odometry (VO) system with RGB-D images provided by a Kinect v1 camera against those achieved by a VO with enhanced depth channel. For this purpose, we have used two classic image inpainting methods and a deeplearning approach for scene depth estimation employing Kinect v2 depth maps as reference data. The ability to enhance lower-quality data is crucial to reduce the cost of VO applications because higher-quality information can be infused through deep learning in systems using budget sensors.Pozycja A New Approach to Learning of 3D Characteristic Points for Vehicle Pose Estimation(Wydawnictwo Politechniki Łódzkiej, 2023) Nowak, Tomasz; Skrzypczyński, PiotrThis article discusses the challenges of estimating the pose of a vehicle from monocular images in an uncontrolled environment. We propose a new neural network architecture that learns 3D characteristic points of vehicles from image crops and coordinates of 2D keypoints on images. To facilitate supervised training of this network, we pre-process the ApolloCar3D dataset to obtain labelled 3D characteristic points of different car models. We evaluate our approach on the ApolloCar3D benchmark and demonstrate results competitive to state-of-the-art methods.Pozycja Towards Ontology-Driven Verification of Car Claims Settlement(Wydawnictwo Politechniki Łódzkiej, 2023) Pancerz, Krzysztof; Wolski, JacekIn the paper, we outline an intelligent tool enabling the users to automatize the process of verification of the car claims settlement. Two data sources power the tool. The first one is the source of car images in which damaged elements are recognized. The second one is the source of PDF files in which cost estimates are extracted. The designed ontology of car repair, described in the paper, is used both in the pre-processing step and in recognition of a typical situations.Pozycja Text-to-music Models and Their Evaluation Methods(Wydawnictwo Politechniki Łódzkiej, 2023) Modrzejewski, Mateusz; Rokita, PrzemysławText-to-music models are a very recent approach to generative music, allowing to generate music based on an abstract, rich description input in natural language. In this paper, we propose guidelines for evaluation in text-to-music models, highlighting the need for musical insight and clear descriptions of perceptual quality upon investigating the metrics of currently developed approaches. We also present a critical analysis of the capabilities and evaluation methods of the pioneering text-to-music models.Pozycja Spotting Advertisements from Above: Billboard Detection and Segmentation in UAV Imagery(Wydawnictwo Politechniki Łódzkiej, 2023) Ptak, Bartosz; Dominiak, Jan; Kraft, MarekIn this work, deep-learning methods were researched for billboard detection in urban environments. Billboards are one of the adversarial visual pollutants occurring in cities, causing over-saturation of visual stimulation. Due to this, we develop an algorithm that helps in the analysis and management of urban space. We utilise near real-time object detection methods to detect and segment them on images registered by unmanned aerial vehicles (UAVs). Research is based on recent algorithms from the YOLO family with modified heads for the instance segmentation task. We gathered images and prepared hand-annotated labels for training and evaluation purposes of deep learning approaches. We reached the mAP@0.5 metric of 0.61 for detection and 0.60 for segmentation, enabling us to develop smart city applications.Pozycja Fault Diagnosis in a Squirrel-Cage Induction Motor Using Thermal Imaging(Wydawnictwo Politechniki Łódzkiej, 2023) Piechocki, Mateusz; Kraft, Marek; Pajchrowski, TomaszFault diagnosis is a vivid topic in industrial applications or intelligent building solutions. One of the well-established techniques involves the measurement and analysis of current signals. However, this method has several significant drawbacks, such as the inability to inspect during machinery operation or the lack of precise information on the malfunction location. This article proposes a non-invasive method for squirrel-cage induction motor’s state classification and fault diagnosis. The approach is based on thermal image analysis that utilizes a compact convolution neural network. In addition, the gathered and annotated image set, which consists of thermal images with 640 x 512 pixels resolution, is presented.