US12593956B2
Method for building image reading model based on capsule endoscope, device, and medium
Applicants
ANKON TECHNOLOGIES CO., LTD, ANX IP HOLDING PTE. LTD.
Inventors
Tianyi Yangdai
Abstract
The present invention provides a method for building an image reading model based on a capsule endoscope, a device, and a medium. The method includes: driving the capsule endoscope to move within a working area, sequentially recording position coordinates and field of view orientations of the capsule endoscope when it reaches each positioning point at a predetermined first frequency, and driving the capsule endoscope to sequentially capture and record images at a predetermined second frequency; constructing a 3D model corresponding to the outer contour of the working area based on the recorded position coordinates of the capsule endoscope at each positioning point; and mapping the recorded images onto the 3D model to create an image reading model. Mapping the obtained images onto a 3D model of the working area enhances the visualization of the examination, facilitates observation, saves image reading time, and increases detection efficiency.
Description
CROSS-REFERENCE OF RELATED APPLICATIONS
[0001]The application claims priority from Chinese Patent Application No. 202110296737.2, filed Mar. 19, 2021, entitled “Method for Building Image Reading Model Based on Capsule Endoscope, Device, and Medium”, which is incorporated herein by reference in its entirety.
FIELD OF INVENTION
[0002]The present invention relates to the field of medical devices, and more particularly to a method for building an image reading model based on a capsule endoscope, an electronic device, and a readable storage medium.
BACKGROUND
[0003]Capsule endoscopes are increasingly used for gastrointestinal examinations. A capsule endoscope is ingested and passes through the oral cavity, esophagus, stomach, small intestine, large intestine, and is ultimately expelled from the body. Typically, the capsule endoscope moves passively along with gastrointestinal peristalsis, capturing images at a certain frame rate during this process. The images are then used by a physician to assess the health condition of various regions of a patient's gastrointestinal tract.
[0004]Taking gastric examination with a capsule endoscope as an example, the position and orientation of the capsule endoscope can be adjusted under the control of an external magnetic system to conduct a more thorough examination of the stomach. A gastric examination usually lasts 10 to 20 minutes, resulting in approximately 2400 to 4800 images when captured at a frame rate of 4 frames per second (fps). After the examination, the image data are uploaded for review by medical professionals. Usually, medical professionals review the uploaded image data in a passive manner: they can only observe the images in the order in which they were captured, or select them using a progress bar. This approach lacks the spatial context that relates the images to the actual structures within the digestive tract, so medical professionals need to rely on their own deduction and spatial reconstruction to interpret the images. This can interfere with understanding the images and with assessing the completeness of the examination, especially for less experienced examiners, who may find it challenging to review image data displayed in this manner.
SUMMARY OF THE INVENTION
[0005]In order to solve the above technical problems in the prior art, it is an object of the present invention to provide a method for building an image reading model based on a capsule endoscope, an electronic device, and a readable storage medium.
- [0007]constructing a 3D model corresponding to the outer contour of the working area based on the recorded position coordinates of the capsule endoscope at each positioning point;
- [0008]mapping the recorded images onto the 3D model to create the image reading model.
- [0010]obtaining all position coordinates of the capsule endoscope to form a raw point cloud data;
- [0011]generating a 3D model corresponding to the outer contour of the working area after applying Gaussian filtering, voxelization, voxel shell extraction, smoothing filtering, and surface reconstruction sequentially to the raw point cloud data; where
- [0012]the 3D model is represented by Ω(p),
[0013]
- [0015]dividing the 3D model into a plurality of sub-areas according to the structure of the working area;
- [0016]mapping the recorded images to each sub-area to create a set of sub-area images, with each image mapped to a unique sub-area;
- [0017]merging the set of the sub-area images on the 3D model to form the image reading model.
- [0019]iterating through each image and obtaining the positioning point with the closest capture time to the current image;
- [0020]planning a virtual ray using the position coordinates of the obtained positioning point as a starting point and the corresponding field of view orientation as an extending direction, and obtaining an intersection point between the virtual ray and the 3D model;
- [0021]obtaining the sub-area to which the position coordinates of current intersection point belong, and mapping the current image to the sub-area to which the position coordinates of current intersection point belong to form the set of the sub-area images.
[0022]In an embodiment of the present invention, the method further comprises: setting the first frequency to be higher than the second frequency.
[0023]In an embodiment of the present invention, the method further comprises: applying interpolation filtering over the time sequence to supplement missing positioning points based on the existing positioning points.
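As an illustration only, the supplementing of missing positioning points can be sketched with linear interpolation over the time sequence. The source does not specify the interpolation scheme; the linear scheme, the function name `fill_missing_positions`, and the `period` parameter (the nominal interval between positioning points) are assumptions made for this sketch. Orientations are not handled here.

```python
import numpy as np

def fill_missing_positions(times, positions, period):
    """Resample recorded positions onto a regular time grid.

    Assumption: missing positioning points are filled by linear
    interpolation between the existing neighbouring points.
    times     -- increasing capture times of the existing points (seconds)
    positions -- (N, 3) array-like of x, y, z coordinates
    period    -- nominal interval between positioning points (seconds)
    """
    times = np.asarray(times, dtype=float)
    positions = np.asarray(positions, dtype=float)
    # Number of samples on the regular grid covering the recorded span.
    n = int(round((times[-1] - times[0]) / period)) + 1
    full_times = times[0] + period * np.arange(n)
    # Interpolate each coordinate independently over time.
    filled = np.column_stack(
        [np.interp(full_times, times, positions[:, k]) for k in range(3)]
    )
    return full_times, filled
```

For instance, with points recorded at 0 s, 0.1 s, and 0.3 s and a period of 0.1 s, the sketch inserts a point at 0.2 s lying midway between its two neighbours.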
- [0025]verifying the images in each set of the sub-area images, and/or verifying the quality of the images in each set of the sub-area images; where
- [0026]if an image does not belong to its current set, and/or if an image quality score of the image is below a preset value, moving the current image to the cross-verification set.
- [0028]identifying and labeling the attribute for each image in each set of the sub-area images;
- [0029]grouping the images with similar labels in each set of the sub-area images into one group;
- [0030]generating a mapping identifier separately for each group of images with similar labels on the image reading model.
[0031]In order to achieve one of the above objects of the present invention, one embodiment of the present invention provides an electronic device, comprising a memory and a processor. The memory stores a computer program that can run on the processor, and the processor executes the program to implement the steps of the method for building an image reading model based on a capsule endoscope.
[0032]In order to achieve one of the above objects of the present invention, one embodiment of the present invention provides a computer-readable storage medium for storing a computer program. The computer program is executed by the processor to implement the steps of the method for building an image reading model based on a capsule endoscope.
[0033]The present invention has the following advantages compared with the prior art. The present invention provides the method for building the image reading model based on the capsule endoscope, the electronic device, and the readable storage medium, which, by mapping the obtained images onto a 3D model of the working area, enhance the visualization of the examination, facilitate observation, save image reading time, and increase detection efficiency.
BRIEF DESCRIPTION OF THE DRAWINGS
[0034]
[0035]
[0036]
[0037]
[0038]
DETAILED DESCRIPTION
[0039]The present invention will be described in detail below with reference to the accompanying drawings and preferred embodiments. However, the embodiments are not intended to limit the invention, and structural, methodological, or functional changes made by those skilled in the art in accordance with the embodiments are included in the scope of the present invention.
- [0041]step S1, driving the capsule endoscope to move within a working area, sequentially recording position coordinates and field of view orientations of the capsule endoscope when it reaches each positioning point at a predetermined first frequency, and driving the capsule endoscope to sequentially capture images at a predetermined second frequency and record the images;
- [0042]step S2, constructing a 3D model corresponding to the outer contour of the working area based on the recorded position coordinates of the capsule endoscope at each positioning point;
- [0043]step S3, mapping the recorded images onto the 3D model to create an image reading model.
[0044]After the capsule endoscope moves into the working area, each working point is recorded at a predetermined frequency; depending on specific requirements, the spatial coordinate value P(x, y, z) and the field of view orientation V of each working point are recorded. The field of view orientation here refers to the orientation of the capsule endoscope, which may be expressed, for example, as Euler angles (yaw, pitch, roll), as a quaternion, or as an orientation vector. From the field of view orientation, the field of view of the capsule endoscope when capturing an image in the orientation V at the current coordinate point can be determined. The field of view forms a cone with the current coordinate point as its apex, and the vector direction $\overrightarrow{PV}$ is the extension of the axis of the cone. Capturing images with the capsule endoscope, determining its position coordinates, and recording its field of view orientation are all established techniques in the prior art.
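By way of a non-limiting sketch, when the field of view orientation is stored as a quaternion, the direction of the cone axis can be obtained by rotating a reference axis. The source does not fix a convention; the sketch assumes a unit quaternion in (w, x, y, z) order and assumes that the optical axis of the capsule endoscope is its local +z axis. The function name `quat_to_view_direction` is hypothetical.

```python
import math

def quat_to_view_direction(w, x, y, z):
    """Return the viewing direction for an orientation quaternion.

    Assumptions (not stated in the source): the quaternion is given as
    (w, x, y, z), and the optical axis in the capsule's own frame is
    (0, 0, 1). The result is that axis rotated into the world frame.
    """
    norm = math.sqrt(w * w + x * x + y * y + z * z)
    if norm == 0.0:
        raise ValueError("orientation quaternion must be non-zero")
    w, x, y, z = w / norm, x / norm, y / norm, z / norm
    # Third column of the rotation matrix built from the quaternion,
    # i.e. the image of the local +z axis.
    return (
        2.0 * (x * z + w * y),
        2.0 * (y * z - w * x),
        1.0 - 2.0 * (x * x + y * y),
    )
```

The resulting direction, together with the coordinate point P, defines the ray along the axis of the field of view cone.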
[0045]For example, the present invention incorporates the content of Chinese patent application 201911188050.6, entitled “Positioning system and method for swallowable device”, which describes a system and method for locating a swallowable device, such as a capsule endoscope, to obtain its position coordinates and field of view orientation; it is not elaborated further here.
- [0047]applying Gaussian filtering, voxelization, voxel shell extraction, smoothing filtering, and surface reconstruction sequentially to the raw point cloud data to generate a 3D model corresponding to the outer contour of the working area.
[0048]The 3D model is represented by Ω(p),
[0049]
[0050]In this specific example, the working area is the stomach. While operating within the working area, the capsule endoscope may float in the fluid inside the stomach, remain attached to the inner wall, rotate, or flip. This may result in a very dense point cloud, as shown in the leftmost image in
[0051]In practical applications, the 3D model can be visualized on a computer front-end display interface, and the visual angle of the 3D model can be changed using external input devices such as a mouse or a touchscreen. The reconstructed 3D model contains only the surface data of the working area; that is, as shown in the following formula, the data Ω of the 3D model contains only the surface data of the model.
[0052]
[0053]In practical applications, step S2 can be implemented in various ways; that is, there are a plurality of approaches for processing known point cloud data to generate 3D surface models, which are not elaborated further here.
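As one possible illustration of two of the stages named for step S2, the following sketch voxelizes a point cloud and extracts the voxel shell. It is only a sketch under stated assumptions: the function names, the `voxel_size` parameter, and the 6-neighbour shell criterion are hypothetical choices, and Gaussian filtering, smoothing filtering, and surface reconstruction are omitted.

```python
import numpy as np

def voxelize(points, voxel_size):
    """Mark every voxel that contains at least one recorded position."""
    pts = np.asarray(points, dtype=float)
    origin = pts.min(axis=0)
    idx = np.floor((pts - origin) / voxel_size).astype(int)
    grid = np.zeros(idx.max(axis=0) + 1, dtype=bool)
    grid[idx[:, 0], idx[:, 1], idx[:, 2]] = True
    return grid, origin

def voxel_shell(grid):
    """Keep occupied voxels that have at least one empty face neighbour.

    Assumption: "voxel shell extraction" is approximated here by a
    6-neighbour test; voxels outside the grid count as empty.
    """
    padded = np.pad(grid, 1, constant_values=False)
    core = padded[1:-1, 1:-1, 1:-1]
    # True where all six face neighbours are occupied.
    enclosed = (
        padded[:-2, 1:-1, 1:-1] & padded[2:, 1:-1, 1:-1]
        & padded[1:-1, :-2, 1:-1] & padded[1:-1, 2:, 1:-1]
        & padded[1:-1, 1:-1, :-2] & padded[1:-1, 1:-1, 2:]
    )
    return core & ~enclosed
```

Because the capsule endoscope moves throughout the cavity and yields a dense point cloud, the occupied voxels tend to fill the interior, so the shell of the occupied region approximates the outer contour that a subsequent surface reconstruction step would turn into a mesh.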
- [0055]step S31, dividing the 3D model into a plurality of sub-areas according to the structure of the working area;
- [0056]step S32, mapping the recorded images to each sub-area to create a set of sub-area images, with each image mapped to a unique sub-area;
- [0057]step S33, merging the set of the sub-area images on the 3D model to form the image reading model.
[0058]In the computer front-end display interface, when a sub-area of the image reading model is selected, the corresponding set of the sub-area images is opened in association with it, and any one of the images in the current set of the sub-area images can be selectively opened.
[0059]For step S31, the quantity of the sub-areas may be set as required. In a preferred embodiment of the present invention, since the environment in which the capsule endoscope operates is typically a cavity formed by anatomical structures, the sub-areas can be divided based on the specific classification of the anatomical structures. This means that anatomical structures with similar attributes can be grouped together into one sub-area, or anatomical structures with similar attributes can be classified into multiple sub-areas. Preferably, each sub-area corresponds to only one anatomical structure. This allows for a more specific classification, which facilitates the subsequent application of the image reading model.
[0060]Referring to
[0061]Preferably, step S32 comprises: step S321, matching a time sequence, iterating through each image and obtaining the positioning point with the closest capture time to the current image; step S322, planning a virtual ray using the position coordinates of the obtained positioning point as a starting point and the corresponding field of view orientation as an extending direction, and obtaining an intersection point between the virtual ray and the 3D model; step S323, obtaining the sub-area to which the position coordinates of the current intersection point belong, and mapping the current image to that sub-area to form a set of the sub-area images.
[0062]Preferably, for step S321, while the position of the capsule endoscope is being located, the number of positioning points may decrease due to environmental factors such as signal interference and motion disturbances. Thus, the first frequency is set higher than the second frequency; that is, the positioning frequency is set higher than the image capture frequency. For example, in a specific example of the present invention, the first frequency is set to 40 to 100 Hz, and the second frequency is set lower than 30 fps. As a result, even if some positioning points are lost, the positioning results remain more densely spaced in time than the captured images, so that each image can be matched to a positioning point with a closer acquisition time.
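The time-sequence matching of step S321 can be sketched as a nearest-timestamp search. This is a minimal sketch, assuming that capture times of the positioning points are available as a sorted list; the function names and the tie-breaking rule (the earlier point wins) are assumptions.

```python
from bisect import bisect_left

def nearest_positioning_index(point_times, image_time):
    """Index of the positioning point closest in time to one image.

    point_times must be sorted in increasing order. On a tie, the
    earlier positioning point is chosen (an arbitrary assumption).
    """
    if not point_times:
        raise ValueError("no positioning points recorded")
    i = bisect_left(point_times, image_time)
    if i == 0:
        return 0
    if i == len(point_times):
        return len(point_times) - 1
    before, after = point_times[i - 1], point_times[i]
    return i if (after - image_time) < (image_time - before) else i - 1

def match_images_to_points(point_times, image_times):
    """Iterate through every image and pick its nearest positioning point."""
    return [nearest_positioning_index(point_times, t) for t in image_times]
```

With positioning points every 20 ms (50 Hz) and images roughly every 33 ms, each image lies within 10 ms of some positioning point, which reflects the benefit of setting the first frequency higher than the second.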
[0063]Preferably, as illustrated in
[0064]Continuing with the example shown in
[0065]Further, in accordance with step S323, based on the area division of the 3D model, the sub-area to which the intersection point Q belongs can be obtained. In this example, point Q belongs to the sub-area 11. At this point, the image corresponding to point P is mapped to the sub-area 11, forming one of the images in the set of the sub-area images for the sub-area 11.
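Steps S322 and S323 can be illustrated with a ray cast against a triangle mesh. The source does not name an intersection algorithm or a mesh representation; the sketch below assumes the 3D model is a triangle mesh, uses the well-known Möller–Trumbore ray-triangle test, and assumes a hypothetical `triangle_sub_area` list giving the sub-area label of each triangle.

```python
import numpy as np

def ray_triangle(origin, direction, v0, v1, v2, eps=1e-9):
    """Möller–Trumbore test: distance t along the ray, or None if no hit."""
    e1, e2 = v1 - v0, v2 - v0
    h = np.cross(direction, e2)
    a = np.dot(e1, h)
    if abs(a) < eps:          # ray parallel to the triangle plane
        return None
    f = 1.0 / a
    s = origin - v0
    u = f * np.dot(s, h)
    if u < 0.0 or u > 1.0:
        return None
    q = np.cross(s, e1)
    v = f * np.dot(direction, q)
    if v < 0.0 or u + v > 1.0:
        return None
    t = f * np.dot(e2, q)
    return t if t > eps else None   # only hits in front of the start point

def map_image_to_sub_area(P, V, vertices, triangles, triangle_sub_area):
    """Cast a ray from P along V; return (Q, sub_area) of the nearest hit."""
    P = np.asarray(P, dtype=float)
    V = np.asarray(V, dtype=float)
    vertices = np.asarray(vertices, dtype=float)
    best_t, best_k = None, None
    for k, (i0, i1, i2) in enumerate(triangles):
        t = ray_triangle(P, V, vertices[i0], vertices[i1], vertices[i2])
        if t is not None and (best_t is None or t < best_t):
            best_t, best_k = t, k
    if best_k is None:
        return None, None
    return P + best_t * V, triangle_sub_area[best_k]
```

Choosing the nearest hit in front of P corresponds to the first surface that the capsule endoscope actually faces; an image whose ray hits nothing is returned unmapped so that it can be handled separately.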
[0066]Preferably, between step S32 and step S33, the method further comprises: constructing a cross-verification set; verifying the images in each set of the sub-area images, and/or verifying the quality of the images in each set of the sub-area images; where if an image does not belong to its current set, and/or if the image quality score is below a preset value, moving the current image to the cross-verification set.
[0067]Here, various errors, such as improper boundary settings for the sub-areas, may cause images to be attributed to the wrong set. Furthermore, low-quality images can have a detrimental impact when called upon later. Therefore, cross-verification of the images is performed between step S32 and step S33 to remove poor-quality data from the sets of the sub-area images. In the examples of the present invention, there are various methods for image verification. The present invention incorporates by reference, in its entirety, Chinese patent application CN106934799A, entitled “Auxiliary image reading system and method for capsule endoscope”, to verify the images in each set of the sub-area images and determine whether they belong to the current set. In addition, the present invention incorporates by reference, in its entirety, Chinese patent application CN111932532A, entitled “Referenceless image evaluation method for capsule endoscope, electronic device, and medium”, to verify the quality of the images in each set of the sub-area images and determine whether the image quality is adequate. Images with inadequate quality or low quality scores are transferred to the cross-verification set. The scoring in the present invention may be an image quality evaluation score, and/or an image content evaluation score, and/or a composite score, as described in the cited patent application. Further details are not provided here.
[0068]It should be noted that errors may also occur in the process of verifying the images in each set of the sub-area images, and/or verifying the quality of the images in each set of the sub-area images. Therefore, the cross-verification set is retained, and the data in the cross-verification set can be selectively accessed or reclassified later; this is not elaborated further here.
[0069]For step S33, the formed image reading model is visualized in the computer front-end display interface. When a sub-area of the image reading model is selected, the corresponding set of the sub-area images is opened in association with it, and any one of the images in the current set of the sub-area images can be selectively opened.
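The bookkeeping of the cross-verification set can be sketched as follows. The actual verification and scoring methods are those of the cited applications and are not reproduced; here they are stood in for by two caller-supplied functions, `belongs_to` and `quality_score`, which are hypothetical names, as is `min_score` for the preset value. The sketch applies both checks, which is one of the and/or combinations described above.

```python
def cross_verify(sub_area_sets, belongs_to, quality_score, min_score):
    """Split each sub-area set into verified images and a retained
    cross-verification set.

    sub_area_sets -- dict mapping sub-area label to a list of images
    belongs_to    -- callable(image, sub_area) -> bool (placeholder)
    quality_score -- callable(image) -> float (placeholder)
    min_score     -- preset value below which an image is moved out
    """
    verified = {area: [] for area in sub_area_sets}
    cross_verification_set = []
    for area, images in sub_area_sets.items():
        for image in images:
            if belongs_to(image, area) and quality_score(image) >= min_score:
                verified[area].append(image)
            else:
                # Keep the original sub-area so the image can be
                # accessed or reclassified later rather than discarded.
                cross_verification_set.append((area, image))
    return verified, cross_verification_set
```

Storing the original sub-area alongside each moved image keeps the cross-verification set usable for later selective access or reclassification.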
[0070]Preferably, after assigning all images to their respective sets of the sub-area images, the method further comprises: identifying and labeling the attribute for each image in each set of the sub-area images; grouping the images with similar labels in each set of the sub-area images into one group; generating a mapping identifier separately for each group of images with similar labels on the image reading model.
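The grouping and identifier generation described above can be sketched as follows. The attribute recognition itself is not specified in this passage, so it is represented by a caller-supplied `label_of` function; treating "similar labels" as equal labels, and the `marker-N` identifier format, are assumptions made purely for illustration.

```python
from collections import defaultdict

def group_by_label(sub_area_sets, label_of):
    """Group images with the same label within each sub-area set and
    create one mapping identifier per group.

    sub_area_sets -- dict mapping sub-area label to a list of images
    label_of      -- callable(image) -> label (placeholder for the
                     attribute identification step)
    """
    groups = defaultdict(list)
    for area, images in sub_area_sets.items():
        for image in images:
            groups[(area, label_of(image))].append(image)
    # One identifier per (sub-area, label) group, in a stable order.
    identifiers = {
        key: f"marker-{n}" for n, key in enumerate(sorted(groups), start=1)
    }
    return dict(groups), identifiers
```

Each identifier could then be placed on the image reading model so that selecting it opens the corresponding group of images.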
[0071]Here, as illustrated in
[0072]Further, one embodiment of the present invention provides an electronic device, comprising a memory and a processor. The memory stores a computer program that can run on the processor, and the processor executes the program to implement the steps of the method for building the image reading model based on the capsule endoscope.
[0073]Further, one embodiment of the present invention provides a computer-readable storage medium for storing a computer program. The computer program is executed by the processor to implement the steps of the method for building the image reading model based on the capsule endoscope.
[0074]In summary, the present invention provides the method for building the image reading model based on the capsule endoscope, the electronic device, and the readable storage medium, which, by mapping the obtained images onto a 3D model of the working area, enhance the visualization of the examination. Through the various types of mapping, the required images can be conveniently obtained from the simulated image reading model during subsequent use, which enhances interactivity and operability, facilitates observation, saves image reading time, and improves detection efficiency.
[0075]It should be understood that, although the description is presented in terms of embodiments, not every embodiment comprises only one independent technical solution. Those skilled in the art should take the description as a whole, and the technical solutions in the embodiments may also be combined as appropriate to form other embodiments that can be understood by those skilled in the art.
[0076]The series of detailed descriptions set forth above are only specific descriptions of feasible embodiments of the present invention and are not intended to limit the scope of protection of the present invention. On the contrary, many modifications and variations are possible within the scope of the appended claims.
Claims
The invention claimed is:
1. A method for building an image reading model based on a capsule endoscope, comprising:
driving the capsule endoscope to move within a working area, sequentially recording position coordinates and field of view orientations of the capsule endoscope at each positioning point at a predetermined first frequency, and driving the capsule endoscope to sequentially capture images at a predetermined second frequency and record the images;
constructing a 3D model corresponding to an outline of the working area based on the recorded position coordinates of the capsule endoscope at each location point;
mapping the recorded images onto the 3D model to create the image reading model;
wherein constructing a 3D model corresponding to the outer contour of the working area based on the recorded position coordinates of the capsule endoscope at each positioning point comprises:
obtaining all position coordinates of the capsule endoscope to form a raw point cloud data;
generating a 3D model corresponding to the outer contour of the working area after applying Gaussian filtering, voxelization, voxel shell extraction, smoothing filtering, and surface reconstruction sequentially to the raw point cloud data;
wherein the 3D model is represented by Ω(p).
2. The method of
dividing the 3D model into a plurality of sub-areas according to the structure of the working area;
mapping the recorded images to each sub-area to create a set of sub-area images, with each image mapped to a unique sub-area;
merging the set of the sub-area images on the 3D model to form the image reading model.
3. The method of
iterating through each image and obtaining the positioning point with a closest capture time to a current image;
planning a virtual ray using the position coordinates of the obtained positioning point as a starting point and a corresponding field of view orientation as an extending direction, and obtaining an intersection point between the virtual ray and the 3D model;
obtaining the sub-area to which the position coordinates of current intersection point belong, and mapping the current image to the sub-area to which the position coordinates of current intersection point belong to form the set of the sub-area images.
4. The method of
5. The method of
6. The method of
constructing a cross-verification set;
verifying the images in each set of the sub-area images, and/or verifying quality of the images in each set of the sub-area images;
if an image does not belong to a current set, and/or if an image quality score of the image is below a preset value, moving the current image to the cross-verification set.
7. The method of
identifying and labeling an attribute for each image in each set of the sub-area images;
grouping the images with similar labels in each set of the sub-area images into one group;
generating a mapping identifier separately for each group of images with similar labels on the image reading model.
8. An electronic device, comprising
a memory and a processor, wherein
the memory stores a computer program that runs on the processor, and
the processor executes
for building an image reading model based on a capsule endoscope, wherein the method comprises:
driving the capsule endoscope to move within a working area, sequentially recording position coordinates and field of view orientations of the capsule endoscope at each positioning point at a predetermined first frequency, and driving the capsule endoscope to sequentially capture images at a predetermined second frequency and record the images;
constructing a 3D model corresponding to an outline of the working area based on the recorded position coordinates of the capsule endoscope at each location point;
mapping the recorded images onto the 3D model to create the image reading model;
wherein constructing a 3D model corresponding to the outer contour of the working area based on the recorded position coordinates of the capsule endoscope at each positioning point comprises:
obtaining all position coordinates of the capsule endoscope to form a raw point cloud data;
generating a 3D model corresponding to the outer contour of the working area after applying Gaussian filtering, voxelization, voxel shell extraction, smoothing filtering, and surface reconstruction sequentially to the raw point cloud data;
wherein the 3D model is represented by Ω(p).