功德是什么意思| 店铺开业送什么礼物好| 女性肾虚是什么原因导致的| 桃胶有什么作用| 忽必烈和成吉思汗是什么关系| 度化是什么意思| 尿潴留是什么意思| 荣辱与共是什么生肖| 不速之客的速是什么意思| diy是什么| 胸部周围痒是什么原因| 右膝关节退行性变是什么意思| 上天眷顾是什么意思| 内窥镜是做什么检查| 外阴白斑有什么症状| 四叶草代表什么| 肠瘘是什么意思| 什么是刷酸| 脑白质缺血性改变什么意思| 骨折吃什么钙片| 肺阴虚吃什么中成药| 素金是什么意思| 手抖是什么原因| 泡茶用什么杯子最好| 胰岛素过高会导致什么| 不想长胡子有什么办法| 胆结石是什么原因造成的| 胸径是什么意思| 董五行属什么| 感冒喉咙痛吃什么药好| 名落孙山的意思是什么| 鹅口疮是什么样的图片| 什么的船只| 水痘要注意什么| 小腿肌肉抽筋是什么原因引起的| 二胎政策什么时候开放的| 性生活是什么意思| 区域经理的岗位职责是什么| 属马的跟什么属相最配| 吃什么能降铁蛋白| 孢子是什么东西| 36是什么罩杯| 什么是黑天鹅事件| 往来账是什么意思| 脸色暗沉发黑什么原因| 梅子什么时候成熟| 怀孕一个月有点见红是什么情况| 吃什么能治疗早射| 孕期血糖高可以吃什么水果| 胸闷气短挂什么科室| 为什么家里有蟑螂| 人活着有什么意思| 69年鸡是什么命| 结核病是什么病| 花生为什么叫落花生| 尿路感染去医院挂什么科| 仪轨是什么意思| 女同什么意思| 下午5点是什么时辰| 肾病什么东西不能吃| 乙酰氨基酚是什么药| 现在是吃什么水果的季节| 洛神花是什么| 能级是什么意思| 810是什么意思| 缺维生素b有什么症状| 为什么闭眼单脚站不稳| 应接不暇的暇是什么意思| 奇妙的什么| 梦见自己掉牙是什么意思| 飞行模式和关机有什么区别| 拉肚子可以喝什么饮料| 指甲有竖纹是什么原因| 检测hpv挂什么科| 丙肝是什么病| 什么是风水| 胃炎胃溃疡吃什么药| 凹儿念什么| 为什么喝中药会拉肚子| 支原体是什么| 为什么心里老是想着死| 蜂蜜加柠檬有什么功效和作用| 计算机二级什么时候查成绩| 菜板买什么材质的好| 解析是什么意思| 脱肛和痔疮有什么区别| 小孩上户口需要什么材料| 吃什么东西能减肥| 吃什么能养胃| 心跳过快是什么原因引起的| 吃什么对肺有好处| 空囊是什么原因造成的| 家里起火代表什么预兆| 一库是什么意思| 什么叫护理| 尿路感染要吃什么药| 胰岛素针头4mm和5mm有什么区别| 有病是什么意思| 铁扇公主是什么妖精| 感染是什么意思| 舒字属于五行属什么| 吃什么下火| 嘛呢是什么意思| 大荔冬枣什么时候成熟| 无忧是什么意思| 黑管是什么乐器| 神经外科主要看什么病| 腌羊肉串放什么调料| 排卵试纸什么时候测最准确| 什么是硬下疳| 发膜和护发素有什么区别| 拉红色的屎是什么原因| 低脂高钙牛奶适合什么人群| 孕妇能喝什么饮料| 龙蛇混杂是什么生肖| 日加华念什么| 脘痞什么意思| 农历11月14日是什么星座| 酸萝卜别吃什么意思| 颜文字是什么意思| 飒什么意思| 回门是什么意思| 屁股后面骨头疼是什么原因| 为什么会得肿瘤| 肤色不均匀是什么原因| 子宫内膜薄是什么原因造成的| 枭雄的意思是什么| 为什么手比脸白那么多| 五个月宝宝吃什么辅食最好| 心电图可以检查出什么| 梦见生孩子是什么意思| 神经是什么意思| 抄送和密送是什么意思| 总感觉饿是什么原因| 医学cr是什么意思| 吃什么东西减肥| 包拯属什么生肖| 噫气是什么意思| 非萎缩性胃炎伴糜烂是什么意思| 鱼香肉丝为什么叫鱼香肉丝| 胰腺炎的症状是什么| 液体套是什么| 眼睛充血什么原因| 红豆不能和什么一起吃| 早上适合做什么运动| 玛瑙是什么材质| 减肥餐吃什么| 荷尔蒙是什么意思| 长期熬夜吃什么可以补回来| 一级军士长什么待遇| 低血压是什么原因| 痦子是什么| 杂合突变型是什么意思| 什么益生菌能减肥| 发烧为什么感觉冷| 呆若木鸡的意思是什么| 五月掉床有什么说法| 眼压高是什么意思| 吃避孕药有什么危害| 吃什么对皮肤好还能美白的| 666代表什么意思| 口腔溃疡为什么那么痛| 什么是岩茶| 梦见自己出轨是什么意思| 豁出去了什么意思| 大葱喜欢什么肥料| 附件炎吃什么药最好| 脚心疼什么原因| gap是什么意思| 化疗后骨髓抑制是什么意思| 纤支镜主要检查什么| 2029年属什么生肖| 喷塑工是干什么的| jhs空调是什么牌子| 双子座是什么象星座| 女生右眼睛老是跳是什么原因| 婴儿黄疸母亲忌口什么| 白斑是什么| 什么海翻江| nerdy是什么牌子| 复方北豆根氨酚那敏片是什么药| kb是什么意思| 1991年五行属什么| 奥斯卡小金人什么意思| 手足口病是什么症状| 肚子冰凉是什么原因| 爱马仕配货是什么意思| 梦见红色的蛇是什么意思| 腰疼吃什么药效果好| 黄柏泡水喝有什么功效| 荣辱与共是什么意思| 补气血什么季节补最好| 食道癌有什么症状| 01年的属什么| 乳腺钙化灶是什么意思| 县人大主任是什么级别| 什么是居间费| 有什么病| 湿疹和荨麻疹有什么区别| 奥美拉唑是治什么病的| 嬷嬷什么意思| 魔性是什么意思| 梦见吃蜂蜜是什么预兆| 孕激素是什么| 忘情水是什么意思| 事业编有什么好处| 左眼皮一直跳什么原因| 晚上看见蛇有什么预兆| 正常龟头是什么颜色| 胸闷气短吃什么药效果好| 一个口一个甫念什么| iga什么意思| 今年52岁属什么生肖| 短杆菌是什么意思| 狮子座和什么星座最配| 1983年五行属什么| 藏红花适合什么样的人喝| 蝴蝶骨是什么| 橘子什么时候成熟| 身体年龄是什么意思| 俱往矣是什么意思| 二聚体是什么| 眼花是什么原因引起的| rf医学上是什么意思| 什么鸟不能飞| 心窝窝疼是什么原因| 梦到打死蛇是什么意思| 树膏皮是什么皮| 苏打水有什么作用和功效| 出国要办什么证件| 72年鼠是什么命| 宝宝拉肚子吃什么药| 碳13是检查什么的| 正念是什么| 现在是什么时间| 办身份证需要准备什么| 红楼梦是一部什么小说| metoo是什么意思| 为什么会得飞蚊症| 脸书是什么意思| 杏林是什么意思| 火奥念什么| 人做梦是什么原因| 无机磷偏低有什么影响| 歼灭是什么意思| 暴跳如雷是什么意思| 学位证是什么| 利郎男装是什么档次的| no医学上是什么意思| 阿奇霉素和头孢有什么区别| 7月26日是什么日子| 龟头上有小红点是什么| 阴挺是什么意思| 七月十日是什么星座| 胃痛可以吃什么水果| 教师节送什么礼物给老师| 男人吃秋葵有什么好处| 已知晓是什么意思| 蛇和什么相冲| 工薪阶层是什么意思| 人红是非多什么意思| 胃酸不能吃什么食物| 小样什么意思| 6月适合种什么菜| 百度Jump to content

武汉新洲巴徐村村民吃饭不用给钱 一吃就是大半年

From Wikipedia, the free encyclopedia
百度 徐孟南在工作间隙复习。

In computing, online analytical processing (OLAP) (/?o?l?p/), is an approach to quickly answer multi-dimensional analytical (MDA) queries.[1] The term OLAP was created as a slight modification of the traditional database term online transaction processing (OLTP).[2] OLAP is part of the broader category of business intelligence, which also encompasses relational databases, report writing and data mining.[3] Typical applications of OLAP include business reporting for sales, marketing, management reporting, business process management (BPM),[4] budgeting and forecasting, financial reporting and similar areas, with new applications emerging, such as agriculture.[5]

OLAP tools enable users to analyse multidimensional data interactively from multiple perspectives. OLAP consists of three basic analytical operations: consolidation (roll-up), drill-down, and slicing and dicing.[6]:?402–403? Consolidation involves the aggregation of data that can be accumulated and computed in one or more dimensions. For example, all sales offices are rolled up to the sales department or sales division to anticipate sales trends. By contrast, the drill-down is a technique that allows users to navigate through the details. For instance, users can view the sales by individual products that make up a region's sales. Slicing and dicing is a feature whereby users can take out (slicing) a specific set of data of the OLAP cube and view (dicing) the slices from different viewpoints. These viewpoints are sometimes called dimensions (such as looking at the same sales by salesperson, or by date, or by customer, or by product, or by region, etc.).

Databases configured for OLAP use a multidimensional data model, allowing for complex analytical and ad hoc queries with a rapid execution time.[7] They borrow aspects of navigational databases, hierarchical databases and relational databases.

OLAP is typically contrasted to OLTP (online transaction processing), which is generally characterized by much less complex queries, in a larger volume, to process transactions rather than for the purpose of business intelligence or reporting. Whereas OLAP systems are mostly optimized for read, OLTP has to process all kinds of queries (read, insert, update and delete).

Overview of OLAP systems

[edit]

At the core of any OLAP system is an OLAP cube (also called a 'multidimensional cube' or a hypercube). It consists of numeric facts called measures that are categorized by dimensions. The measures are placed at the intersections of the hypercube, which is spanned by the dimensions as a vector space. The usual interface to manipulate an OLAP cube is a matrix interface, like Pivot tables in a spreadsheet program, which performs projection operations along the dimensions, such as aggregation or averaging.

The cube metadata is typically created from a star schema or snowflake schema or fact constellation of tables in a relational database. Measures are derived from the records in the fact table and dimensions are derived from the dimension tables.

Each measure can be thought of as having a set of labels, or meta-data associated with it. A dimension is what describes these labels; it provides information about the measure.

A simple example would be a cube that contains a store's sales as a measure, and Date/Time as a dimension. Each Sale has a Date/Time label that describes more about that sale.

For example:

 Sales Fact Table
+-------------+----------+
| sale_amount | time_id  |
+-------------+----------+            Time Dimension
|       930.10|     1234 |----+     +---------+-------------------+
+-------------+----------+    |     | time_id | timestamp         |
                              |     +---------+-------------------+
                              +---->|   1234  | 20080902 12:35:43 |
                                    +---------+-------------------+

Multidimensional databases

[edit]

Multidimensional structure is defined as "a variation of the relational model that uses multidimensional structures to organize data and express the relationships between data".[6]:?177? The structure is broken into cubes and the cubes are able to store and access data within the confines of each cube. "Each cell within a multidimensional structure contains aggregated data related to elements along each of its dimensions".[6]:?178? Even when data is manipulated it remains easy to access and continues to constitute a compact database format. The data still remains interrelated. Multidimensional structure is quite popular for analytical databases that use online analytical processing (OLAP) applications.[6] Analytical databases use these databases because of their ability to deliver answers to complex business queries swiftly. Data can be viewed from different angles, which gives a broader perspective of a problem unlike other models.[8]

Aggregations

[edit]

It has been claimed that for complex queries OLAP cubes can produce an answer in around 0.1% of the time required for the same query on OLTP relational data.[9][10] The most important mechanism in OLAP which allows it to achieve such performance is the use of aggregations. Aggregations are built from the fact table by changing the granularity on specific dimensions and aggregating up data along these dimensions, using an aggregate function (or aggregation function). The number of possible aggregations is determined by every possible combination of dimension granularities.

The combination of all possible aggregations and the base data contains the answers to every query which can be answered from the data.[11]

Because usually there are many aggregations that can be calculated, often only a predetermined number are fully calculated; the remainder are solved on demand. The problem of deciding which aggregations (views) to calculate is known as the view selection problem. View selection can be constrained by the total size of the selected set of aggregations, the time to update them from changes in the base data, or both. The objective of view selection is typically to minimize the average time to answer OLAP queries, although some studies also minimize the update time. View selection is NP-complete. Many approaches to the problem have been explored, including greedy algorithms, randomized search, genetic algorithms and A* search algorithm.

Some aggregation functions can be computed for the entire OLAP cube by precomputing values for each cell, and then computing the aggregation for a roll-up of cells by aggregating these aggregates, applying a divide and conquer algorithm to the multidimensional problem to compute them efficiently.[12] For example, the overall sum of a roll-up is just the sum of the sub-sums in each cell. Functions that can be decomposed in this way are called decomposable aggregation functions, and include COUNT, MAX, MIN, and SUM, which can be computed for each cell and then directly aggregated; these are known as self-decomposable aggregation functions.[13]

In other cases, the aggregate function can be computed by computing auxiliary numbers for cells, aggregating these auxiliary numbers, and finally computing the overall number at the end; examples include AVERAGE (tracking sum and count, dividing at the end) and RANGE (tracking max and min, subtracting at the end). In other cases, the aggregate function cannot be computed without analyzing the entire set at once, though in some cases approximations can be computed; examples include DISTINCT COUNT, MEDIAN, and MODE; for example, the median of a set is not the median of medians of subsets. These latter are difficult to implement efficiently in OLAP, as they require computing the aggregate function on the base data, either computing them online (slow) or precomputing them for possible rollouts (large space).

Types

[edit]

OLAP systems have been traditionally categorized using the following taxonomy.[14]

Multidimensional OLAP (MOLAP)

[edit]

MOLAP (multi-dimensional online analytical processing) is the classic form of OLAP and is sometimes referred to as just OLAP. MOLAP stores this data in an optimized multi-dimensional array storage, rather than in a relational database.

Some MOLAP tools require the pre-computation and storage of derived data, such as consolidations – the operation known as processing. Such MOLAP tools generally utilize a pre-calculated data set referred to as a data cube. The data cube contains all the possible answers to a given range of questions. As a result, they have a very fast response to queries. On the other hand, updating can take a long time depending on the degree of pre-computation. Pre-computation can also lead to what is known as data explosion.

Other MOLAP tools, particularly those that implement the functional database model do not pre-compute derived data but make all calculations on demand other than those that were previously requested and stored in a cache.

Advantages of MOLAP

  • Fast query performance due to optimized storage, multidimensional indexing and caching.
  • Smaller on-disk size of data compared to data stored in relational database due to compression techniques.
  • Automated computation of higher-level aggregates of the data.
  • It is very compact for low dimension data sets.
  • Array models provide natural indexing.
  • Effective data extraction achieved through the pre-structuring of aggregated data.

Disadvantages of MOLAP

  • Within some MOLAP systems the processing step (data load) can be quite lengthy, especially on large data volumes. This is usually remedied by doing only incremental processing, i.e., processing only the data which have changed (usually new data) instead of reprocessing the entire data set.
  • Some MOLAP methodologies introduce data redundancy.

Products

[edit]

Examples of commercial products that use MOLAP are Cognos Powerplay, Oracle Database OLAP Option, MicroStrategy, Microsoft Analysis Services, Essbase, TM1, Jedox, and icCube.

Relational OLAP (ROLAP)

[edit]

ROLAP works directly with relational databases and does not require pre-computation. The base data and the dimension tables are stored as relational tables and new tables are created to hold the aggregated information. It depends on a specialized schema design. This methodology relies on manipulating the data stored in the relational database to give the appearance of traditional OLAP's slicing and dicing functionality. In essence, each action of slicing and dicing is equivalent to adding a "WHERE" clause in the SQL statement. ROLAP tools do not use pre-calculated data cubes but instead pose the query to the standard relational database and its tables in order to bring back the data required to answer the question. ROLAP tools feature the ability to ask any question because the methodology is not limited to the contents of a cube. ROLAP also has the ability to drill down to the lowest level of detail in the database.

While ROLAP uses a relational database source, generally the database must be carefully designed for ROLAP use. A database which was designed for OLTP will not function well as a ROLAP database. Therefore, ROLAP still involves creating an additional copy of the data. However, since it is a database, a variety of technologies can be used to populate the database.

Advantages of ROLAP

[edit]
  • ROLAP is considered to be more scalable in handling large data volumes, especially models with dimensions with very high cardinality (i.e., millions of members).
  • With a variety of data loading tools available, and the ability to fine-tune the extract, transform, load (ETL) code to the particular data model, load times are generally much shorter than with the automated MOLAP loads.
  • The data are stored in a standard relational database and can be accessed by any SQL reporting tool (the tool does not have to be an OLAP tool).
  • ROLAP tools are better at handling non-aggregable facts (e.g., textual descriptions). MOLAP tools tend to suffer from slow performance when querying these elements.
  • By decoupling the data storage from the multi-dimensional model, it is possible to successfully model data that would not otherwise fit into a strict dimensional model.
  • The ROLAP approach can leverage database authorization controls such as row-level security, whereby the query results are filtered depending on preset criteria applied, for example, to a given user or group of users (SQL WHERE clause).

Disadvantages of ROLAP

[edit]
  • There is a consensus in the industry that ROLAP tools have slower performance than MOLAP tools. However, see the discussion below about ROLAP performance.
  • The loading of aggregate tables must be managed by custom ETL code. The ROLAP tools do not help with this task. This means additional development time and more code to support.
  • When the step of creating aggregate tables is skipped, the query performance then suffers because the larger detailed tables must be queried. This can be partially remedied by adding additional aggregate tables; however it is still not practical to create aggregate tables for all combinations of dimensions/attributes.
  • ROLAP relies on the general-purpose database for querying and caching, and therefore several special techniques employed by MOLAP tools are not available (such as special hierarchical indexing). However, modern ROLAP tools take advantage of latest improvements in SQL language such as CUBE and ROLLUP operators, DB2 Cube Views, as well as other SQL OLAP extensions. These SQL improvements can mitigate the benefits of the MOLAP tools.
  • Since ROLAP tools rely on SQL for all of the computations, they are not suitable when the model is heavy on calculations which don't translate well into SQL. Examples of such models include budgeting, allocations, financial reporting and other scenarios.

Performance of ROLAP

[edit]

In the OLAP industry ROLAP is usually perceived as being able to scale for large data volumes but suffering from slower query performance as opposed to MOLAP. The OLAP Survey[usurped], the largest independent survey across all major OLAP products, being conducted for 6 years (2001 to 2006) have consistently found that companies using ROLAP report slower performance than those using MOLAP even when data volumes were taken into consideration.

However, as with any survey there are a number of subtle issues that must be taken into account when interpreting the results.

  • The survey shows that ROLAP tools have 7 times more users than MOLAP tools within each company. Systems with more users will tend to suffer more performance problems at peak usage times.
  • There is also a question about complexity of the model, measured both in number of dimensions and richness of calculations. The survey does not offer a good way to control for these variations in the data being analyzed.

Downside of flexibility

[edit]

Some companies select ROLAP because they intend to re-use existing relational database tables—these tables will frequently not be optimally designed for OLAP use. The superior flexibility of ROLAP tools allows this less-than-optimal design to work, but performance suffers. MOLAP tools in contrast would force the data to be re-loaded into an optimal OLAP design.

Hybrid OLAP (HOLAP)

[edit]

The undesirable trade-off between additional ETL cost and slow query performance has ensured that most commercial OLAP tools now use a "Hybrid OLAP" (HOLAP) approach, which allows the model designer to decide which portion of the data will be stored in MOLAP and which portion in ROLAP.

There is no clear agreement across the industry as to what constitutes "Hybrid OLAP", except that a database will divide data between relational and specialized storage.[15] For example, for some vendors, a HOLAP database will use relational tables to hold the larger quantities of detailed data and use specialized storage for at least some aspects of the smaller quantities of more-aggregate or less-detailed data. HOLAP addresses the shortcomings of MOLAP and ROLAP by combining the capabilities of both approaches. HOLAP tools can utilize both pre-calculated cubes and relational data sources.

Vertical partitioning

[edit]

In this mode HOLAP stores aggregations in MOLAP for fast query performance, and detailed data in ROLAP to optimize time of cube processing.

Horizontal partitioning

[edit]

In this mode HOLAP stores some slice of data, usually the more recent one (i.e. sliced by Time dimension) in MOLAP for fast query performance, and older data in ROLAP. Moreover, we can store some dices in MOLAP and others in ROLAP, leveraging the fact that in a large cuboid, there will be dense and sparse subregions.[16]

Products

[edit]

The first product to provide HOLAP storage was Holos, but the technology also became available in other commercial products such as Microsoft Analysis Services, Oracle Database OLAP Option, MicroStrategy and SAP AG BI Accelerator. The hybrid OLAP approach combines ROLAP and MOLAP technology, benefiting from the greater scalability of ROLAP and the faster computation of MOLAP. For example, a HOLAP server may store large volumes of detailed data in a relational database, while aggregations are kept in a separate MOLAP store. The Microsoft SQL Server 7.0 OLAP Services supports a hybrid OLAP server

Comparison

[edit]

Each type has certain benefits, although there is disagreement about the specifics of the benefits between providers.

  • Some MOLAP implementations are prone to database explosion, a phenomenon causing vast amounts of storage space to be used by MOLAP databases when certain common conditions are met: high number of dimensions, pre-calculated results and sparse multidimensional data.
  • MOLAP generally delivers better performance due to specialized indexing and storage optimizations. MOLAP also needs less storage space compared to ROLAP because the specialized storage typically includes compression techniques.[15]
  • ROLAP is generally more scalable.[15] However, large volume pre-processing is difficult to implement efficiently so it is frequently skipped. ROLAP query performance can therefore suffer tremendously.
  • Since ROLAP relies more on the database to perform calculations, it has more limitations in the specialized functions it can use.
  • HOLAP attempts to mix the best of ROLAP and MOLAP. It can generally pre-process swiftly, scale well, and offer good function support.

Other types

[edit]

The following acronyms are also sometimes used, although they are not as widespread as the ones above:

  • WOLAP – Web-based OLAP
  • DOLAPDesktop OLAP
  • RTOLAP – Real-time OLAP
  • GOLAP – Graph OLAP[17][18]
  • CaseOLAP – Context-aware Semantic OLAP,[19] developed for biomedical applications.[20] The CaseOLAP platform includes data preprocessing (e.g., downloading, extraction, and parsing text documents), indexing and searching with Elasticsearch, creating a functional document structure called Text-Cube,[21][22][23][24][25] and quantifying user-defined phrase-category relationships using the core CaseOLAP algorithm.

APIs and query languages

[edit]

Unlike relational databases, which had SQL as the standard query language, and widespread APIs such as ODBC, JDBC and OLEDB, there was no such unification in the OLAP world for a long time. The first real standard API was OLE DB for OLAP specification from Microsoft which appeared in 1997 and introduced the MDX query language. Several OLAP vendors – both server and client – adopted it. In 2001 Microsoft and Hyperion announced the XML for Analysis specification, which was endorsed by most of the OLAP vendors. Since this also used MDX as a query language, MDX became the de facto standard.[26] Since September-2011 LINQ can be used to query SSAS OLAP cubes from Microsoft .NET.[27]

Products

[edit]

History

[edit]

The first product that performed OLAP queries was Express, which was released in 1970 (and acquired by Oracle in 1995 from Information Resources).[28] However, the term did not appear until 1993 when it was coined by Edgar F. Codd, who has been described as "the father of the relational database". Codd's paper[1] resulted from a short consulting assignment which Codd undertook for former Arbor Software (later Hyperion Solutions, and in 2007 acquired by Oracle), as a sort of marketing coup.

The company had released its own OLAP product, Essbase, a year earlier. As a result, Codd's "twelve laws of online analytical processing" were explicit in their reference to Essbase. There was some ensuing controversy and when Computerworld learned that Codd was paid by Arbor, it retracted the article. The OLAP market experienced strong growth in the late 1990s with dozens of commercial products going into market. In 1998, Microsoft released its first OLAP Server – Microsoft Analysis Services, which drove wide adoption of OLAP technology and moved it into the mainstream.

Product comparison

[edit]

OLAP clients

[edit]

OLAP clients include many spreadsheet programs like Excel, web application, SQL, dashboard tools, etc. Many clients support interactive data exploration where users select dimensions and measures of interest. Some dimensions are used as filters (for slicing and dicing the data) while others are selected as the axes of a pivot table or pivot chart. Users can also vary aggregation level (for drilling-down or rolling-up) the displayed view. Clients can also offer a variety of graphical widgets such as sliders, geographic maps, heat maps and more which can be grouped and coordinated as dashboards. An extensive list of clients appears in the visualization column of the comparison of OLAP servers table.

Market structure

[edit]

Below is a list of top OLAP vendors in 2006, with figures in millions of US Dollars.[29]

Vendor Global Revenue Consolidated company
Microsoft Corporation 1,806 Microsoft
Hyperion Solutions Corporation 1,077 Oracle
Cognos 735 IBM
Business Objects 416 SAP
MicroStrategy 416 MicroStrategy
SAP AG 330 SAP
Cartesis (SAP) 210 SAP
Applix 205 IBM
Infor 199 Infor
Oracle Corporation 159 Oracle
Others 152 Others
Total 5,700

Open source

[edit]
  • Apache Pinot is used at LinkedIn, Cisco, Uber, Slack, Stripe, DoorDash, Target, Walmart, Amazon, and Microsoft to deliver scalable real time analytics with low latency.[30] It can ingest data from offline data sources (such as Hadoop and flat files) as well as online sources (such as Kafka). Pinot is designed to scale horizontally.
  • Mondrian OLAP server is an open-source OLAP server written in Java. It supports the MDX query language, the XML for Analysis and the olap4j[usurped] interface specifications.
  • Apache Doris is an open-source real-time analytical database based on MPP architecture. It can support both high-concurrency point query scenarios and high-throughput complex analysis.[31]
  • Apache Druid is a popular open-source distributed data store for OLAP queries that is used at scale in production by various organizations.
  • Apache Kylin is a distributed data store for OLAP queries originally developed by eBay.
  • Cubes (OLAP server) is another lightweight open-source toolkit implementation of OLAP functionality in the Python programming language with built-in ROLAP.
  • ClickHouse is a fairly new column-oriented DBMS focusing on fast processing and response times.
  • DuckDB[32] is an in-process SQL OLAP[33] database management system.
  • MonetDB is a mature open-source column-oriented SQL RDBMS designed for OLAP queries.

See also

[edit]

References

[edit]

Citations

[edit]
  1. ^ a b Codd E.F.; Codd S.B. & Salley C.T. (1993). "Providing OLAP (On-line Analytical Processing) to User-Analysts: An IT Mandate" (PDF). Codd & Date, Inc. Retrieved March 5, 2008.[permanent dead link]
  2. ^ "OLAP Council White Paper" (PDF). OLAP Council. 1997. Retrieved March 18, 2008.
  3. ^ Deepak Pareek (2007). Business Intelligence for Telecommunications. CRC Press. pp. 294 pp. ISBN 978-0-8493-8792-0. Retrieved March 18, 2008.
  4. ^ Apostolos Benisis (2010). Business Process Management:A Data Cube To Analyze Business Process Simulation Data For Decision Making. VDM Verlag Dr. Müller e.K. pp. 204 pp. ISBN 978-3-639-22216-6.
  5. ^ Abdullah, Ahsan (November 2009). "Analysis of mealybug incidence on the cotton crop using ADSS-OLAP (Online Analytical Processing) tool". Computers and Electronics in Agriculture. 69 (1): 59–72. Bibcode:2009CEAgr..69...59A. doi:10.1016/j.compag.2009.07.003.
  6. ^ a b c d O'Brien, J. A., & Marakas, G. M. (2009). Management information systems (9th ed.). Boston, MA: McGraw-Hill/Irwin.
  7. ^ Hari Mailvaganam (2007). "Introduction to OLAP – Slice, Dice and Drill!". Data Warehousing Review. Retrieved March 18, 2008.
  8. ^ Williams, C., Garza, V.R., Tucker, S, Marcus, A.M. (1994, January 24). Multidimensional models boost viewing options. InfoWorld, 16(4)
  9. ^ MicroStrategy, Incorporated (1995). "The Case for Relational OLAP" (PDF). Retrieved March 20, 2008.
  10. ^ Surajit Chaudhuri & Umeshwar Dayal (1997). "An overview of data warehousing and OLAP technology". SIGMOD Rec. 26 (1): 65. CiteSeerX 10.1.1.211.7178. doi:10.1145/248603.248616. S2CID 8125630.
  11. ^ Gray, Jim; Chaudhuri, Surajit; Layman, Andrew; Reichart, Don; Venkatrao, Murali; Pellow, Frank; Pirahesh, Hamid (1997). "Data Cube: {A} Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals". J. Data Mining and Knowledge Discovery. 1 (1): 29–53. arXiv:cs/0701155. doi:10.1023/A:1009726021843. S2CID 12502175. Retrieved March 20, 2008.
  12. ^ Zhang 2017, p. 1.
  13. ^ Jesus, Baquero & Almeida 2011, 2.1 Decomposable functions, pp. 3–4.
  14. ^ Nigel Pendse (June 27, 2006). "OLAP architectures". OLAP Report. Archived from the original on January 24, 2008. Retrieved March 17, 2008.
  15. ^ a b c Bach Pedersen, Torben; S. Jensen, Christian (December 2001). "Multidimensional Database Technology". Computer. 34 (12): 40–46. doi:10.1109/2.970558. ISSN 0018-9162.
  16. ^ Kaser, Owen; Lemire, Daniel (2006). "Attribute value reordering for efficient hybrid OLAP". Information Sciences. 176 (16): 2304–2336. arXiv:cs/0702143. doi:10.1016/j.ins.2005.09.005.
  17. ^ "This Week in Graph and Entity Analytics". Datanami. December 7, 2016. Retrieved March 8, 2018.
  18. ^ "Cambridge Semantics Announces AnzoGraph Support for Amazon Neptune and Graph Databases". Database Trends and Applications. February 15, 2018. Retrieved March 8, 2018.
  19. ^ Tao, Fangbo; Zhuang, Honglei; Yu, Chi Wang; Wang, Qi; Cassidy, Taylor; Kaplan, Lance; Voss, Clare; Han, Jiawei (2016). "Multi-Dimensional, Phrase-Based Summarization in Text Cubes" (PDF).
  20. ^ Liem, David A.; Murali, Sanjana; Sigdel, Dibakar; Shi, Yu; Wang, Xuan; Shen, Jiaming; Choi, Howard; Caufield, John H.; Wang, Wei; Ping, Peipei; Han, Jiawei (October 1, 2018). "Phrase mining of textual data to analyze extracellular matrix protein patterns across cardiovascular disease". American Journal of Physiology. Heart and Circulatory Physiology. 315 (4): H910 – H924. doi:10.1152/ajpheart.00175.2018. ISSN 1522-1539. PMC 6230912. PMID 29775406.
  21. ^ Lee, S.; Kim, N.; Kim, J. (2014). "A Multi-dimensional Analysis and Data Cube for Unstructured Text and Social Media". 2014 IEEE Fourth International Conference on Big Data and Cloud Computing. pp. 761–764. doi:10.1109/BDCloud.2014.117. ISBN 978-1-4799-6719-3. S2CID 229585.
  22. ^ Ding, B.; Lin, X.C.; Han, J.; Zhai, C.; Srivastava, A.; Oza, N.C. (December 2011). "Efficient Keyword-Based Search for Top-K Cells in Text Cube". IEEE Transactions on Knowledge and Data Engineering. 23 (12): 1795–1810. doi:10.1109/TKDE.2011.34. S2CID 13960227.
  23. ^ Ding, B.; Zhao, B.; Lin, C.X.; Han, J.; Zhai, C. (2010). "TopCells: Keyword-based search of top-k aggregated documents in text cube". 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010). pp. 381–384. CiteSeerX 10.1.1.215.7504. doi:10.1109/ICDE.2010.5447838. ISBN 978-1-4244-5445-7. S2CID 14649087.
  24. ^ Lin, C.X.; Ding, B.; Han, K.; Zhu, F.; Zhao, B. (2008). "Text Cube: Computing IR Measures for Multidimensional Text Database Analysis". 2008 Eighth IEEE International Conference on Data Mining. pp. 905–910. doi:10.1109/icdm.2008.135. ISBN 978-0-7695-3502-9. S2CID 1522480.
  25. ^ Liu, X.; Tang, K.; Hancock, J.; Han, J.; Song, M.; Xu, R.; Pokorny, B. (March 21, 2013). Greenberg, A.M.; Kennedy, W.G.; Bos, N.D. (eds.). Social Computing, Behavioral-Cultural Modeling and Prediction: 6th International Conference, SBP 2013, Washington, DC, USA, April 2-5, 2013, Proceedings (7812 ed.). Berlin, Heidelberg: Springer. pp. 321–330. ISBN 978-3-642-37209-4.
  26. ^ Nigel Pendse (August 23, 2007). "Commentary: OLAP API wars". OLAP Report. Archived from the original on May 28, 2008. Retrieved March 18, 2008.
  27. ^ "SSAS Entity Framework Provider for LINQ to SSAS OLAP". Archived from the original on September 29, 2011.
  28. ^ Nigel Pendse (August 23, 2007). "The origins of today's OLAP products". OLAP Report. Archived from the original on December 21, 2007. Retrieved November 27, 2007.
  29. ^ Nigel Pendse (2006). "OLAP Market". OLAP Report. Archived from the original on March 30, 1997. Retrieved March 17, 2008.
  30. ^ Yegulalp, Serdar (June 11, 2015). "LinkedIn fills another SQL-on-Hadoop niche". InfoWorld. Retrieved November 19, 2016.
  31. ^ "Apache Doris". Github. Apache Doris Community. Retrieved April 5, 2023.
  32. ^ "An in-process SQL OLAP database management system". DuckDB. Retrieved December 10, 2022.
  33. ^ Anand, Chillar (November 17, 2022). "Common Crawl On Laptop - Extracting Subset Of Data". Avil Page. Retrieved December 10, 2022.

Sources

[edit]

Further reading

[edit]
  • Erik Thomsen. (1997). OLAP Solutions: Building Multidimensional Information Systems, 2nd Edition. John Wiley & Sons. ISBN 978-0-471-14931-6.
马齿菜有什么功效 姑爹是什么意思 墙头草是什么意思 什么邮箱最好用最安全 心口下面疼是什么原因
o型血与a型血生的孩子是什么血型 不骄不躁是什么意思 先天性心脏病是什么原因造成的 流光是什么意思 白果有什么功效与作用
hb医学上是什么意思 肾结水是什么原因造成的 倒拔垂杨柳是什么意思 什么是肌张力 什么可以补肾壮阳
pc材质是什么 梦到被蜜蜂蛰是什么意思 问诊是什么意思 东北人喜欢吃什么菜 淋巴结是什么东西
疑虑是什么意思hcv9jop6ns9r.cn 巳蛇五行属什么jingluanji.com 猫的胡子有什么作用hcv7jop9ns0r.cn 一个m是什么品牌hcv8jop1ns1r.cn 芒果什么时候成熟hcv9jop5ns6r.cn
伊犁在新疆什么位置hcv8jop6ns2r.cn nt和唐筛有什么区别jiuxinfghf.com 早上六点是什么时辰hcv8jop0ns5r.cn 吃生葵花籽有什么好处和坏处吗hcv8jop3ns5r.cn 无花果和什么煲汤好hcv9jop8ns1r.cn
淋巴细胞数高说明什么xinjiangjialails.com 血压高是什么原因引起的jingluanji.com 万象更新是什么意思hcv8jop7ns1r.cn UNIQLO什么牌子hcv8jop5ns1r.cn 贫血缺什么元素hcv9jop5ns1r.cn
宝宝辅食虾和什么搭配hcv8jop1ns8r.cn 一喝牛奶就拉肚子是什么原因hcv9jop7ns3r.cn 桑葚有什么作用hcv8jop1ns2r.cn 白带发黄吃什么药wzqsfys.com 31年属什么生肖gysmod.com
百度